yesterday i modified my network so that different activation functions could be used in each layers, and set it so that the output layer had linear activation functions rather than the typical sigmoid. the network was trained overnight on about 563000 randomly generated training patterns. results don’t seem as good as the sigmoidal outputs… here is the network’s output for the four-note scale:

the last three notes were hit fairly well, but it looks like it completely missed the first one. i think it might be best to go back to the sigmoid outputs for now.
today i need to figure out a better way for checking the network’s error. i am thinking maybe generating 100 training patterns and finding the error for each of those, then averaging. i also need to look more into how to write to files. the way i’m using now (writing as “Package”) output all the data without rounding anything off, but i don’t know if it can easily be imported back into mathematica.


