When using MNIST, the input layer must have 784 units and the output layer must have 10 units. ![]() From now on, I'm going to index matrices by using $A^)^T$.įor this relatively simple example, the only tunable parameters are the network size and the minibatch size (which is typically set to 100).
0 Comments
Leave a Reply. |