When should you not use the Adam optimizer?

1 Answer

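Adam keeps exponential moving averages of the gradients and of the squared gradients (not of the parameters themselves) and uses them to scale the step for each parameter. That works well for many problems, and Adam usually makes fast progress early in training, so raw speed is rarely the reason to avoid it. The costs lie elsewhere. Adam stores two extra values per parameter, so for models with a very large number of parameters the optimizer state can become a real memory burden. It can also generalize worse than plain SGD with momentum on some tasks, which tends to matter more with very small data sets, so in those cases SGD with momentum is worth trying as well.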
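To make the bookkeeping concrete, here is a minimal NumPy sketch of a single Adam update, following the standard published update rule. The function name `adam_step` and the toy objective are just illustrative assumptions, not any library's API. Note that the running averages `m` and `v` are taken over the gradient and the squared gradient, and that each has the same shape as the parameters.

```python
import numpy as np

def adam_step(param, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update. `t` is the 1-based step count (hypothetical helper)."""
    # Exponential moving average of the gradient (first moment).
    m = beta1 * m + (1 - beta1) * grad
    # Exponential moving average of the squared gradient (second moment).
    v = beta2 * v + (1 - beta2) * grad ** 2
    # Bias correction, because m and v start at zero.
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    # Per-parameter scaled step.
    param = param - lr * m_hat / (np.sqrt(v_hat) + eps)
    return param, m, v

# Toy usage: minimize f(x) = x^2, whose gradient is 2x.
x = np.array([1.0])
m = np.zeros_like(x)  # extra state, same shape as x
v = np.zeros_like(x)  # extra state, same shape as x
for t in range(1, 201):
    x, m, v = adam_step(x, 2 * x, m, v, t, lr=0.1)
```

The two state arrays are where the memory overhead comes from: on top of the parameters and their gradients, Adam carries two more arrays of the same size, whereas SGD with momentum carries one and plain SGD carries none.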
...