Better averages for online machine-learning
Averages are used, in some form or other, in many machine-learning algorithms. Stochastic gradient descent is a great example of an average in disguise, thin though the disguise may be.
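To see the disguise, note that the incremental update for a running mean is exactly a stochastic-gradient step on the squared error, taken with a learning rate of 1/n. A minimal sketch of that identity (my illustration, not code from the paper):

    def running_mean_as_sgd(samples):
        """Running mean written as SGD on f(m) = 0.5 * (x - m)**2."""
        m = 0.0
        for n, x in enumerate(samples, start=1):
            grad = m - x           # gradient of 0.5 * (x - m)**2 w.r.t. m
            m -= (1.0 / n) * grad  # SGD step with rate 1/n == incremental mean
        return m

    print(running_mean_as_sgd([2.0, 4.0, 6.0]))  # 4.0, the ordinary mean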
Picking the right kind of average can be critical. When a learning algorithm explores sub-optimal choices, the resulting damage to backed-up state values can persist over epochs, hampering performance. Other kinds of average never converge at all, preventing the algorithm from settling on optimal outcomes.
Here, I officially release a paper on a particular kind of average that's adaptable like an exponential moving average, but has guaranteed convergence like a simple average.
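For intuition only (this is an illustration of the trade-off, not the average proposed in the paper): all of these averages share the update m += alpha * (x - m), and only the schedule for alpha differs. A constant alpha gives an exponential moving average that adapts to recent data but keeps fluctuating under noise; alpha = 1/n gives the simple average, which converges but never forgets early samples; a rate that decays more slowly than 1/n is one standard (Robbins-Monro style) compromise.

    # Illustration only -- not the average proposed in the paper.
    # Every variant applies the same update: m += alpha * (x - m).

    def moving_average(samples, schedule):
        m = 0.0
        for n, x in enumerate(samples, start=1):
            m += schedule(n) * (x - m)
        return m

    ema    = lambda n: 0.1        # constant rate: adaptable, but never converges
    simple = lambda n: 1.0 / n    # simple average: converges, but adapts slowly
    decay  = lambda n: n ** -0.7  # decaying rate: a common compromise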
This average is probably best suited to non-stochastic training, of the kind I mentioned in Action-selection and learning-rates in Q-learning.
Download the paper, An adaptable, convergent moving average for online machine-learning.