I don't know yet, but a wave around a midpoint might even work more generally across datasets than a fixed value. I don't like cosine annealing with restarts and the like; they waste too many steps at the bottom. But this one seems to work because it stays mostly around the LR you want.
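To make the idea concrete, here's a rough sketch of what I mean by a wave around a midpoint, next to a plain cosine-with-restarts for comparison. Everything here is an assumption for illustration: the sine shape, the names `wave_lr`, `cosine_restart_lr`, and `fraction_below`, and the values for `mid_lr`, `rel_amp`, and `period` are made up, not tuned settings.

```python
import math


def wave_lr(step, mid_lr=1e-3, rel_amp=0.5, period=1000):
    """Sine wave centered on mid_lr (hypothetical schedule).

    The LR swings between mid_lr * (1 - rel_amp) and mid_lr * (1 + rel_amp),
    so it never drops near zero as long as rel_amp < 1.
    """
    return mid_lr * (1.0 + rel_amp * math.sin(2.0 * math.pi * step / period))


def cosine_restart_lr(step, max_lr=1e-3, min_lr=0.0, period=1000):
    """Cosine annealing with fixed-period restarts, for comparison only."""
    t = (step % period) / period  # position inside the current cycle, in [0, 1)
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * t))


def fraction_below(schedule, threshold, steps):
    """Fraction of the first `steps` steps where the schedule is under threshold."""
    return sum(schedule(s) < threshold for s in range(steps)) / steps


if __name__ == "__main__":
    low = 0.25e-3  # a quarter of the target LR, chosen arbitrarily
    print("wave below low:  ", fraction_below(wave_lr, low, 1000))
    print("cosine below low:", fraction_below(cosine_restart_lr, low, 1000))
```

With these assumed values the wave never goes under half the midpoint, while the cosine schedule (with `min_lr=0`) spends about a third of each cycle under a quarter of its peak. That's the "wasted steps at the bottom" I'm complaining about.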