flax
Add AdaBelief in flax.optim, which adapts stepsize according to "belief" in gradient, and achieves good generalization, fast convergence and training stability.
#1488
Merged


copybara-service merged 1 commit into main from test_390475374
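
For readers unfamiliar with the "belief" wording in the title, the following is a minimal sketch of the AdaBelief update rule as described in the published algorithm, written in plain Python for a single scalar parameter. It is not the code added by this PR and does not reproduce the flax.optim API. The function names (adabelief_step, run_steps), argument names, and default hyperparameters are assumptions chosen for illustration. The step is scaled by how far the current gradient deviates from its running mean, rather than by the raw squared gradient as in Adam.

```python
import math


def adabelief_step(param, grad, m, s, step,
                   lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-16):
    """One AdaBelief update for a scalar parameter (illustrative sketch).

    m is the running mean of gradients; s is the running mean of the
    squared deviation (grad - m)**2, i.e. the "belief" term.
    step is the 1-based iteration count used for bias correction.
    Hyperparameter defaults here are assumptions, not values from the PR.
    """
    # Exponential moving average of the gradient.
    m = beta1 * m + (1.0 - beta1) * grad
    # Moving average of the squared gap between gradient and its prediction m.
    diff = grad - m
    s = beta2 * s + (1.0 - beta2) * diff * diff + eps
    # Bias correction, as in Adam.
    m_hat = m / (1.0 - beta1 ** step)
    s_hat = s / (1.0 - beta2 ** step)
    # Small s_hat (gradient matches its prediction) gives a larger step.
    param = param - lr * m_hat / (math.sqrt(s_hat) + eps)
    return param, m, s


def run_steps(x0, num_steps, lr=0.05):
    """Apply a few AdaBelief steps to f(x) = x**2 (hypothetical helper)."""
    x, m, s = x0, 0.0, 0.0
    for t in range(1, num_steps + 1):
        grad = 2.0 * x  # derivative of x**2
        x, m, s = adabelief_step(x, grad, m, s, t, lr=lr)
    return x


if __name__ == "__main__":
    print(run_steps(5.0, 3))
```

On the first step the bias-corrected ratio reduces to sign(grad) / beta1, so the parameter moves by about lr / beta1 regardless of the gradient's magnitude.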
google-cla added cla: no
copybara-service force pushed 4 years ago
copybara-service force pushed 4 years ago
copybara-service changed the title from "adabelief in flax" to "Add AdaBelief in flax.optim, which adapts stepsize according to "belief" in gradient, and achieves good generalization, fast convergence and training stability." 4 years ago
copybara-service force pushed 4 years ago
copybara-service force pushed to 0537805b 4 years ago
copybara-service force pushed from 0537805b 4 years ago
Commit bb1f8073: Add AdaBelief in flax.optim, which adapts stepsize according to "beli…
copybara-service force pushed to bb1f8073 4 years ago
copybara-service merged bb1f8073 into main 4 years ago
copybara-service deleted the test_390475374 branch 4 years ago


Reviewers: No reviews
Assignees: No one assigned