Add AdaBelief in flax.optim, which adapts the step size according to the "belief" in the gradient, and achieves good generalization, fast convergence, and training stability. #1488
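For context, below is a minimal sketch of the AdaBelief update rule in plain JAX. This is not the flax.optim implementation added by this PR; the function and state names (`adabelief_update`, `AdaBeliefState`) and the hyperparameter defaults are illustrative assumptions. The key difference from Adam is that the second moment tracks the squared deviation of the gradient from its EMA prediction `m`, rather than the squared gradient itself: a small deviation signals high "belief" in the current gradient direction, yielding a larger effective step.

```python
# Minimal sketch of the AdaBelief update rule in plain JAX.
# NOT the flax.optim implementation from this PR; names and defaults
# here are illustrative assumptions.

import jax
import jax.numpy as jnp
from typing import NamedTuple


class AdaBeliefState(NamedTuple):
    step: jnp.ndarray  # update count, used for bias correction
    m: jnp.ndarray     # first moment: EMA of gradients (as in Adam)
    s: jnp.ndarray     # second moment: EMA of (g - m)^2, the "belief" term


def adabelief_update(param, grad, state,
                     lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-16):
    step = state.step + 1
    # First moment: EMA of the gradient, identical to Adam.
    m = beta1 * state.m + (1.0 - beta1) * grad
    # Second moment: EMA of the squared deviation of the gradient from
    # its EMA prediction m. Small deviation -> high "belief" -> larger step.
    s = beta2 * state.s + (1.0 - beta2) * (grad - m) ** 2 + eps
    # Bias correction, as in Adam.
    m_hat = m / (1.0 - beta1 ** step)
    s_hat = s / (1.0 - beta2 ** step)
    new_param = param - lr * m_hat / (jnp.sqrt(s_hat) + eps)
    return new_param, AdaBeliefState(step=step, m=m, s=s)


# Toy usage: one step on a scalar quadratic.
param = jnp.array(2.0)
state = AdaBeliefState(step=jnp.array(0), m=jnp.zeros(()), s=jnp.zeros(()))
grad = jax.grad(lambda p: (p - 1.0) ** 2)(param)
param, state = adabelief_update(param, grad, state)
```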
copybara-service changed the title from "adabelief in flax" to "Add AdaBelief in flax.optim, which adapts stepsize according to "belief" in gradient, and achieves good generalization, fast convergence and training stability." (4 years ago)