Race Result Details
| Racer | clu0 (clu0) |
| Race Number | 5 |
| Date | Wed, 27 Aug 2025 1:57:12 |
| Universe | magic |
| Speed | 84 WPM |
| Accuracy | 97% |
| Rank | 4th place (out of 8) |
| Opponents | emef (7th place), foresterdaniel (2nd place), seanmor5 (1st place), shawns28 (6th place), vig0 (5th place), xskape (3rd place) |
Text typed:
We validate the performance of Loss-Free Balancing on MoE models with up to 3B parameters trained on up to 200B tokens. Experimental results show that Loss-Free Balancing achieves both better performance and better load balance compared with traditional auxiliary-loss-controlled load balancing strategies.
— (other)
by AUXILIARY-LOSS-FREE LOAD BALANCING STRATEGY FOR MIXTURE-OF-EXPERTS
