Race Result Details |
| Racer | Shawn (shawns28) |
| Race Number | 3 |
| Date | Wed, 27 Aug 2025 1:57:19 |
| Universe | magic |
| Speed | 73 WPM |
| Accuracy | 95.7% |
| Rank | 6th place (out of 8) |
| Opponents | clu0 (4th place), emef (7th place), foresterdaniel (2nd place), seanmor5 (1st place), vig0 (5th place), xskape (3rd place) |
| Text typed | We validate the performance of Loss-Free Balancing on MoE models with up to 3B parameters trained on up to 200B tokens. Experimental results show that Loss-Free Balancing achieves both better performance and better load balance compared with traditional auxiliary-loss-controlled load balancing strategies. |
| Source | AUXILIARY-LOSS-FREE LOAD BALANCING STRATEGY FOR MIXTURE-OF-EXPERTS (other) |
