This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Quanquan Gu
quanquangu.bsky.social
did:plc:wlqlperwhmu6q47gdjjt3sa6
Pretraining will only end once we find the optimal scaling law.
2024-12-14T08:07:01.053Z