This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Josh Susskind
kindsuss.bsky.social
did:plc:cpxxd73hrwut2iibgoduyvdi
Here's a great paper on scaling laws for teacher-student neural network distillation led by @dbusbridge.bsky.social and Apple colleagues. I've often seen people struggle to get distillation working well enough in practical settings, and I expect the insights in this paper can really help!
[contains quote post or other embedded content]
2025-02-14T03:30:05.856Z