Common Crawl Foundation
commoncrawl.bsky.social
did:plc:7w4zm6pfg3k5giduan34rbux
The Common Crawl Foundation, MLCommons, EleutherAI, and Johns Hopkins' Center for Language and Speech Processing have the pleasure of inviting you to register for the 1st shared task on Language Identification for web data.
https://commoncrawl.org/blog/wmdqs-shared-task-on-language-identification
2025-07-21T22:34:09.341Z