This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Morten Vassvik
vassvik.bsky.social
did:plc:egzztun2gcwkch7oaggv74lu
The core idea is to take the 10x10x10 footprint accessed by all threads in the workgroup and split it into chunks, and then fetch these chunks cooperatively in a way that efficiently aligns with subgroup boundaries in such a way that we don't introduce divergence, and then store to shared memory
2024-11-17T22:58:52.348Z