This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Parameter Lab
parameterlab.bsky.social
did:plc:omedfcx75hu7gp6uuldjys7j
We challenge the view that reasoning traces are a safe internal part of a model’s process. Our work shows they can leak information, through both deliberate attacks and accidental leakage.
RTAI: https://researchtrend.ai/papers/2506.15674
ArXiv: arxiv.org/abs/2506.15674
Code: https://github.com/parameterlab/leaky_thoughts
2/2
2025-08-21T15:14:45.431Z