<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"><channel><description>Postdoc at the interpretable deep learning lab at Northeastern University, deep learning, LLMs, mechanistic interpretability</description><link>https://bsky.app/profile/wendlerc.bsky.social</link><title>@wendlerc.bsky.social - Chris Wendler</title><item><link>https://bsky.app/profile/wendlerc.bsky.social/post/3mfmmcemf5c2n</link><description>I am not very disciplined about syncing my Bluesky and X accounts. If you are interested in what I am up to, please check out my X account x.com/wendlerch or website wendlerc.github.io&#xA;https://wendlerc.github.io</description><pubDate>24 Feb 2026 16:41 +0000</pubDate><guid isPermaLink="false">at://did:plc:5te5aeoznqwlt3yady3lcdbi/app.bsky.feed.post/3mfmmcemf5c2n</guid></item><item><link>https://bsky.app/profile/wendlerc.bsky.social/post/3lmcjcjidw22o</link><description>Check out Sheridan’s work on concept induction circuits -- the soft version of induction we were promised a while ago :) &#xA;&#xA;During our multilingual concept patching experiments I have always been wondering whether it is those circuits doing the work. Finally, some evidence:&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>08 Apr 2025 12:51 +0000</pubDate><guid isPermaLink="false">at://did:plc:5te5aeoznqwlt3yady3lcdbi/app.bsky.feed.post/3lmcjcjidw22o</guid></item><item><link>https://bsky.app/profile/wendlerc.bsky.social/post/3lkvxo3fxnk2p</link><description>In case you ever wondered what you could do if you had SAEs for intermediate results of diffusion models, we trained SDXL Turbo SAEs on 4 blocks for you. We noticed that they specialize into a &#34;composition&#34;, a &#34;detail&#34;, and a &#34;style&#34; block. 
And one that is hard to make sense of.</description><pubDate>21 Mar 2025 19:39 +0000</pubDate><guid isPermaLink="false">at://did:plc:5te5aeoznqwlt3yady3lcdbi/app.bsky.feed.post/3lkvxo3fxnk2p</guid></item><item><link>https://bsky.app/profile/wendlerc.bsky.social/post/3lknwx7h2fc22</link><description>Apply to Akhil&#39;s lab, he is great!&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>18 Mar 2025 15:04 +0000</pubDate><guid isPermaLink="false">at://did:plc:5te5aeoznqwlt3yady3lcdbi/app.bsky.feed.post/3lknwx7h2fc22</guid></item><item><link>https://bsky.app/profile/wendlerc.bsky.social/post/3lifahdiw6s2f</link><description>This seems like an elegant idea!&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>17 Feb 2025 17:10 +0000</pubDate><guid isPermaLink="false">at://did:plc:5te5aeoznqwlt3yady3lcdbi/app.bsky.feed.post/3lifahdiw6s2f</guid></item><item><link>https://bsky.app/profile/wendlerc.bsky.social/post/3ld6fxnim3s2h</link><description>The resources you find online on transformers are just next level... My jaw dropped when I first stumbled upon this video series: https://www.youtube.com/watch?v=V3NQaDR3xI4&amp;list=PLoyGOS2WIonajhAVqKUgEMNmeq3nEeM51</description><pubDate>13 Dec 2024 08:54 +0000</pubDate><guid isPermaLink="false">at://did:plc:5te5aeoznqwlt3yady3lcdbi/app.bsky.feed.post/3ld6fxnim3s2h</guid></item><item><link>https://bsky.app/profile/wendlerc.bsky.social/post/3lbsiayjet22b</link><description>bit grumpy but great summary of the tokenformer paper&#xA;&#xA;https://www.youtube.com/watch?v=gfU5y7qCxF0</description><pubDate>25 Nov 2024 21:38 +0000</pubDate><guid isPermaLink="false">at://did:plc:5te5aeoznqwlt3yady3lcdbi/app.bsky.feed.post/3lbsiayjet22b</guid></item><item><link>https://bsky.app/profile/wendlerc.bsky.social/post/3lbevptbqjk2x</link><description>In case you also wondered how to derive the maximal update parametrisation (muP) learning rate for Adam: 
I did a short write-up: tinyurl.com/mup-for-adam. Thanks Ilia Badanin and Eugene Golikov for your help on this.&#xA;https://tinyurl.com/mup-for-adam</description><pubDate>20 Nov 2024 12:02 +0000</pubDate><guid isPermaLink="false">at://did:plc:5te5aeoznqwlt3yady3lcdbi/app.bsky.feed.post/3lbevptbqjk2x</guid></item></channel></rss>