<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"><channel><description>Co-Founder at Zentropi (Trustworthy AI). Formerly Meta Civic Integrity Founder, Google X and Google Civic Innovation Lead, and Groq CPO.</description><link>https://bsky.app/profile/samidh.bsky.social</link><title>@samidh.bsky.social - Samidh</title><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mllogu6a522o</link><description>Check out how the @oversightboard.bsky.social used Zentropi to better understand how child marriage-related content manifests on Meta&#39;s platforms. Fantastic example of how advanced content labeling technologies can strengthen both our online and offline world. https://blog.zentropi.ai/how-the-oversight-board-uses-zentropi-to-study-policy-impact-at-scale/</description><pubDate>11 May 2026 16:18 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mllogu6a522o</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mjhvha4ygk2x</link><description>It has been incredible partnering with character.ai since the very start of zentropi.ai. We&#39;re excited to share some details of that partnership with this case study. Anyone creating AI-powered systems might find it interesting! 
https://blog.zentropi.ai/how-zentropi-partners-with-character-ai/</description><pubDate>14 Apr 2026 17:23 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mjhvha4ygk2x</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mhdxbjecls2m</link><description>One of the things we&#39;ve been thinking about a lot at Zentropi is: what happens when AI agents need to make judgment calls about content — not humans reviewing a queue, but agents acting autonomously?</description><pubDate>18 Mar 2026 16:54 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mhdxbjecls2m</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mgqfa2tdws2j</link><description>There&#39;s a major gap in content safety tooling: classifiers typically only score complete text. When you&#39;re working with generative AI, &#34;complete text&#34; means the user already saw it. That&#39;s too late.&#xA;&#xA;So we built a streaming classifier that we&#39;re releasing today! Here&#39;s what we did and why. &#xA;&#xA;🧵...</description><pubDate>10 Mar 2026 22:11 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mgqfa2tdws2j</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mfabhegvoc2r</link><description>Zentropi is now integrated into Coop, @roost.tools&#39;s open source moderation platform. 
You can write a content policy in plain English on Zentropi, plug it into Coop as a signal, and have a moderation pipeline running in minutes.</description><pubDate>19 Feb 2026 18:55 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mfabhegvoc2r</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mddwadrhcs2g</link><description>I can has cats.&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>26 Jan 2026 18:55 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mddwadrhcs2g</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mddtdoi3e22o</link><description>Just shipped Zentropi&#39;s most requested feature: image classification!&#xA;&#xA;Now analyze images against your own policies, at scale. &#xA;&#xA;To power it we built cope-b-12b, a new multimodal model w/ native vision.&#xA;&#xA;Check out the cat detector we made in &lt; 1 min. 🐱&#xA;https://blog.zentropi.ai/zentropi-now-labels-images/</description><pubDate>26 Jan 2026 18:03 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mddtdoi3e22o</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mcxsseejm22y</link><description>If you are looking for a technical description of how X rots your brain, look no further than their github post on the &#39;X algorithm&#39;. It is pure, unadulterated behavioral engagement maximization that amplifies the very worst human impulses. https://github.com/xai-org/x-algorithm</description><pubDate>21 Jan 2026 23:21 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mcxsseejm22y</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mciclpq6zs2l</link><description>Why are we just giving away all our secrets? 
Well, it is our hope that it helps the ecosystem further advance the state of the art in policy-steerable content classification, which is foundational to a more trustworthy internet.&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>15 Jan 2026 19:21 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mciclpq6zs2l</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3mcdfiukgas2v</link><description>Dave just published a Zentropi labeler that can precisely identify requests aimed at prompting an AI model to undress a person in a photo. The tools exist to easily deal with this problem -- platforms just need to choose to use them. If you are the developer of an AI system, please use this guardrail!&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>13 Jan 2026 20:30 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3mcdfiukgas2v</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3m72ammvmcc2a</link><description>This was such a cool experiment that I created a Zentropi labeler with a simplified version of the authors&#39; Partisan Animosity criteria. Now anyone can experiment directly with using this labeler to try to reduce the temperature of affective polarization in their feeds. https://zentropi.ai/labelers/b3044134-88e5-4ff8-9f4c-b7387d693b39&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>03 Dec 2025 00:53 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3m72ammvmcc2a</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3m5kaooizmc2w</link><description>We just wrote an in-depth post about Toxic Content labeling. It presents a new way of defining toxic speech online and illustrates the importance of observable features for accurate language model interpretability. 
Would love to hear how YOU define toxicity, too! https://blog.zentropi.ai/observations-on-toxicity/</description><pubDate>13 Nov 2025 22:47 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3m5kaooizmc2w</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3m5feazxuhs2z</link><description>Awesome to see how this is already being used! One of the most useful aspects is that the published policies show what it takes to write content rules that can be accurately interpreted by language models. We hope this can be a boost to the broader content policy community.&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>12 Nov 2025 00:07 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3m5feazxuhs2z</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3m5ctbuum622u</link><description>This was a fun launch! It turns Zentropi into a Github for Content Labelers. You can share content policies with others and build off each other&#39;s work. It&#39;s the easiest way of deploying a fully customizable classifier. Check out the policies @dwillner.bsky.social created at zentropi.ai/u/dave&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>10 Nov 2025 23:58 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3m5ctbuum622u</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3lxeihyq6nk2z</link><description>This response to the Raine tragedy from OpenAI does something remarkable: it has the humility to acknowledge that a *product failure* led to real-world harm. Despite horrific circumstances, it has a rare degree of honesty that I wish tech companies would show more often. 
https://openai.com/index/helping-people-when-they-need-it-most/</description><pubDate>27 Aug 2025 07:19 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3lxeihyq6nk2z</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3lwrbvpovjs2o</link><description>We are opening up Zentropi.ai to everyone today so that anyone can build their own content labeler. What started as a crazy academic idea 2 years ago is now a real thing that companies are using in production to safeguard their AI-powered systems. Give it a shot! https://blog.zentropi.ai/zentropi-build-your-own-content-labeler-in-minutes-not-months/</description><pubDate>19 Aug 2025 16:01 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3lwrbvpovjs2o</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3lvc46pfcb22o</link><description>Don&#39;t take our word for it! Go kick the tires at zentropi.ai and build your own content labeler  (no subscription required!)&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>31 Jul 2025 21:43 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3lvc46pfcb22o</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3lvc432g6xk2o</link><description>@mmasnick.bsky.social I have a bluesky demo for you that you might want to see :)&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>31 Jul 2025 21:41 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3lvc432g6xk2o</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3lugsrhqcf22k</link><description>So excited for #TrustCon this week! We will be publicly unveiling Zentropi, a platform that helps people instantly build their own content labelers. 
We&#39;ll be opening it up for early access and open sourcing the underlying language model we trained for the task so that it is accessible to everyone.</description><pubDate>21 Jul 2025 01:13 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3lugsrhqcf22k</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3lub6oihxt22p</link><description>I expect @dwillner.bsky.social  to run around like a maniac again at #Trustcon this year as he shows off Zentropi -- our platform that makes it simple to build your own CoPE-powered content labeler.&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>18 Jul 2025 19:30 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3lub6oihxt22p</guid></item><item><link>https://bsky.app/profile/samidh.bsky.social/post/3lfxnngruy227</link><description>The splinternet accelerates. If this stands, look for more countries in 2025 to ban Facebook, Instagram, YouTube, etc. out of fears of American surveillance. https://www.bloomberg.com/news/articles/2025-01-17/tiktok-ban-law-is-upheld-by-the-us-supreme-court</description><pubDate>17 Jan 2025 20:39 +0000</pubDate><guid isPermaLink="false">at://did:plc:zfekat7e2s7iftys552ro7b4/app.bsky.feed.post/3lfxnngruy227</guid></item></channel></rss>