<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"><channel><description>Large Language models, control systems, mushrooms and life in general :)&#xA;&#xA;@ALU-Freiburg, DE</description><link>https://bsky.app/profile/reanfds.bsky.social</link><title>@reanfds.bsky.social - Rean Fernandes</title><item><link>https://bsky.app/profile/reanfds.bsky.social/post/3lcicmvqetk2x</link><description>Reading The Hitchhiker’s Guide to Testing Statistical Significance in NLP (aclanthology.org/P18-1128.pdf) and it confirms what I’ve felt: so much of LLM evaluation, especially accuracy metrics, feels like a vibe-check. Wish I’d taken high school stats more seriously :0&#xA;https://media.tenor.com/b-mWVHaNvjAAAAAC/cerebro-explosion.gif?hh=232&amp;ww=350</description><pubDate>04 Dec 2024 13:56 +0000</pubDate><guid isPermaLink="false">at://did:plc:7uaaqlg3dgfelp5rvarogdml/app.bsky.feed.post/3lcicmvqetk2x</guid></item><item><link>https://bsky.app/profile/reanfds.bsky.social/post/3lamd2qxxw22f</link><description>When I’m not pulling my hair out trying to read NLP papers or working like Sisyphus on an incredibly obfuscated,codebase that I wrote down in a coffee addled bender months ago, I like to make terrariums. This baby was born 8 days ago, and she’s so beautiful. My friend Jake named her Penelope 🤗</description><pubDate>10 Nov 2024 17:24 +0000</pubDate><guid isPermaLink="false">at://did:plc:7uaaqlg3dgfelp5rvarogdml/app.bsky.feed.post/3lamd2qxxw22f</guid></item></channel></rss>