<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"><channel><description>CTO building #AI</description><link>https://bsky.app/profile/m9e.bsky.social</link><title>@m9e.bsky.social - Matt Wallace</title><item><link>https://bsky.app/profile/m9e.bsky.social/post/3mf3wgcz4sc2g</link><description>This has to have been the most momentous GenAI month since 11/22. &#xA;&#xA;The best SOTA frontier models drop, and they&#39;re amazing.&#xA;4 massive open models drop and they&#39;re closer than ever to the lead.&#xA;OpenClaw blows up and creates a zeitgeist.&#xA;&#xA;Man, what a time to be alive.</description><pubDate>18 Feb 2026 01:27 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3mf3wgcz4sc2g</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3mf3vsslqis2g</link><pubDate>18 Feb 2026 01:16 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3mf3vsslqis2g</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lsfev5dpk22l</link><description>Love this detail. reuters on the ruling: https://www.reuters.com/legal/litigation/anthropic-wins-key-ruling-ai-authors-copyright-lawsuit-2025-06-24/&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>25 Jun 2025 00:41 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lsfev5dpk22l</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lquwmryxrs2n</link><description>as an AI founder who is ridiculously all in, when I’m like flabbergasted by the temerity of your product I just can’t even…</description><pubDate>05 Jun 2025 18:18 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lquwmryxrs2n</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3loygctkq5k2n</link><description>Suno is so fucking good. 
Wow.</description><pubDate>12 May 2025 16:47 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3loygctkq5k2n</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lolqphse522d</link><description>The year is 2030. nVidia announces new DGX with attached Fusion reactor.</description><pubDate>07 May 2025 15:48 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lolqphse522d</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lojdpimz4c24</link><description>Claude did not understand the mission and tried to write his system prompt to Notion. haha!</description><pubDate>06 May 2025 16:50 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lojdpimz4c24</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3loiythwkos2i</link><description>Home setup: rtx3090+4090. testing AWQ Qwen3; both 32B and 32B w/ 4B speculative decoding&#xA;&#xA;TL;DR: ⚠️ on spec in vanilla vllm. w/ a 75% token acceptance generation in batch-4 went from 150t/s (no speculative) -&gt; 50t/s (w/ speculative)&#xA;&#xA;I&#39;d get 400t/s+ max tput w/out speculative, larger batch</description><pubDate>06 May 2025 13:36 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3loiythwkos2i</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3llre5yvcu223</link><description>Wild and wonderful watching a keynote demo 100ft wide and being able to literally picture the code in your head. 
😁</description><pubDate>01 Apr 2025 17:04 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3llre5yvcu223</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3llc5ew5mhs2v</link><description>Every schema for content should have a human vs ai flag (or perhaps an enum with human, ai, and then some hybrid roles).</description><pubDate>26 Mar 2025 15:53 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3llc5ew5mhs2v</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lkqtqao2t22u</link><description>Jensen: &#34;I&#39;d never buy a hopper!&#34;&#xA;&#xA;Azure: &#34;We don&#39;t have any Ampere GPUs to turn up even.&#34;&#xA;&#xA;Me: 😠</description><pubDate>19 Mar 2025 18:45 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lkqtqao2t22u</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lkfk6xcug22a</link><description>640KB ought to be enough for anybody.</description><pubDate>15 Mar 2025 06:55 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lkfk6xcug22a</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lk44reguw22j</link><description>It&#39;s interesting because the dials of draft model, draft model quant, --draft length, they all play into the sweet spot. and it&#39;s clearly not super consistent. like I was playing with 32B-Q8 w/ 3B vs 7B 4KL drafts; with those the default --draft 16 seems like the sweet spot. 
(&gt;12, &gt;32 informal test)&#xA;&#xA;[contains quote post or other embedded content]</description><pubDate>11 Mar 2025 13:01 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lk44reguw22j</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lk4463qnsc2j</link><description>32B Coder-Q8 w/ and w/out 7B-Q4_K_L draft - PSA speculative decoding is in llamacpp and works. (depending on your hardware, experiment w/ diff model sizes - ymmv, varies wildly)</description><pubDate>11 Mar 2025 12:50 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lk4463qnsc2j</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3ljao3qkfbs2r</link><description>Reward function engineer</description><pubDate>28 Feb 2025 14:56 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3ljao3qkfbs2r</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lixl2sliis2k</link><description>ok. told claude 3.7 to extract some settings mgmt components from an app layer into generic fastapi router+react component. I expected that to work. 
Him writing this beautiful readme with emojis I did *NOT* expect.</description><pubDate>25 Feb 2025 00:08 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lixl2sliis2k</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lixkqrsiz22s</link><description>continue&#xA;&#xA;continue&#xA;&#xA;continue&#xA;&#xA;continue&#xA;&#xA;continue, and btw, I think you were interrupted and may have already written some stuff you can&#39;t see&#xA;&#xA;continue&#xA;&#xA;continue&#xA;&#xA;ifykyk</description><pubDate>25 Feb 2025 00:02 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lixkqrsiz22s</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lil2obeocs2u</link><description>The universe came with a game genie we just had to assemble? https://azure.microsoft.com/en-us/blog/quantum/2025/02/19/microsoft-unveils-majorana-1-the-worlds-first-quantum-processor-powered-by-topological-qubits/</description><pubDate>20 Feb 2025 00:43 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lil2obeocs2u</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3li6tqcruw227</link><description>Agent-Oriented Programming</description><pubDate>15 Feb 2025 04:07 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3li6tqcruw227</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lhtwksyrl22p</link><description>Asked ChatGPT to compare and contrast the coconut (reasoning in latent space with reasoning tokens &amp; passing hidden states) to the new recurrent depth paper.&#xA;&#xA;https://chatgpt.com/share/67aa5973-3cb8-800f-a75e-c8515a6c727b&#xA;&#xA;Coolest thing was GPT suggesting a hybrid approach which I had been thinking before I even got to the bottom</description><pubDate>10 Feb 2025 19:58 +0000</pubDate><guid 
isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lhtwksyrl22p</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lhkt7p3kbs26</link><description>I live in a world of language all day and startup land is not conducive to a lot of distraction, but damn if every time I pop into Suno I&#39;m not blown away.</description><pubDate>07 Feb 2025 05:04 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lhkt7p3kbs26</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lheg33hm6s2b</link><description>Based purely on the speed I feel like o3-mini is getting a solid reception. The tps is like 1/5th what it was last week</description><pubDate>04 Feb 2025 15:53 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lheg33hm6s2b</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lhbrkdo45k26</link><description>deep research seems immediately useful and may have just solved an annoying sglang issue with a particular model for me.</description><pubDate>03 Feb 2025 14:40 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lhbrkdo45k26</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lgypykucbk2y</link><description>🤣❤️</description><pubDate>31 Jan 2025 00:18 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lgypykucbk2y</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lgikpfs6oc2s</link><description>heheh.</description><pubDate>24 Jan 2025 14:01 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lgikpfs6oc2s</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lg6nx3hgh226</link><description>Feel like the &#34;peak of inflated expectations&#34; call may age poorly, although the engineering to really 
run AI apps is bootstrapping fast into reality, so maybe it isn&#39;t that expectations go down but that eng catches up some?</description><pubDate>20 Jan 2025 15:33 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lg6nx3hgh226</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lfflkh2pr22p</link><pubDate>10 Jan 2025 16:13 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lfflkh2pr22p</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lfdkgfuw3c22</link><description>Happy to report we @kamiwaza-ai.bsky.social have raised $11m to speed up enterprise AI with our stack- and I’m hiring.&#xA;&#xA;https://www.geekwire.com/2025/seattle-vcs-invest-in-kamiwaza-a-new-enterprise-software-startup-helping-companies-adopt-ai/</description><pubDate>09 Jan 2025 20:48 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lfdkgfuw3c22</guid></item><item><link>https://bsky.app/profile/m9e.bsky.social/post/3lfaklubvy22t</link><description>Ok, I tinkered with Suno v4 this morning and holy shit.  We are living in the best timeline.</description><pubDate>08 Jan 2025 16:13 +0000</pubDate><guid isPermaLink="false">at://did:plc:2hy2pr5qmv2li3k5xuxbnprx/app.bsky.feed.post/3lfaklubvy22t</guid></item></channel></rss>