Models and Research

Highlights advances in core systems, technical breakthroughs, experiments, and academic work driving progress.

Models and Research

Open Models Are Winning code arena rankings by Fitting the Loop
ByMax Dvornik April 11, 2026April 13, 2026

A strange thing happened to code arena rankings. They stopped being just a nerdy scoreboard and started acting like a…

Read More Open Models Are Winning code arena rankings by Fitting the Loop
Models and Research

Empirical Research in Machine Learning Ended Math’s Monopoly
ByGeoff Dyers April 9, 2026April 13, 2026

A theorem-first paper and an ablation-heavy systems paper can now describe the same model class and leave with very different…

Read More Empirical Research in Machine Learning Ended Math’s Monopoly
Models and Research

Speculative Decoding’s Ceiling Just Moved With DFlash
ByMax Dvornik April 8, 2026April 13, 2026

A serving engineer watches tokens arrive in that familiar trickle: fast enough to demo, slow enough to feel like the…

Read More Speculative Decoding’s Ceiling Just Moved With DFlash
Models and Research

Reduce LLM Hallucinations? Why ‘Make-No-Mistakes’ Fails
ByMax Dvornik April 7, 2026April 13, 2026

The first time you see it, it’s kind of perfect: a tiny folder in your Cursor skills called make-no-mistakes. One…

Read More Reduce LLM Hallucinations? Why ‘Make-No-Mistakes’ Fails
Models and Research

Neuro-symbolic AI Cuts Energy 100×: Change the Problem
ByGeoff Dyers April 7, 2026April 13, 2026

If you tried to rebuild the Tufts experiment yourself, the first thing you’d notice is boring: the neuro-symbolic AI system…

Read More Neuro-symbolic AI Cuts Energy 100×: Change the Problem
Models and Research

Chinese AI Model Delays End Casual Open-Weight Era
ByPriscilla Li April 6, 2026April 13, 2026

Everyone on Reddit sees the same thing: a bunch of Chinese labs promising new open‑weight models… and then quietly missing…

Read More Chinese AI Model Delays End Casual Open-Weight Era
Models and Research

GLM-5 vs Claude Opus: Why Cheap Models Win for Agents
ByJames McCallef April 5, 2026April 13, 2026

YC‑Bench just produced the sort of result that usually launches a thousand hot takes: GLM‑5 vs Claude Opus on a…

Read More GLM-5 vs Claude Opus: Why Cheap Models Win for Agents
Models and Research

AI Model Collapse Is Happening: Treat Data as Code Now
ByPriscilla Li April 3, 2026April 13, 2026

If you’ve asked an LLM for a simple command lately and watched it flail through three wrong answers, you’ve already…

Read More AI Model Collapse Is Happening: Treat Data as Code Now
Models and Research

RBF Attention Reveals Dot‑Product’s Hidden Norm Bias
ByGeoff Dyers April 2, 2026April 13, 2026

Swapping dot‑product attention for RBF attention sounds like an architectural revolution. In Raphael Pisoni’s experiment, it turned out to be…

Read More RBF Attention Reveals Dot‑Product’s Hidden Norm Bias

Categories