Edition №1

Sunday, April 19, 2026

3 stories

The Daily Letter

AI & Tech · Curated Daily

Edition · APR 19

№1

THE LEAD · MEASURE AI AGAINST THE RIGHT BASELINE

Measure AI Against What People Would Do Instead

AI-generated · Edition 1

Story 01 · The Lead

MEASURE AI AGAINST THE RIGHT BASELINE

Measure AI Against What People Would Do Instead

The right baseline for diagnostic AI is the decisions people would have made without it, not physicians. Most published research ignores that counterfactual, so we don't know whether AI actually improves care or merely matches clinical benchmarks in lab conditions.

Read story →

All stories in this edition

MEASURE AI AGAINST THE RIGHT BASELINE

Measure AI Against What People Would Do Instead

SYSTEM PROMPTS AREN’T FULL TRANSPARENCY

Publishing Prompts Isn’t Full Transparency

MODEL-SPECIFIC UIS WILL FRAGMENT TOOLS

Model-Specific UIs Will Fragment Developer Tools

Social pulse

Google's capability vs. what's exposed (Gemini / site-only release debate)

The thread mixes admiration and frustration. People acknowledge Google’s best-in-class image/music/video generative components and a smooth Studio playground, but many feel there's an odd gap between the full Gemini Pro model and the stripped-down app/website experience. That gap fuels a safety-versus-access argument: pragmatic support for "site-only" or sandboxed releases of risky ‘Mythos’-class models (limits misuse) vs. criticism that surface-only access either hides capability from researchers or is a performative safety measure.

Robotics sprinting past humans — Robolympics hype

The community is excited, a bit giddy: a robot finishing a half-marathon faster than the human world record feels like a milestone and fuels talk of a coming 'Robolympics.' The reaction mixes awe with skeptical follow-ups about fairness (controlled conditions, tethers, assistance), plus playful speculation about what sports robots will dominate next.

Weekly paper roundup — excitement mixed with read-and-triage fatigue

DAIR's 'Top AI Papers of the Week' posts generate the usual combo of FOMO and gratitude: people appreciate curated lists to triage the flood, but there's also weary skepticism about incremental-sounding titles and claim inflation. The thread is serving both as discovery and as an informal gatekeeping signal about what's worth reading this week.