Back Original

The Hallway Track No. 013: Provenance vocabulary, Wikipedia contamination, OpenAI rare-disease diagnoses, ARA paper format

This week's links converge on a single question: what infrastructure needs to exist before AI tools for research can be trustworthy? Carpenter names the vocabulary problem. We keep saying "citation" when we mean five different things. Northwestern quantifies how long broken provenance persists on Wikipedia (median: 3.68 years). And the policy landscape keeps fragmenting: a survey of 802 journals shows STEM and humanities drawing completely different lines on AI in peer review. Meanwhile, the artifact itself is mutating: two arXiv papers this week propose papers explicitly designed for AI consumption. And NEJM AI publishes the first peer-reviewed case of OpenAI's o3 helping Boston Children's clinicians solve previously-unsolved pediatric rare disease cases, a concrete capability demonstration against the "ruin" narrative Nature surveyed in social sciences. The shared frameworks aren't materializing, but the practice isn't waiting.

Provenance & the Record

Peer Review & Policy

Capability & Community