A new study of clinical AI agents reveals that reinforcement learning from automated feedback fails dramatically on healthcare tasks, with pure RL reaching only 18.2% accuracy versus 34.1% for supervised methods. Researchers identified two structural obstacles: capability ceilings where models have zero base performance on certain task types, and format-knowledge barriers requiring exact clinical codes that exploration cannot discover.
Why it matters: As healthcare systems increasingly deploy AI agents to execute clinical protocols, understanding why RL fails—and prescribing hybrid approaches combining supervised learning with RL—is essential for building reliable medical decision-support systems.