OpenAI's HealthBench shows AI's medical advice is improving - but who will listen?
The HealthBench test can't possibly tell us the critical factor: How humans would respond to chatbots under real-world conditions.

The HealthBench test can't possibly tell us the critical factor: How humans would respond to chatbots under real-world conditions.