A randomized study of 1,298 UK adults found that while large language models perform well on medical tasks alone, they do not improve and can worsen decision-making when used by the public. Failures ...