AI models can't distinguish between users’ beliefs and facts, which could be really detrimental in medical settings where patients could have incorrect beliefs about their medical conditions. All models struggle on tasks involving false beliefs reported in the first-person. https://spectrum.ieee.org/ai-reasoning-failures?utm_source=mastodon&utm_medium=social&utm_campaign=fedica-Mastodon-Daily-Pipeline
