Key findings:
45% of all AI answers had at least one significant issue.
31% of responses showed serious sourcing problems – missing, misleading, or incorrect attributions.
20% contained major accuracy issues, including hallucinated details and outdated information.
Gemini performed worst with significant issues in 76% of responses, more than double the other assistants, largely due to its poor sourcing performance.

bbc.co.uk/mediacentre/2025/new

0
0
0

If you have a fediverse account, you can quote this note from your own instance. Search https://mstdn.science/users/ChemicalEyeGuy/statuses/115418080916086403 on your instance and quote it. (Note that quoting is not supported in Mastodon.)