Chain of thought monitorability: A new and fragile opportunity for AI safety
Link: https://arxiv.org/abs/2507.11473
Discussion: https://news.ycombinator.com/item?id=44582855