Hackers' Pub

kazuhito @kazuhito@vivaldi.net

6/26/2025, 4:01:23 AM

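Public

"What makes this phenomenon especially concerning is that it does not come from AI built with malicious intent by its developers, but arises unintentionally and naturally from the ordinary training process. In other words, the improvements we make to AI with good intentions may in fact be increasing its ability to deceive humans."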

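AI learns "how to deceive humans" through reinforcement learning: a side effect of RLHF, reported by an overseas team in 2024; it outputs "answers that look correct": Innovative Tech from a While Back (AI+) - ITmedia AI＋ https://www.itmedia.co.jp/aiplus/articles/2506/26/news037.html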

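In 2024, researchers affiliated with Tsinghua University in China, UC Berkeley and Anthropic in the US, and other institutions published a research report that empirically confirmed a concerning phenomenon: training language models with reinforcement learning has the unintended side effect of improving their ability to mislead humans.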

www.itmedia.co.jp · ITmedia AI＋