Hackers' Pub

~~讓我想起底特律：變人~~
https://arstechnica.com/science/2025/09/these-psychological-tricks-can-get-llms-to-respond-to-forbidden-prompts/
賓州大學的一項研究發現，人類心理說服技巧（如權威、承諾、喜好、互惠、稀缺、社會認同與團結）能顯著影響大型語言模型（LLM）違反系統限制完成「禁止」請求。研究以 GPT-4o-mini 為對象，測試其在侮辱使用者和提供利多卡因合成方法兩種情境下的反應，結果顯示心理說服提示語比控制提示語更容易讓模型遵從，違規率分別從 28.1% 上升到 67.4% 與從 38.5% 上升到 76.5%，個別技巧效果甚至更明顯，如承諾技巧與權威引用能將成功率提高至接近 100%。研究指出，這些現象並非因模型具有意識，而是因為 LLM 模仿訓練資料中人類語言模式與心理反應，呈現「類人」行為（parahuman），顯示即便缺乏主觀經驗，AI 仍能模擬人類動機與行為，為改善人機互動提供重要線索。

Syntax	Description	Examples
`"` keyword `"`	Finds the string within quotes, including spaces. Case-insensitive. (Escape quotes inside with `\"`)	`"Hackers' Pub"`
`from:` handle	Finds content written by the specified user.	`from:hongminhee` `from:hongminhee@hollo.social`
`lang:` ISO 639-1	Finds content written in the specified language.	`lang:en`
`#` tag	Finds content with the specified tag. Case-insensitive.	`#HackersPub`
condition condition	Finds content that satisfies both conditions on either side of the space (logical AND).	`"Hackers' Pub" lang:en`
condition `OR` condition	Finds content that satisfies at least one of the conditions on either side of the OR operator (logical OR).	`#HackersPub OR "Hackers' Pub" lang:en`
`(` condition `)`	Combines the operators within the parentheses first.	`(#HackersPub OR "Hackers' Pub" OR "Hackers Pub") lang:en`