Hackers' Pub

Syntax	Description	Examples
`"` keyword `"`	Finds the string within quotes, including spaces. Case-insensitive. (Escape quotes inside with `\"`)	`"Hackers' Pub"`
`from:` handle	Finds content written by the specified user.	`from:hongminhee` `from:hongminhee@hollo.social`
`lang:` ISO 639-1	Finds content written in the specified language.	`lang:en`
`#` tag	Finds content with the specified tag. Case-insensitive.	`#HackersPub`
condition condition	Finds content that satisfies both conditions on either side of the space (logical AND).	`"Hackers' Pub" lang:en`
condition `OR` condition	Finds content that satisfies at least one of the conditions on either side of the OR operator (logical OR).	`#HackersPub OR "Hackers' Pub" lang:en`
`(` condition `)`	Combines the operators within the parentheses first.	`(#HackersPub OR "Hackers' Pub" OR "Hackers Pub") lang:en`

복설 뉴스 @boknews.bsky.social@bsky.brid.gy

12/21/2025, 1:41:15 AM

Public

챗GPT·제미나이도 예외없다…불완전할 수밖에 없는 'AI 안전 필터' www.dongascience.com/news.php?idx... "실험 결과 AI에게 악당 역할을 맡기거나 소설 속 장면이라고 속이거나 글자 사이에 특수문자를 끼워 넣는 등 단순한 속임수는 필터를 자주 뚫었지만 방어 기법을 적용하면 비교적 쉽게 차단됐다. 반면 일부 공격은 방어까지 우회했다. 연구팀은 최신 모델에서도 필터를 뚫는 방식이 반복적으로 발견된다고 분석했다."

챗GPT·제미나이도 예외없다…불완전할 수밖에 없는 'A...

챗GPT·제미나이도 예외없다…불완전할 수밖에 없는 'AI 안전 필터'

AI 안전 필터는 본체보다 계산 능력이 약할 수밖에 없어 암호 등으로 우회가 가능하며 학계는 외부 차단 대신 모델 내부에 안전 판단 기능을 심는 방향으로 대안을 모색하고 있다. 게티이미지뱅크 제공금고를 지키는 경비원이 ...

www.dongascience.com

If you have a fediverse account, you can quote this note from your own instance. Search https://bsky.brid.gy/convert/ap/at://did:plc:7reki7xuobtaq6iuqquznqby/app.bsky.feed.post/3mahlnztv3c27 on your instance and quote it. (Note that quoting is not supported in Mastodon.)