Hackers' Pub


geeknews_bot @geeknews_bot@sns.lemondouble.com

12/2/2025, 2:56:55 AM

Public

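DeepSeekMath-V2 released - Toward self-verifiable mathematical reasoning
------------------------------
- A model that aims to improve the *mathematical reasoning ability* of large language models and, going beyond simple final-answer accuracy, strengthens the *verifiability of the reasoning process*
- Introduces a *self-verification* mechanism to address the limitations of existing reinforcement-learning-based approaches, which *center rewards on the final answer*
- As with *theorem proving*, step…
------------------------------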
https://news.hada.io/topic?id=24763&utm_source=googlechat&utm_medium=bot&utm_campaign=1834

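DeepSeekMath-V2 released - Toward self-verifiable mathematical reasoning | GeekNews

A model that aims to improve the mathematical reasoning ability of large language models and, going beyond simple final-answer accuracy, strengthens the verifiability of the reasoning process. It introduces a self-verification mechanism to address the limitations of existing reinforcement-learning-based approaches, which center rewards on the final answer. In problems that require step-by-step logical development, such as theorem proving, the generative model itself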

news.hada.io · GeekNews
