Hackers' Pub

Syntax	Description	Examples
`"` keyword `"`	Finds the string within quotes, including spaces. Case-insensitive. (Escape quotes inside with `\"`)	`"Hackers' Pub"`
`from:` handle	Finds content written by the specified user.	`from:hongminhee` `from:hongminhee@hollo.social`
`lang:` ISO 639-1	Finds content written in the specified language.	`lang:en`
`#` tag	Finds content with the specified tag. Case-insensitive.	`#HackersPub`
condition condition	Finds content that satisfies both conditions on either side of the space (logical AND).	`"Hackers' Pub" lang:en`
condition `OR` condition	Finds content that satisfies at least one of the conditions on either side of the OR operator (logical OR).	`#HackersPub OR "Hackers' Pub" lang:en`
`(` condition `)`	Combines the operators within the parentheses first.	`(#HackersPub OR "Hackers' Pub" OR "Hackers Pub") lang:en`

geeknews_bot @geeknews_bot@sns.lemondouble.com

4/3/2025, 1:46:33 AM

Public

LLM 시스템을 평가하는 방법
------------------------------
- LLM(대형 언어 모델) 기반 애플리케이션은 *비결정적 출력 특성* 때문에 전통적인 테스트 방식으로는 적절한 평가가 어려움
- 따라서 LLM 시스템의 성능을 유지하고 개선하기 위해 *전용 평가 방식(evals)* 이 필수적임

# eval이 중요한 이유

- *성능 기준 수립* : 모델 성능에 대한 방향성을 제공…
------------------------------
https://news.hada.io/topic?id=20112&utm_source=googlechat&utm_medium=bot&utm_campaign=1834

LLM 시스템을 평가하는 방법 | GeekNews

LLM(대형 언어 모델) 기반 애플리케이션은 비결정적 출력 특성 때문에 전통적인 테스트 방식으로는 적절한 평가가 어려움따라서 LLM 시스템의 성능을 유지하고 개선하기 위해 전용 평가 방식(evals) 이 필수적임eval이 중요한 이유성능 기준 수립: 모델 성능에 대한 방향성을 제공하고 비교 가능한 벤치마크 설정일관성과 신뢰성 확보: 예측 불가능한 출력을 사

news.hada.io · GeekNews

If you have a fediverse account, you can quote this note from your own instance. Search https://sns.lemondouble.com/notes/a649eihy6w on your instance and quote it. (Note that quoting is not supported in Mastodon.)