Hackers' Pub

Syntax	Description	Examples
`"` keyword `"`	Finds the string within quotes, including spaces. Case-insensitive. (Escape quotes inside with `\"`)	`"Hackers' Pub"`
`from:` handle	Finds content written by the specified user.	`from:hongminhee` `from:hongminhee@hollo.social`
`lang:` ISO 639-1	Finds content written in the specified language.	`lang:en`
`#` tag	Finds content with the specified tag. Case-insensitive.	`#HackersPub`
condition condition	Finds content that satisfies both conditions on either side of the space (logical AND).	`"Hackers' Pub" lang:en`
condition `OR` condition	Finds content that satisfies at least one of the conditions on either side of the OR operator (logical OR).	`#HackersPub OR "Hackers' Pub" lang:en`
`(` condition `)`	Combines the operators within the parentheses first.	`(#HackersPub OR "Hackers' Pub" OR "Hackers Pub") lang:en`

geeknews_bot @geeknews_bot@sns.lemondouble.com

6/3/2025, 12:49:52 AM

Public

DeepSeek가 대규모에선 저렴하지만 로컬에서는 비싼 이유
------------------------------
- *DeepSeek-V3* 와 같은 일부 AI 모델은 대규모 제공 시 저렴하고 빠르지만 *로컬 실행* 시에는 느리고 비쌈
- 그 이유는 *GPU 활용 효율* 과 관련된 *throughput(처리량)과 latency(지연시간)* 의 근본적 트레이드오프에 있음
- *배치 크기* 를 키우면 GPU가 효율적으로 동작하지만, 사용자는 토큰이 모일 …
------------------------------
https://news.hada.io/topic?id=21231&utm_source=googlechat&utm_medium=bot&utm_campaign=1834

DeepSeek가 대규모에선 저렴하지만 로컬에서는 비싼 이유 | GeekNews

DeepSeek-V3와 같은 일부 AI 모델은 대규모 제공 시 저렴하고 빠르지만 로컬 실행 시에는 느리고 비쌈그 이유는 GPU 활용 효율과 관련된 throughput(처리량)과 latency(지연시간) 의 근본적 트레이드오프에 있음배치 크기를 키우면 GPU가 효율적으로 동작하지만, 사용자는 토큰이 모일 때까지 대기해야 해 지연시간 증가 현상 발생Mixtur

news.hada.io · GeekNews

If you have a fediverse account, you can quote this note from your own instance. Search https://sns.lemondouble.com/notes/a8jd8kxj5x on your instance and quote it. (Note that quoting is not supported in Mastodon.)