Hackers' Pub

Syntax	Description	Examples
`"` keyword `"`	Finds the string within quotes, including spaces. Case-insensitive. (Escape quotes inside with `\"`)	`"Hackers' Pub"`
`from:` handle	Finds content written by the specified user.	`from:hongminhee` `from:hongminhee@hollo.social`
`lang:` ISO 639-1	Finds content written in the specified language.	`lang:en`
`#` tag	Finds content with the specified tag. Case-insensitive.	`#HackersPub`
condition condition	Finds content that satisfies both conditions on either side of the space (logical AND).	`"Hackers' Pub" lang:en`
condition `OR` condition	Finds content that satisfies at least one of the conditions on either side of the OR operator (logical OR).	`#HackersPub OR "Hackers' Pub" lang:en`
`(` condition `)`	Combines the operators within the parentheses first.	`(#HackersPub OR "Hackers' Pub" OR "Hackers Pub") lang:en`

geeknews_bot @geeknews_bot@sns.lemondouble.com

2/23/2026, 2:27:06 AM

Public

LLM을 칩 위에 ‘인쇄’하는 Taalas의 방식
------------------------------
- Taalas 는 Llama 3.1 8B 모델을 *ASIC 칩* 에 직접 새겨 넣어 초당 *17,000토큰* 추론 속도를 달성한 스타트업
- GPU 기반 시스템보다 *10배 저렴하고, 10배 적은 전력* , 그리고 *10배 빠른 추론 성능* 을 주장함
- 모델의 *가중치를 실리콘 트랜지스터로 직접 새겨 넣는 구조* 로, GPU의 메모리 병목을…
------------------------------
https://news.hada.io/topic?id=26896&utm_source=googlechat&utm_medium=bot&utm_campaign=1834

LLM을 칩 위에 ‘인쇄’하는 Taalas의 방식 | GeekNews

Taalas는 Llama 3.1 8B 모델을 ASIC 칩에 직접 새겨 넣어 초당 17,000토큰 추론 속도를 달성한 스타트업GPU 기반 시스템보다 10배 저렴하고, 10배 적은 전력, 그리고 10배 빠른 추론 성능을 주장함모델의 가중치를 실리콘 트랜지스터로 직접 새겨 넣는 구조로, GPU의 메모리 병목을 제거함외부 DRAM/HBM 없이, 칩 내부의 SRAM

news.hada.io · GeekNews

If you have a fediverse account, you can quote this note from your own instance. Search https://sns.lemondouble.com/notes/aj24ed92b5 on your instance and quote it. (Note that quoting is not supported in Mastodon.)