Prompt caching: 10x cheaper LLM tokens, but how?
Link: https://ngrok.com/blog/prompt-caching/
Discussion: https://news.ycombinator.com/item?id=46290620