Cut LLM costs by 40-70% within 24 hours using five strategies: (1) Prompt Caching (saves 50-90%), (2) Model Routing (20-60%), (3) Semantic Caching (15-30%), (4) Batch Processing (50% reduction), (5) Using an AI Gateway. Apply them right away to optimize costs, especially for repeated traffic or simple tasks. #LLM #AIOptimization #CostSaving #PromptCaching #ModelRouting #AI #MachineLearning #CostOptimization #ArtificialIntelligence #AIgateway #BatchProcessing

https://dev.to/scalemind/how-to-reduce-llm-costs-by-40-in-

Oh look, another genius idea from the depths of corporate innovation 🤔: cut costs with 'prompt caching' and save those precious LLM tokens 💰. Because clearly, the problem is not the convoluted explanations but *how* to make them cheaper in bulk. As if slapping a price tag on incomprehensibility is the ultimate solution 🎉.
https://ngrok.com/blog/prompt-caching/ #corporateinnovation #promptcaching #costcutting #LLMtokens #techsatire #businessstrategy #HackerNews #ngated
Prompt caching: 10x cheaper LLM tokens, but how? | ngrok blog

A far more detailed explanation of prompt caching than anyone asked for.

ngrok blog
Digging a little deeper into Prompt Caching (Anthropic) - Qiita

Anthropic's official site explains the Prompt Caching spec in detail. Even so, there were parts I couldn't quite understand, so I actually ran it to deepen my understanding. Along the way, I also picked up insights into how best to use it depending on the situation....

Qiita
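The hands-on article above exercises Anthropic's documented prompt-caching feature: a large, stable prefix (such as a long system prompt) is marked with a `cache_control` breakpoint so subsequent requests reuse the cached prefix at a discount. A sketch of the request shape only (model name and prompt text are illustrative; no API call is made):

```python
# Shape of an Anthropic Messages API request using prompt caching.
# The long, stable system prompt is marked with cache_control so that
# repeated requests reuse the cached prefix instead of re-billing it.
LONG_SYSTEM_PROMPT = "You are a support assistant for Acme Corp. " * 200  # stable prefix worth caching

request = {
    "model": "claude-3-5-sonnet-20241022",  # illustrative model id
    "max_tokens": 1024,
    "system": [
        {
            "type": "text",
            "text": LONG_SYSTEM_PROMPT,
            "cache_control": {"type": "ephemeral"},  # marks the end of the cached prefix
        }
    ],
    "messages": [
        {"role": "user", "content": "How do I reset my password?"}  # varies per request
    ],
}
```

Only the content up to the `cache_control` marker is cached; the per-request user message after it is billed normally, which is why the stable bulk of the prompt should sit before the marker.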
Reduce the token count of the MCP tool list with Amazon Bedrock's Prompt Caching - Qiita

Introduction: When using MCP, the MCP tool-list information consumes a fixed (and fairly large) number of tokens on every request. Prompt Caching can be used to cut this cost. This article shows, via Amazon Bedrock, with Claude Sonn...

Qiita
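The Bedrock article's technique relies on the Converse API's documented `cachePoint` content block: placing one after the tool specifications caches everything before it, so a bulky MCP tool list is not re-billed on every call. A sketch of the request shape (tool definitions and model id are illustrative; no API call is made):

```python
# Shape of a Bedrock Converse API request caching a large MCP tool list.
# The cachePoint entry after the tool specs tells Bedrock to cache everything
# above it, so the tool definitions are not re-billed on every request.
tool_config = {
    "tools": [
        {
            "toolSpec": {
                "name": "search_docs",  # illustrative MCP tool
                "description": "Search internal documentation.",
                "inputSchema": {
                    "json": {
                        "type": "object",
                        "properties": {"query": {"type": "string"}},
                        "required": ["query"],
                    }
                },
            }
        },
        # ...more MCP tool specs...
        {"cachePoint": {"type": "default"}},  # cache boundary: tools above are cached
    ]
}

request = {
    "modelId": "anthropic.claude-3-7-sonnet-20250219-v1:0",  # illustrative model id
    "toolConfig": tool_config,
    "messages": [{"role": "user", "content": [{"text": "Find the VPN setup guide."}]}],
}
```

This payload would be passed to `bedrock_runtime.converse(**request)`; only the user message after the cache point varies between calls.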
Amazon Bedrock's Prompt Caching seems to have hit GA, so let's try it - Qiita

The Prompt Caching feature has reached GA on Amazon Bedrock https://docs.aws.amazon.com/bedrock/latest/userguide…

Qiita