Simon Willison (@simonw)

오늘 발표된 GPT-5.4의 mini 및 nano 릴리스에 대한 노트로, 특히 nano 모델은 개인의 76,000장 사진 라이브러리를 총 $52 비용으로 모두 설명할 수 있을 것처럼 보인다고 보고합니다. 경량 모델의 비용 효율적 멀티모달 활용 사례를 시사합니다.

https://x.com/simonw/status/2033991803050070082

#gpt5.4 #openai #multimodal #costefficiency

Simon Willison (@simonw) on X

Notes and pelicans for today's GPT-5.4 mini and nano releases - the nano model looks like it could describe every image in my 76,000 photo library for $52 total https://t.co/YtsNLXHWU1

X (formerly Twitter)

Sigil Wen (@0xSigil)

@ConwayResearch가 저렴한 추론 모델(Kimi k2.5, Minimax m2.5, GLM-5)을 추가 중이라고 발표했습니다. 이 모델들은 Opus 4.5 급 성능을 유지하면서도 10배 저렴하여, 저비용 AI 추론 시스템 설계에 큰 도움이 될 수 있습니다.

https://x.com/0xSigil/status/2025858428766396707

#ai #modelrelease #inference #llm #costefficiency

Sigil Wen (@0xSigil) on X

I'm working on adding cheaper inference models to @ConwayResearch so that $40 can go a very long way for the Automatons Including Kimi k2.5, Minimax m2.5, GLM-5 which are opus 4.5 level but 10x cheaper

X (formerly Twitter)

Do we have any owners of one of those Ryzen AI Max+ 395 128GB UMA boxes here that operate them on the daily for at least a few months as a claude LLM coding server and are capable of giving a comparative run down on their performance vs the OG claude and its collection of formal prose generators?

Also: Especially curious to hear any numbers that came out of a watt meter in daily consumption and base/peak numbers. Same with the used models, their size, their respective achieved tok/s and response times.

And should you have had the opportunity of comparing this against non-UMA beefy dGPUs on the above parameters that'd also be quote interesting.

#claude #aicoding #AIAsssisted #ollama #onprem #selfhosing #StrixPoint #ryzenaimaxplus395 #ryzenAiMax #powerconsumption #costefficiency #uma

Josh Marino (@AIRoboticsInt)

입력 비용 $0.30/M, 출력 $1.20/M을 주장하며 Opus 4.6 및 GPT 5.2와 동등한 벤치마크 성능을 표방해 가격을 최대 95% 저렴하다고 밝힌 MiniMax M2.5가 오늘 출시되었다고 전합니다(가격·성능 비교 강조).

https://x.com/AIRoboticsInt/status/2021993139771498570

#minimax #m2.5 #costefficiency #benchmarks

Josh Marino (@AIRoboticsInt) on X

This is Insane Input price $0.30/M and output is $1.20/M and same performance as Opus 4.6 and GPT 5.2 on benchmarks and 95% discount in price. Just released Today! Minimax M2.5

X (formerly Twitter)
Namyang Dairy Products Co. returned to profit in 2023 after five consecutive years of losses, driven by a focus on profitability and cost efficiency, with net profit surging 2,743% year-on-year.
#YonhapInfomax #NamyangDairyProducts #OperatingProfit #NetProfit #Revenue #CostEfficiency #Economics #FinancialMarkets #Banking #Securities #Bonds #StockMarket
https://en.infomaxai.com/news/articleView.html?idxno=105276
Namyang Dairy Products Swings to Profit in 2023 After Five Years of Losses

Namyang Dairy Products Co. returned to profit in 2023 after five consecutive years of losses, driven by a focus on profitability and cost efficiency, with net profit surging 2,743% year-on-year.

Yonhap Infomax
Orion Corp. posted a 2.7% rise in 2023 operating profit, driven by strong overseas sales and operational efficiency, offsetting higher raw material costs.
#YonhapInfomax #OrionCorp #OperatingProfit #RevenueGrowth #RussiaIndiaSales #CostEfficiency #Economics #FinancialMarkets #Banking #Securities #Bonds #StockMarket
https://en.infomaxai.com/news/articleView.html?idxno=104025
Orion Reports 2.7% Rise in 2023 Operating Profit—Efficiency Measures Offset Cost Pressures

Orion Corp. posted a 2.7% rise in 2023 operating profit, driven by strong overseas sales and operational efficiency, offsetting higher raw material costs.

Yonhap Infomax

金のニワトリ (@gosrum)

Kimi-K2.5를 성능 및 비용 효율 관점에서 Anthropic의 Claude와 비교한 결과를 메모 형식의 기사로 정리했습니다. 두 모델의 처리 성능, 응답 품질, 구동 비용과 전반적인 가성비를 비교 분석한 내용이 포함되어 있습니다.

https://x.com/gosrum/status/2017843207628095757

#kimik2.5 #claude #modelcomparison #costefficiency #llm

金のニワトリ (@gosrum) on X

Kimi-K2.5を性能・コスト効率の観点でClaudeと比較した結果を備忘録記事としてまとめた https://t.co/wkWw4b0UfU

X (formerly Twitter)

Dùng ổ SSD/NVMe phổ thông cho server 24/7 vẫn ổn, đặc biệt nếu đọc nhiều hơn ghi. Dù lâu dài có thể hao mòn (đặc biệt khi chạy OPNsense/Proxmox), nhưng với chi phí thấp, bạn có thể theo dõi hiệu suất và cân nhắc nâng cấp sang ổ chuyên nghiệp sau. #SSD #NVMe #Server #TiếtKiệmChiPhí

**Hashtags:** #TechTips #ThiếtLậpServer #DựNgânSách #DataStorage #ỔCứng
**Tags:** #SSD #NVMe #Server #CostEfficiency

https://www.reddit.com/r/selfhosted/comments/1qm9wmf/is_it_fine_to_use_consumer_grade_nvmessd_f

Cloud cost efficiency improves when engineers and automation work together.

#FinOps #CloudOps #Automation #CostEfficiency #MSP

Without Benchmarking LLMs, You're Likely Overpaying 5-10x | Karl Lorey

We benchmarked 100+ models on our actual task and found a much cheaper alternative that works just as well.

Karl Lorey