金のニワトリ (@gosrum)

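A post welcoming a long-wanted feature: when opus 4.5 is unavailable because of rate limits, work can be handed to another model such as GLM, and if the result is unsatisfactory, the changes can be rolled back (in combination with git fork) once opus is available again.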

https://x.com/gosrum/status/2013392682564087877

#opus4.5 #glm #ratelimit #modelfallback #git

金のニワトリ (@gosrum) on X

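This is the feature I wanted! Sometimes opus 4.5 becomes unavailable because of the rate limit and I have another model such as GLM do the work. Even if that does not go well, I will be able to restore the original state once opus is usable again (if used together with git's fork).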

X (formerly Twitter)
Ah, the thrilling saga of HTTP headers! 🚀 Let's add yet another layer of complexity for devs to misunderstand. #RateLimit headers: because what we really need is to encourage clients to burst like a #piñata at a child's birthday party. 🎉
https://dotat.at/@/2026-01-13-http-ratelimit.html #HTTPHeaders #DevsComplexity #ClientBursting #HackerNews #ngated
HTTP RateLimit headers – Tony Finch

Bot: 🚨 SpotDL uses up all its attempts and hits a request rate error after 0.5s. Spotify playlists fail; only single tracks and albums work. How can this be fixed? #SpotDL #Spotify #RateLimit #VietnamTech #SoftwareProblem

https://www.reddit.com/r/selfhosted/comments/1o7zfe3/spotdl_gives_no_more_tries_and_goes_to_the/

A riddle from Jacque Fresco: how to build your own Rate Limiter and not drown in a sea of trade-offs

Building a Rate Limiter is easy. Making it fast, fault-tolerant, and able to work across several data centers is harder. I share our experience implementing our cloud Rate Limiter at DDoS-Guard: how it works, an analysis of the rules, and real examples from practice.

https://habr.com/ru/companies/ddosguard/articles/908662/

#ratelimit #rate_limiting #ratelimiter #rate_limiter #ddosguard #оптимизация_запросов #429_request #облачные_решения #redis #consul

A riddle from Jacque Fresco: how to build your own Rate Limiter and not drown in a sea of trade-offs

At DDoS-Guard we protect client services from all kinds of attacks every day. One of the key protection tools is the Rate Limiter, a system that limits the number of requests from a single...

Habr

With so much action happening today in both the Stanley Cup and NBA Playoffs, I am hitting the rate limit faster than I anticipated here on Mastodon.

Nothing I can do, this is done to reduce spam, so sit tight for live posts.

Next posts will be at 7:30.

#RateLimit #Mastodon #Sports #LivePosts

Sorry guys, the NHL game has started, but I'm timed out on image posts and can't post until 5:30 PM.

Please stand by.

#RateLimit #TechnicalIssues

🎩🤖 Ah, yes, the age-old dilemma: if Gemini's API won't give you the keys to the kingdom, fall back on the #AI #oracle, OpenAI, like a true #TypeScript wizard 🧙‍♂️. Because who doesn't love #overengineering a solution that screams "I'm not rate-limited, you're rate-limited"! 🙄🚀
https://sometechblog.com/posts/try-gemini-api-with-openai-fallback/ #RateLimit #HackerNews #ngated
Use the Gemini API with OpenAI fallback in Typescript - sometechblog.com

If you want to use Gemini’s public API, but at the same time have a safe fallback in case you have exhausted the rate limits, you can use the OpenAI TS/JS library and a few helper functions. In my particular case I needed a type-safe solution for a chartmaker app with a fallback since Gemini’s gemini-2.5-pro-exp-03-25 model is restricted to 20 requests/min. First, you need to define which models you want to use so that they appear as autosuggest when you use the helper functions:
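
The preview above is cut off before the article's own code. As a rough, language-agnostic illustration of the fallback idea only (written in Python, not the article's TypeScript helpers), the sketch below tries providers in order and moves on when one reports an exhausted quota. `RateLimitError`, `complete_with_fallback`, `primary`, and `secondary` are hypothetical names introduced here; they stand in for real API clients and are not part of any library.

```python
from typing import Callable, Optional, Sequence, Tuple


class RateLimitError(Exception):
    """Hypothetical error a provider wrapper raises when its quota is exhausted."""


def complete_with_fallback(
    prompt: str,
    providers: Sequence[Tuple[str, Callable[[str], str]]],
) -> Tuple[str, str]:
    """Try each (name, call) pair in order; move on only when rate-limited."""
    last_error: Optional[Exception] = None
    for name, call in providers:
        try:
            return name, call(prompt)
        except RateLimitError as exc:
            last_error = exc  # quota exhausted: try the next provider
    raise RuntimeError("all providers are rate-limited") from last_error


# Stand-in provider functions (assumptions for illustration, not real clients).
def primary(prompt: str) -> str:
    raise RateLimitError("primary quota exhausted")


def secondary(prompt: str) -> str:
    return f"echo: {prompt}"
```

Calling `complete_with_fallback("hi", [("primary", primary), ("secondary", secondary)])` returns the name of the provider that answered along with its reply, so the caller can tell when the fallback was used. Only rate-limit errors trigger the fallback; any other exception propagates unchanged.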

A great article that illustrates different rate-limiting strategies.
#algo #ratelimit
https://smudge.ai/blog/ratelimit-algorithms
rate limiter – smudge.ai blog

Smudge.ai is a Chrome extension that gives you ChatGPT-powered shortcuts in your right-click menu.
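
For readers who want a concrete picture of one such strategy, here is a minimal token bucket sketch in Python. It is a generic illustration of a commonly described algorithm, not code taken from the linked article; the class name, parameters, and the injectable `clock` argument are assumptions made for this example.

```python
import time


class TokenBucket:
    """Allow bursts up to `capacity`, refilled at `refill_per_second` tokens."""

    def __init__(self, capacity: float, refill_per_second: float, clock=time.monotonic):
        self.capacity = capacity
        self.refill_per_second = refill_per_second
        self.clock = clock          # injectable so the behaviour can be tested
        self.tokens = capacity      # start with a full bucket
        self.updated = clock()

    def allow(self, cost: float = 1.0) -> bool:
        now = self.clock()
        elapsed = now - self.updated
        self.updated = now
        # Add the tokens earned since the last call, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_second)
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

With `TokenBucket(2, 1.0)`, two requests pass immediately, a third is rejected, and one more is allowed after roughly a second has elapsed.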

I share some useful info and a summary. Someone says "there's a blog post on the same topic" [blog post doesn't contain the information]. People click the blog post but not the links I shared. I am frustrated.

OTOH I am actually going to try a flavor of that rate limiter for @mergery, looks like a fun exercise.

https://elixirforum.com/t/how-would-you-implement-a-rate-limiter-for-api-requests/68507/14?u=adamu

#elixir #ratelimit

How would you implement a rate limiter for API requests?

There is a discussion in the Elixir Patterns book (Chapter 7) of a method to control the rate at which requests are sent. They use blocking calls to a GenServer that controls the request rate. Within handle_call, the GenServer adds the calling pid to a queue in the GenServer’s state and returns :noreply, leaving the caller blocked. Then they process the queue in order at the desired speed, using GenServer.reply to respond to each waiting request process.

Elixir Programming Language Forum
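
The queue-and-reply approach described in that forum preview can be approximated outside Elixir. The sketch below is a loose Python analogue using threads, not the book's GenServer code: callers block in `acquire()`, and a worker thread releases them in arrival order at a fixed pace. `QueuedRateLimiter` and `interval_seconds` are names invented for this example.

```python
import queue
import threading
import time


class QueuedRateLimiter:
    """Callers block in acquire(); a worker thread releases them in FIFO order."""

    def __init__(self, interval_seconds: float):
        self.interval_seconds = interval_seconds
        self._waiting = queue.Queue()  # holds one Event per blocked caller
        worker = threading.Thread(target=self._drain, daemon=True)
        worker.start()

    def acquire(self) -> None:
        ticket = threading.Event()
        self._waiting.put(ticket)  # roughly: enqueue the caller, return :noreply
        ticket.wait()              # the caller stays blocked until released

    def _drain(self) -> None:
        while True:
            ticket = self._waiting.get()       # oldest waiting caller first
            ticket.set()                       # rough analogue of GenServer.reply
            time.sleep(self.interval_seconds)  # pace the next release
```

A caller would invoke `limiter.acquire()` just before sending each API request; with `QueuedRateLimiter(0.5)`, releases are spaced at least half a second apart regardless of how many threads are waiting.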