Message Queue like tool needed for queueing limit (no internal queue)
If one person or organization makes an excessive number of requests, it could bog down the API for everyone else. By throttling the number of requests that a single user can make, OpenAI ensures that the most number of people have an opportunity to use the API without experiencing slowdowns.
OpenAI API Ratelimit Tools