(UserError) Received too many requests in a short amount of time when calling create_agent_evaluation() #206
I'm testing an Agent Evaluation workflow locally using the Azure AI Foundry SDK (azure-ai-projects), and evaluation triggers that previously worked are now consistently failing with a UserError: Received too many requests in a short amount of time. The issue persists even after multiple retries and across restarts, suggesting a potential rate-limit or quota enforcement on the evaluation endpoint.

I'm using the Agent Framework with the following evaluation configuration and retry logic:

Evaluation trigger:

Evaluators:

Even with exponential backoff (up to 5 retries, 2–32 s delay), I still get:

Full stack trace:

Current region: WestUS2
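For readers following along, here is a minimal sketch of the kind of retry policy described above (up to 5 retries, delays doubling from 2 s to 32 s). It is not the original retry code from this post. `TooManyRequestsError`, `backoff_delays`, and `call_with_backoff` are hypothetical names, and `trigger` stands in for whatever callable submits the evaluation; the exception type the SDK actually raises may differ.

```python
import random
import time


class TooManyRequestsError(Exception):
    """Hypothetical stand-in for the throttling error surfaced by the SDK."""


def backoff_delays(max_retries=5, base=2.0, cap=32.0):
    """Delay before each retry: 2, 4, 8, 16, 32 seconds with the defaults."""
    return [min(cap, base * 2 ** attempt) for attempt in range(max_retries)]


def call_with_backoff(trigger, max_retries=5, base=2.0, cap=32.0,
                      sleep=time.sleep, jitter=random.random):
    """Call trigger(); on a throttling error, wait and retry.

    `trigger` is an assumed zero-argument callable that submits the
    evaluation. `sleep` and `jitter` are injectable so the policy can be
    exercised without real waiting.
    """
    delays = backoff_delays(max_retries, base, cap)
    for attempt in range(max_retries + 1):
        try:
            return trigger()
        except TooManyRequestsError:
            if attempt == max_retries:
                # Retries exhausted: surface the original error.
                raise
            # Add up to 1 s of random jitter so parallel clients
            # do not all retry at the same instant.
            sleep(delays[attempt] + jitter())
```

If the throttling comes from regional capacity rather than a short burst, a policy like this will still exhaust its retries, which matches the behavior reported here.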
Replies: 1 comment 1 reply
What are the current evaluation request limits (per project, per user, or per region)?
I would have to get back to you on exact capacity limits, but we are seeing higher usage in WestUS2 at this time, so I would suggest a different region if possible. It is also possible to bring your own (BYO) deployment if that would help your situation.
Does max_request_rate in the sampling configuration influence throttling, or is it informational only?
Yes, max_request_rate does influence throttling.
Is there a recommended delay or concurrency policy for evaluations triggered through the Agent Framework?
First, I would recommend increasing your sampling percentage. Your sampling config is set to 10 …