-
Notifications
You must be signed in to change notification settings - Fork 3.3k
[BUG] 404 Not Found on evaluate() with Azure OpenAI Graders - Possible Service/SDK Incompatibility #44763
Copy link
Copy link
Closed
Labels
EvaluationIssues related to the client library for Azure AI EvaluationIssues related to the client library for Azure AI EvaluationOpenAIService AttentionWorkflow: This issue is responsible by Azure service team.Workflow: This issue is responsible by Azure service team.customer-reportedIssues that are reported by GitHub users external to the Azure organization.Issues that are reported by GitHub users external to the Azure organization.needs-team-attentionWorkflow: This issue needs attention from Azure service team or SDK teamWorkflow: This issue needs attention from Azure service team or SDK teamquestionThe issue doesn't require a change to the product in order to be resolved. Most issues start as thatThe issue doesn't require a change to the product in order to be resolved. Most issues start as that
Metadata
Metadata
Labels
EvaluationIssues related to the client library for Azure AI EvaluationIssues related to the client library for Azure AI EvaluationOpenAIService AttentionWorkflow: This issue is responsible by Azure service team.Workflow: This issue is responsible by Azure service team.customer-reportedIssues that are reported by GitHub users external to the Azure organization.Issues that are reported by GitHub users external to the Azure organization.needs-team-attentionWorkflow: This issue needs attention from Azure service team or SDK teamWorkflow: This issue needs attention from Azure service team or SDK teamquestionThe issue doesn't require a change to the product in order to be resolved. Most issues start as thatThe issue doesn't require a change to the product in order to be resolved. Most issues start as that
Type
Fields
Give feedbackNo fields configured for issues without a type.
Description
I am encountering a persistent 404 Not Found error when attempting to run the Azure OpenAI Graders workflow, both via the Python SDK and the Azure AI Foundry (Portal) GUI. This occurs even when using the official sample dataset and following the lab documentation.
1. Environment & Setup
Notebook Source:
scenarios/evaluate/Azure_OpenAI_Graders/Azure_OpenAI_Graders.ipynbPython Version: 3.10 (Miniconda)
Libraries:
azure-ai-evaluation,openai(latest)Region:
canadaeastModels Tested:
gpt-4o,gpt-4.1-nano,o32. Identified Issues
A. Missing Module / Deprecation
The notebook initially fails on the following import:
Error: Module not found. I had to remove this line to proceed, indicating a possible breaking change in the recent
openailibrary versions that the notebook has not yet accounted for.B. Execution Failure (404 Error)
When calling the
evaluatefunction, the process fails with anopenai.NotFoundError: Error code: 404.Code Snippet:
Traceback Highlights:
The error originates deep within the openai base client during the evaluation group creation:
3. Troubleshooting Steps Taken
To rule out configuration errors, I performed the following:
Infrastructure: Completely re-deployed the Azure AI project and resources.
Credentials: Verified API keys, endpoints, and environment variables (all correct).
Model Variety: Switched between GPT-4o, GPT-4.1-nano, and o3 deployments; all returned the same 404.
Portal Testing: Attempted the same evaluation using the Azure AI Foundry GUI (Match Criteria tool) with the official
data.jsonl.Portal Result: The run starts and abruptly ends with
EvaluationException('Error code: 404')andINFO:__main__:RUN DOES NOT EXIST.4. Summary of Observations
Attempt | Method | Result -- | -- | -- Local Notebook | Python SDK | openai.NotFoundError: 404 Azure AI Foundry Portal | GUI / Match Criteria | EvaluationException: 404Expected Behavior
The
evaluate()function should communicate with the Azure OpenAI evaluation back-end, create the evaluation run, and return the metrics.Actual Behavior
The back-end returns a 404, implying that the endpoint the SDK is attempting to hit (
client.evals.create) is either missing, incorrectly constructed, or not supported in the current region/version.