Skip to content

Commit 296f502

Browse files
πŸ“™ docs: Add documentation for Direct Provider Attachments / OCR (#404)
* πŸ“™ docs: Add documentation for Direct Provider Attachments / OCR * docs: add note about Responses API * Revise OCR documentation for clarity and accuracy Updated section titles and clarified details regarding file uploads to OCR providers, including specific notes for OpenAI and Azure OpenAI. --------- Co-authored-by: Danny Avila <[email protected]>
1 parent 5e6f45b commit 296f502

File tree

1 file changed

+15
-0
lines changed

1 file changed

+15
-0
lines changed

β€Žpages/docs/features/ocr.mdxβ€Ž

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -168,6 +168,21 @@ ocr:
168168

169169
Support for custom OCR providers and user-defined strategies is planned for future releases.
170170

171+
### 5. Upload Files to Provider (Direct)
172+
173+
For supported LLM Providers (**OpenAI, AzureOpenAI, Anthropic, and Google**) and their respective models, files can now be sent directly to the provider APIs as message attachments,
174+
allowing the provider to use their own native OCR implementations to parse files using the `Upload to Provider` option in the file attachment dropdown menu.
175+
176+
Currently all four of the aforementioned providers offer support for images and PDFs, with Google also including support for audio and video files when used in conjunction with compatible multimodal models.
177+
178+
<Callout type="note" title="Azure OpenAI PDF Upload Caveat" emoji='✏️'>
179+
For **Azure OpenAI** endpoints, the Upload to Provider option for PDF files is only available when using the Responses API. Azure OpenAI's Chat Completions API supports images but does not support PDF file attachments.
180+
181+
If you do not see 'Upload to Provider' as an option for PDFs in your chat's attachment dropdown menu with Azure OpenAI, ensure that the Responses API parameter is enabled in the Parameters panel.
182+
183+
Note: Standard OpenAI endpoints support PDF uploads in both Chat Completions and Responses APIs.
184+
</Callout>
185+
171186
## Detailed Configuration
172187

173188
For additional, detailed configuration options, see the [OCR Config Object Structure](/docs/configuration/librechat_yaml/object_structure/ocr).

0 commit comments

Comments
Β (0)