Convert scanned PDFs into searchable text locally using Vision LLMs (olmOCR). 100% private, offline, and free. Features a modern Web UI & CLI.
Updated May 18, 2026 - Python
Egyptian ID Card Recognition System 💳 A Python-based application to detect and process Egyptian ID cards using YOLO and EasyOCR.
Advanced PDF processing, OCR, and AI vision analysis nodes for ComfyUI. Extract images from PDFs, perform multilingual OCR with Surya, detect objects with Florence-2, and analyze document layouts.
A pipeline for turning digital collections into structured data -- an LLM-assisted, IIIF-native tool for getting started with sources like digitized print directories.
🎓 Production-grade RAG Tutor for handwritten notes. Features Hybrid Search (BGE-M3 + BM25), Cross-Encoder Reranking, and an LLM-based OCR cleaning pipeline.
OSPA SuryaOCR – Advanced document processing framework for historical sources using Surya OCR and DocLayout-YOLO. Developed within TÜBİTAK Project 323K372.
Internship Project - IIT Bombay
Restaurant menu translation service - OCR engine for text recognition.
👁️ Convert PDFs to editable LaTeX files with images offline on macOS, optimized for Apple Silicon and secure with no cloud dependency.