docuMentor is an AI-powered tool that brings together LocalRQA and CodeScribe into a unified system. It allows users to upload documents, ask questions, and receive accurate answers locally using Retrieval-Augmented Generation (RAG). Additionally, it supports automatic Python code commenting, helping developers understand and document their code with ease.
- ✅ Local Question Answering (LocalRQA): Ask questions on uploaded documents and receive contextually relevant answers using open-source LLMs — all locally.
- 🧠 LLM-Powered Code Commenting (CodeScribe): Automatically generates detailed comments for Python files, improving code readability and maintainability.
- 🔐 Privacy-first: No cloud dependencies. Everything runs locally.
Based on the paper:
"LocalRQA: From Generating Data to Locally Training, Testing, and Deploying Retrieval-Augmented QA Systems"
📎 ACL Anthology | arXiv PDF
- Generates synthetic ⟨question, answer, passage⟩ triples from your documents
- Trains local retriever and generator models (e.g., ColBERTv2, LLaMA)
- Evaluates performance via automated metrics
- Fully deployable QA pipeline that works on PDFs, TXT, or HTML
Inspired by the tool described in:
"Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing"
📎 arXiv Link
- Annotates Python files with function-level comments
- Uses custom prompt templates to maintain context
- Can be extended to other languages in future
- Python 3.10+
- Node.js & npm/yarn (for frontend)
- CUDA-enabled GPU (optional, for model training)
- Git
git clone https://github.com/Adith1207/docuMentor.git
cd docuMentor
pip install -r requirements.txtcd backend
pip install -r requirements.txtcd ../frontend
npm install # or yarn install
npm run dev # start development server- Supports PDF, TXT, HTML
- Returns answer and source passage
- Automatically adds function-level and inline comments
- Optionally supports batch commenting for multiple files
We welcome contributions! You can:
- Report issues or bugs
- Suggest new features
- Contribute to additional language support
- Improve evaluation scripts
docuMentor is licensed under the MIT License — free for academic and commercial


.jpeg)
