Skip to content

Prasad998/DocuSense

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

📘 DocuSense

DocuSense is a powerful chat-based application that lets you interact with multiple PDF documents using natural language. Simply upload your PDFs, ask questions, and get intelligent responses generated from the content of your documents.

This project leverages modern language models to extract, process, and provide accurate answers, ensuring that your queries remain contextual to the loaded PDFs.


🚀 Features

  • 📂 Upload and chat with multiple PDFs simultaneously.
  • 🔍 Get context-aware answers based on document content.
  • ⚡ Efficient text chunking for better processing.
  • 🧠 Embedding-based similarity matching to ensure relevant responses.
  • 🌐 Streamlit-based UI for easy interaction.

🛠 How It Works

The workflow of DocuSense can be summarized in these steps:

  1. PDF Loading – Extracts text content from multiple uploaded PDFs.
  2. Text Chunking – Breaks down extracted text into smaller, manageable chunks.
  3. Embeddings Generation – Creates vector representations of text chunks using a language model.
  4. Similarity Matching – Matches your query with the most semantically similar text chunks.
  5. Response Generation – Passes relevant chunks to the language model to generate precise answers.

DocuSense Workflow


📦 Installation

  1. Clone the repository:

    git clone https://github.com/Prasad998/DocuSense.git
    cd DocuSense
  2. Install dependencies:

    pip install -r requirements.txt
  3. Add your OpenAI API key to a .env file in the project root:

    OPENAI_API_KEY=your_secret_api_key

▶️ Usage

  1. Ensure dependencies are installed and your API key is set up in .env.

  2. Start the app using Streamlit:

    streamlit run app.py
  3. The interface will open in your default browser.

  4. Upload one or more PDFs.

  5. Ask natural language questions about your documents via the chat interface.


📌 Notes

  • The app only responds to queries related to the content of the loaded PDFs.
  • Make sure your OpenAI API key has sufficient credits/permissions.

About

DocuSense: PDF Semantic Search and Q&A Assistant using LLMs

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages