RAG Chatbot

This project is a Retrieval-Augmented Generation (RAG) chatbot that allows users to upload multiple PDF files and ask questions based on the content of those files. The chatbot is built using Python, Flask, HTML, CSS, and JavaScript, Langchain framework, with MongoDB Atlas serving as the vector database for storing and retrieving document embeddings.

The chatbot offers two Large Language Model (LLM) options:

1. Gemini-1.5-Flash-001-Tuning
2. Llama3-8b-8192

For embedding generation, the project utilizes the GoogleGenerativeAIEmbeddings's "models/embedding-001" model. Additionally, the Cohere model "rerank-english-v3.0" is employed for reranking the retrieved results to improve the relevance of answers.

This RAG chatbot is designed to be easily integrated into various applications where users need to query large sets of documents and receive precise and contextually relevant answers.

API Reference

Upload Files

  POST /upload

Parameter	Type	Description
`files`	`file[]`	Required. The PDF files to be uploaded.

Query RAG

    POST /query

Parameter	Type	Description
`model`	`string`	Required. The model to use ('google' or 'llama').
`query`	`string`	Required. The question you want to ask the chatbot.

Tech Stack

Client: HTML, CSS, JavaScript

Server: Python, Flask

Database: MongoDB Atlas (Vector Database)

Embeddings: GoogleGenerativeAIEmbeddings (models/embedding-001)

Reranking: Cohere Rerank Model (rerank-english-v3.0)

LLM Models: Gemini-1.5-Flash-001-Tuning, Llama3-8b-8192

Features

Multiple PDF Uploads: Allows users to upload and process multiple PDF files simultaneously.
Customizable Query Models: Offers two Large Language Model (LLM) options—Gemini-1.5-Flash-001-Tuning and Llama3-8b-8192—for querying.
Advanced Embedding and Reranking: Utilizes GoogleGenerativeAIEmbeddings for creating embeddings and the Cohere Rerank model for improving query relevance.
Responsive Web Interface: Built with HTML, CSS, JavaScript, and Flask, ensuring a user-friendly experience across different devices.
Efficient Vector Search: Powered by MongoDB Atlas for fast and accurate similarity searches within large document collections.

Environment Variables

To run this project, you will need to add the following environment variables to your .env file

MONGODB_ATLAS_CLUSTER_URI: Your MongoDB Atlas Cluster URI

GOOGLE_API_KEY: API key for Google Generative AI

GROQ_API_KEY: API key for Groq

COHERE_API_KEY: API key for Cohere

Deployment

Create a Cluster on MongoDB Atlas:

Sign in to your MongoDB Atlas account.
Create a new cluster.
Once your cluster is set up, create a new database and collection that will be used for vector search.

Create a Vector Search Index:

Navigate to your MongoDB Atlas cluster and select your database and collection.
Create an index with the following configuration:

  {
  "fields": [
    {
      "numDimensions": 768,
      "path": "embedding",
      "similarity": "cosine",
      "type": "vector"
    },
    {
      "path": "source",
      "type": "filter"
    }
  ]
}

Allow Network Access:

Go to the "Network Access" section of MongoDB Atlas.
Add your current IP address to the IP whitelist to allow access to the database.

Set Up Environment Variables:

Create a .env file in the root of your project.
Add the following environment variables:

MONGODB_ATLAS_CLUSTER_URI=your_mongodb_atlas_uri
GOOGLE_API_KEY=your_google_api_key
GROQ_API_KEY=your_groq_api_key
COHERE_API_KEY=your_cohere_api_key

Install Dependencies:

Make sure you have all necessary dependencies installed by running:

pip install -r requirements.txt

Run the Application:

Start the Flask application by running:

python app.py

Screenshots

Acknowledgements

🚀 About Me

My name is Tarpit, and I completed my B.Tech in Computer Engineering with a specialization in AI and Machine Learning in 2024. I'm passionate about developing intelligent systems and leveraging AI to solve complex problems. This project is a reflection of my interest in combining cutting-edge technologies like RAG, MongoDB Atlas, and advanced LLMs to create powerful and user-friendly applications.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
base		base
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Chatbot

API Reference

Upload Files

Query RAG

Tech Stack

Features

Environment Variables

Deployment

Screenshots

Acknowledgements

🚀 About Me

🔗 Links

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAG Chatbot

API Reference

Upload Files

Query RAG

Tech Stack

Features

Environment Variables

Deployment

Screenshots

Acknowledgements

🚀 About Me

🔗 Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages