A simple Retrieval-Augmented Generation (RAG) pipeline for answering questions based on website content. This project combines retrieval of relevant website information with generative models to deliver contextually accurate answers.
- Website Content Parsing: Extracts and indexes content from a specified website URL.
- Retrieval-Augmented Generation (RAG): Combines vector-based retrieval with LLM generation to answer questions.
- Streamlit Interface: User-friendly web interface for inputting URLs and questions.
- Orchestration: LangChain
- Guardrails: NeMo-Guardrails
- Monitoring: LangSmith
- Embeddings: Hugging Face models
- Vector Database: Pinecone
- Generation: Llama 3 (served via Groq)
- Deployment: Render
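The stack above implies a `requirements.txt` roughly like the following. The package names are the usual PyPI names for these tools, but the exact contents and version pins in this project are assumptions:

```text
streamlit
langchain
nemoguardrails
langsmith
pinecone
groq
sentence-transformers
```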
- Data Loading: Accepts various data formats such as PDF, CSV, or URL links.
- Chunking: Uses text splitters to divide content into manageable chunks.
- Embedding: Generates embeddings from text using Hugging Face models.
- Vector Store: Stores embeddings in Pinecone for efficient retrieval.
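The ingestion steps above can be sketched with a minimal, dependency-free chunking function. This is a toy stand-in for the chunking stage only: the actual pipeline uses LangChain text splitters, Hugging Face embeddings, and Pinecone, and the function name and parameters here are illustrative assumptions.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into overlapping character chunks.

    Overlap preserves context across chunk boundaries, which helps
    retrieval return chunks that still make sense on their own.
    """
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks
```

Each chunk would then be embedded and upserted into the vector store; the splitter's `chunk_size`/`overlap` trade-off mirrors the parameters LangChain's text splitters expose.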
- User Query (Input): Receives the user's question.
- Input Check: Uses Guardrails (NeMo-Guardrails) to ensure the question is within the scope of the dataset.
- Retriever: Retrieves relevant chunks from Pinecone based on the user’s query.
- Generator: Llama 3 (served via Groq) generates a response grounded in the retrieved chunks.
- Monitoring: LangSmith traces responses to help ensure output quality.
- Output: The final answer is presented to the user.
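The query-time flow above can be sketched with a toy in-memory retriever. The real pipeline embeds the query with Hugging Face models and searches Pinecone; the bag-of-words cosine scoring below is only an illustrative stand-in, and all names are assumptions.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; the real pipeline uses
    # dense vectors from Hugging Face models.
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    # Rank stored chunks by similarity to the query and return the
    # top k — the role Pinecone plays in the real pipeline.
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]
```

The top-k chunks would then be passed as context to the generator, with the guardrails check gating the query before retrieval runs.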
- Python 3.9+
- Install dependencies:

  ```shell
  pip install -r requirements.txt
  ```

- Start the app:

  ```shell
  streamlit run app.py
  ```
- Enter a Website URL in the sidebar.
- Ask Questions based on the website content.
- The RAG pipeline retrieves relevant information and generates answers.