Note: rag.py is adapted from Underfitted's demo posted here: https://github.com/svpino/gentle-intro-to-rag/blob/main/rag.ipynb
This project demonstrates a simple RAG application that answers questions based on content from a plain text file ("grimmstales.txt") using a local embedding API and the Llama 3.1 model.
- Activate the virtual environment:
  `source .venv/bin/activate` (Linux/macOS) or `.\.venv\Scripts\activate` (Windows)
- Install dependencies:
  `pip install -r requirements.txt`
- Make sure you have Ollama or LM Studio installed and running with the Llama 3.1 model:
  `ollama pull llama3.1` (or use LM Studio with the appropriate model loaded)
- You'll need a text file named `grimmstales.txt` in the same directory (this demo uses the public domain "Grimms' Fairy Tales").
- You can use any large text file by changing the `TEXT_FILE` variable in `rag.py` (see the note below).
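For illustration, assuming `TEXT_FILE` is a simple module-level constant near the top of `rag.py` (the actual definition may differ), swapping documents is a one-line change:

```python
# In rag.py (hypothetical form of the constant; check the actual file):
TEXT_FILE = "grimmstales.txt"  # swap in the path to any other large plain text file
```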
Make sure your virtual environment is activated, then run:
`python rag.py`

This script:
- Loads a plain text document ("grimmstales.txt")
- Splits it into manageable chunks
- Creates vector embeddings for each chunk using a local embedding API (compatible with LM Studio or similar)
- Stores the embeddings in a FAISS vector store
- Sets up a retrieval system to find relevant chunks based on questions
- Configures an LLM (Llama 3.1 via Ollama or LM Studio) to generate answers
- Combines everything into a RAG pipeline that answers questions based on the text file content (see the sketch after this list)
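For orientation, the steps above correspond roughly to the LangChain-style sketch below. It is illustrative only, not a copy of `rag.py`: the package names (`langchain-community`, `langchain-openai`, `langchain-ollama`, `langchain-text-splitters`, `faiss-cpu`), the embedding endpoint (`http://localhost:1234/v1`, LM Studio's default), the embedding model name, and the chunking parameters are all assumptions; adjust them to match your local setup and the actual script.

```python
# Illustrative sketch of the pipeline described above; see rag.py for the real code.
from langchain_community.document_loaders import TextLoader
from langchain_community.vectorstores import FAISS
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.runnables import RunnablePassthrough
from langchain_ollama import ChatOllama
from langchain_openai import OpenAIEmbeddings
from langchain_text_splitters import RecursiveCharacterTextSplitter

TEXT_FILE = "grimmstales.txt"

# Load the plain text document and split it into manageable chunks.
documents = TextLoader(TEXT_FILE, encoding="utf-8").load()
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents(documents)

# Embed each chunk via a local OpenAI-compatible embedding API and index with FAISS.
embeddings = OpenAIEmbeddings(
    base_url="http://localhost:1234/v1",  # LM Studio's default server address (assumption)
    api_key="not-needed",                 # local servers typically ignore the key
    model="nomic-embed-text",             # placeholder: whichever embedding model is loaded
    check_embedding_ctx_length=False,     # skip OpenAI-specific token counting
)
vectorstore = FAISS.from_documents(chunks, embeddings)
retriever = vectorstore.as_retriever()

# Configure the LLM (Llama 3.1 via Ollama) and wire retrieval + generation together.
llm = ChatOllama(model="llama3.1")
prompt = ChatPromptTemplate.from_template(
    "Answer the question using only the context below.\n\n"
    "Context:\n{context}\n\nQuestion: {question}"
)

def format_docs(docs):
    # Join the retrieved chunks into a single context string for the prompt.
    return "\n\n".join(doc.page_content for doc in docs)

chain = (
    {"context": retriever | format_docs, "question": RunnablePassthrough()}
    | prompt
    | llm
    | StrOutputParser()
)

print(chain.invoke("Which tale features a character who can spin straw into gold?"))
```

Splitting the text into overlapping chunks keeps each embedding focused on a short passage while the overlap preserves context across chunk boundaries, which helps retrieval return coherent passages.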
Here are some example questions you can use to demo the RAG system with "Grimms' Fairy Tales" (a usage sketch follows the list):
- What lesson does the story of "Hansel and Gretel" teach about resourcefulness?
- Which tale features a character who can spin straw into gold?
- In "The Twelve Dancing Princesses," how do the princesses manage to escape each night?
This script was converted from a Jupyter notebook created by Santiago Valdarrama (svpino): https://github.com/svpino/gentle-intro-to-rag/blob/main/rag.ipynb