Skip to content

Add advanced local RAG notebook with ChromaDB, FastEmbed, and Ollama#272

Open
pm32900 wants to merge 3 commits intodeepset-ai:mainfrom
pm32900:add-advanced-rag-chroma-ollama
Open

Add advanced local RAG notebook with ChromaDB, FastEmbed, and Ollama#272
pm32900 wants to merge 3 commits intodeepset-ai:mainfrom
pm32900:add-advanced-rag-chroma-ollama

Conversation

@pm32900
Copy link
Copy Markdown

@pm32900 pm32900 commented Feb 21, 2026

Features:

  • Persistent vector store with ChromaDB
  • Fast local embeddings with FastEmbed (BAAI/bge-small-en-v1.5)
  • Local LLM generation with Ollama (llama3.2:3b)
  • Query decomposition into sub-queries
  • Reranking with cross-encoder (ms-marco-MiniLM-L-6-v2)
  • Retrieval evaluation: Hit-Rate@5 and MRR@5
  • Failure mode demonstrations (out-of-domain, hallucination resistance)

@pm32900 pm32900 requested a review from a team as a code owner February 21, 2026 18:16
@review-notebook-app
Copy link
Copy Markdown

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@kacperlukawski kacperlukawski self-requested a review February 24, 2026 11:08
Copy link
Copy Markdown
Member

@kacperlukawski kacperlukawski left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hello @pm32900 Thanks for contributing with a cookbook!

I really liked the idea of query decomposition, and reranking, but in general feel the cookbook is too code-heavy to be merged as it is. We typically try to guide readers between the code cells so they can understand what will happen next. Would you be so kind and modify the notebook so it starts with some introduction describing the target system. and then introduce new concepts one by one? Happy to help with that process!

@pm32900
Copy link
Copy Markdown
Author

pm32900 commented Feb 24, 2026

Hi @kacperlukawski, thanks for the feedback! I've addressed this in the latest commit, the notebook now has detailed narrative markdown cells before each code section, introducing the concept and explaining the why before the how. The intro also includes a full system architecture diagram. Would love your thoughts!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants