目录
目录README.md

VectorDB-recipes


Dive into building GenAI applications! This repository contains examples, applications, starter code, & tutorials to help you kickstart your GenAI projects.
  • These are built using LanceDB, a free, open-source, serverless vectorDB that requires no setup.
  • It integrates into python data ecosystem so you can simply start using these in your existing data pipelines in pandas, arrow, pydantic etc.
  • LanceDB has native Typescript SDK using which you can run vector search in serverless functions!

Join our community for support - DiscordTwitter

This repository is divided into 2 sections:

  • Examples - Get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes!
  • Applications - Ready to use Python and web apps using applied LLMs, VectorDB and GenAI tools

Examples

Applied examples that get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes! Examples are available as:

  • Colab notebooks - that builds the application is stages allowing you to investigate results at every intermediate stage.
  • Python scripts - for cases where you’d like directly to use the file or snippets to integrate in your application
  • JS/TS scripts - Some examples are written using lancedb’s native js library! These script/snippets can also be directly integrated in your web applications.

The following examples are organized into different tables to make similar types of examples easily accessible.

Build from Scratch     Interactive Notebook & Scripts  
Build RAG from Scratch Open In Colab LLM beginner
Local RAG from Scratch with Llama3 Python local LLM beginner
Multimodal     Interactive Notebook & Scripts   Blog
Multimodal CLIP: DiffusionDB Open In Colab Python LLM beginner Ghost
Multimodal CLIP: Youtube videos Open In Colab Python LLM beginner Ghost
Multimodal Image + Text Search Open In Colab Python LLM intermediate Ghost
RAG     Interactive Notebook & Scripts   Blog
Improve RAG with Re-ranking Open In Colab LLM beginner Ghost
Instruct-Multitask Open In Colab Python LLM beginner Ghost
Improve RAG with HyDE Open In Colab LLM intermediate Ghost
Improve RAG with LOTR Open In Colab LLM intermediate Ghost
Advanced RAG: Parent Document Retriever Open In Colab LLM intermediate Ghost
Corrective RAG with Langgraph Open In Colab LLM intermediate Ghost
Contextual-Compression-with-RAG Open In Colab local LLM intermediate Ghost
Improve RAG with FLARE Open In Colab local LLM LLM advanced Ghost
Query Expansion and Reranker Open In Colab LLM advanced Ghost
RAG Fusion Open In Colab LLM advanced
Vector Search     Interactive Notebook & Scripts   Blog
Inbuilt Hybrid Search Open In Colab LLM beginner
Hybrid search BM25 & lancedb Open In Colab LLM beginner Ghost
NER powered Semantic Search Open In Colab local LLM beginner Ghost
Audio Search Open In Colab Python LLM beginner
Multi-lingual search Open In Colab Python LLM beginner
Facial Recognition Open In Colab beginner
Sentiment Analysis : Analysing Hotel Reviews Open In Colab local LLM beginner Ghost
Imagebind demo app hf spaces intermediate
Search Within Images Open In Colab local LLM intermediate Ghost
Vector Search with TransformersJS JS LLM advanced
Accelerate Vector Search Applications Using OpenVINO Open In Colab local LLM advanced Ghost
Chatbot     Interactive Notebook & Scripts   Blog  
Databricks DBRX Website Bot Python Databricks LLM beginner
CLI-based SDK Manual Chatbot with Phidata Python local LLM beginner
Youtube transcript search bot Open In Colab Python JS LLM intermediate
Langchain: Code Docs QA bot Open In Colab Python JS LLM intermediate
Context-Aware Chatbot using Llama 2 & LanceDB Open In Colab local LLM advanced Ghost
Evaluation     Interactive Notebook & Scripts   Blog
Evaluating Prompts with Prompttools Open In Colab LLM local LLM advanced
Evaluating RAG with RAGAs Open In Colab LLM intermediate
AI Agents     Interactive Notebook & Scripts   Blog
AI Agents: Reducing Hallucination Open In Colab Python JS LLM advanced Ghost
AI Trends Searcher with CrewAI Open In Colab LLM beginner Ghost
SuperAgent Autogen Open In Colab LLM intermediate
Recommender Systems     Interactive Notebook & Scripts   Blog
Movie Recommender Open In Colab Python beginner
Movie Recommender with Genre Open In Colab beginner Ghost
Product Recommender Open In Colab Pythonintermediate
Arxiv paper recommender Open In Colab Python LLM beginner
Concepts Interactive Notebook Blog Link
A Primer on Text Chunking and its Types Open In Colab beginner Ghost
Langchain LlamaIndex Chunking Open In Colab beginner Ghost
Comparing Cohere Rerankers with LanceDB beginner Ghost
Product Quantization: Compress High Dimensional Vectors intermediate Ghost
LLMs, RAG, & the missing storage layer for AI intermediate Ghost
Fine-Tuning LLM using PEFT & QLoRA Open In Colab local LLM advanced Ghost
Extracting Complex tables-text from PDFs using LlamaParse Open In Colab LLM LlamaCloud beginner

Projects & Applications

These are ready to use applications built using LanceDB serverless vector database. You can explore these open source projects, use parts of them in your projects or build your applications on top of these.

Project Name Description Screenshot
YOLOExplorer Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds YOLOExplorer
Website Chatbot (Deployable Vercel Template) Create a chatbot from the sitemap of any website/docs of your choice. Built using vectorDB serverless native javascript package. Chatbot
Chat with multiple URL/website Conversational AI for Any Website with Mistral,Bge Embedding & LanceDB webui_aa
Talk with Youtube Video using GPT4 Vision API Talk with Youtube Video using GPT4 Vision API and Langchain demo
Talk with Podcast Talk with Youtube Podcast using Ollama and insanely-fast-whisper demo
Talk with Wikipedia Talk with Wikipedia Pages demo
Talk with Github Talk with Github Codespaces using Qwen1.5 demo
Document Chat with Langroid Talk with your Documents using Langroid demo
Hr chatbot Hr chatbot - ask your personal query using zero-shot React agent & tools image
Advanced Chatbot with Parler TTS This Chatbot app uses Lancedb Hybrid search, FTS & reranker method with Parlers TTS library. image
Multi-Modal Search Engine Create a Multi-modal search engine app, to search images using both images or text Search
Multimodal Myntra Fashion Search Engine This app uses OpenAI’s CLIP to make a search engine that can understand and deal with both written words and pictures. image
Multilingual-RAG Multilingual RAG with cohere embedding & support 100+ languages image
Fastapi RAG template FastAPI based RAG template with Websocket support image
GTE MLX RAG mlx based RAG model using lancedb api support image
Healthcare Chatbot Healthcare chatbot using domain specific LLM & Embedding model image

🌟 New! 🌟 Applied GenAI and VectorDB course on Udacity Learn about GenAI and vectorDBs using LanceDB in the recently launched Udacity Course

Contributing Examples

If you’re working on some cool applications that you’d like to add to this repo, please open a PR!

    Gitlink(确实开源)
  • 加入我们
  • 官网邮箱:gitlink@ccf.org.cn
  • QQ群
  • QQ群
  • 公众号
  • 公众号

©Copyright 2023 CCF 开源发展委员会
Powered by Trustie& IntelliDE 京ICP备13000930号