Python vector-database

Open-source Python projects categorized as vector-database

Top 23 Python vector-database Projects

vector-database
  1. llama_index

    LlamaIndex is the leading framework for building LLM-powered agents over your data.

    Project mention: How to Build a RAG Solution with Llama Index, ChromaDB, and Ollama | dev.to | 2025-11-04

    Step 2: Set up LlamaIndex and Chroma DB

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. mem0

    Universal memory layer for AI Agents

    Project mention: Show HN: A file-based agent memory framework that works like skill | news.ycombinator.com | 2026-01-06
  4. txtai

    💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

    Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26

    GitHub: https://github.com/neuml/txtai

  5. cognee

    Memory for AI Agents in 6 lines of code

    Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26

    URLs: https://github.com/topoteretes/cognee (hosted at cognee.ai / Cogwit)

  6. LEANN

    RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

    Project mention: First lightweight local semantic search MCP for Claude Code | news.ycombinator.com | 2025-08-15

    @Berkeley SkyLab, we’re the first to bring semantic search to Claude Code with a fully local index in a novel, lightweight structure — check it out at LEANN(https://github.com/yichuan-w/LEANN).

  7. deep-searcher

    Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

    Project mention: Deep Searcher, Open source deep researcher on your private data | news.ycombinator.com | 2025-02-21

    github https://github.com/zilliztech/deep-searcher

  8. airweave

    Context retrieval for AI agents across apps and databases

    Project mention: Launch HN: Airweave (YC X25) – Let agents search any app | news.ycombinator.com | 2025-09-30
  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. pixeltable

    Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

    Project mention: Stop Gluing Data Infrastructure Tools: Build Multimodal AI Workloads and Application with One Declarative Python SDK | dev.to | 2025-07-06

    Star us on GitHub: https://github.com/pixeltable/pixeltable

  11. raptor

    The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

    Project mention: Graph RAG의 모든 것 | dev.to | 2025-04-20

    3.2. RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval (Stanford Univ, 2024)

  12. pymilvus

    Python SDK for Milvus Vector Database

  13. SeaGOAT

    local-first semantic code search engine

  14. qdrant-client

    Python client for Qdrant vector search engine

  15. OpenContracts

    Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt playground, and more!

  16. mcp-memory-service

    Stop re-explaining your project to AI every session. Automatic context memory for Claude, VS Code, Cursor, and 13+ AI tools.

    Project mention: Supercharging Productivity with Cursor AI: A React Developer's Guide to MCP Servers and JSON Prompts | dev.to | 2025-04-17

    Key Takeaway: cursor10x-mcp and Repomix excel for speed and context. MCP Memory Service is great for quick wins, and Pieces organizes prompts. But tools alone don’t cut it—prompts are the real magic.

  17. NeumAI

    Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

  18. rag-demystified

    An LLM-powered advanced RAG pipeline built from scratch

  19. llmflows

    LLMFlows - Simple, Explicit and Transparent LLM Apps

  20. vectordb

    A Python vector database you just need - no more, no less. (by jina-ai)

  21. langchain-chatbot

    AI Chatbot for analyzing/extracting information from data in conversational format.

  22. GradCache

    Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

  23. redis-vl-python

    Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.

  24. vector-db-benchmark

    Framework for benchmarking vector search engines

  25. vicinity

    Lightweight Nearest Neighbors with Flexible Backends (by MinishLab)

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python vector-database discussion

Log in or Post with

Python vector-database related posts

  • The Database Zoo: Vector Databases and High-Dimensional Search

    5 projects | dev.to | 25 Nov 2025
  • Search Types in Cognee

    1 project | dev.to | 20 Oct 2025
  • Cognee: Building the Next Generation of Memory for AI Agents (OSS)

    1 project | dev.to | 17 Oct 2025
  • Launch HN: Airweave (YC X25) – Let agents search any app

    1 project | news.ycombinator.com | 30 Sep 2025
  • Show HN: Vectorless RAG

    6 projects | news.ycombinator.com | 27 Aug 2025
  • Show HN: Airweave – Let agents search any app

    1 project | news.ycombinator.com | 12 May 2025
  • Ingest (almost) any non-PDF document in a vector database, effortlessly

    4 projects | dev.to | 25 Apr 2025
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 7 Jan 2026
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source vector-database projects in Python? This list will help you:

# Project Stars
1 llama_index 46,198
2 mem0 44,973
3 txtai 11,978
4 cognee 10,753
5 LEANN 7,977
6 deep-searcher 7,282
7 airweave 5,523
8 pixeltable 1,526
9 raptor 1,485
10 pymilvus 1,320
11 SeaGOAT 1,242
12 qdrant-client 1,191
13 OpenContracts 1,120
14 mcp-memory-service 1,045
15 NeumAI 861
16 rag-demystified 854
17 llmflows 706
18 vectordb 632
19 langchain-chatbot 437
20 GradCache 420
21 redis-vl-python 361
22 vector-db-benchmark 344
23 vicinity 324

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?

Morty Proxy This is a proxified and sanitized view of the page, visit original site.