local-first semantic code search engine
-
Updated
Mar 12, 2026 - Python
local-first semantic code search engine
Semantic QA with a markdown database: Query any markdown file using vector embedding, Pinecone vector database and GPT (langchain). A weaker version of privateGPT
DImensionality REduction in JAX
Automatos AI: Open-source platform for advanced context engineering and multi-agent orchestration in enterprise automation. Built on frontier research in RAG, vector embeddings, cognitive tools, emergent symbols, and neural field theory—powered by FastAPI, Next.js, and PostgreSQL.
Python scripts that converts PDF files to text, splits them into chunks, and stores their vector representations using GPT4All embeddings in a Chroma DB. It also provides a script to query the Chroma DB for similarity search based on user input.
V3CTRON | Vector Embeddings Data Retrieval | ChatGPT Plugin
A tool for performing semantic search within pdf documents leveraging sentence transformers.
Django AI Core provides developer-focused AI features for implementing AI tooling in to Django sites.
Flask API for generating text embeddings using OpenAI or sentence_transformers
Match celebrity users with their respective tweets by making use of Semantic Textual Similarity on over 900+ celebrity users' 2.5 million+ crawled tweets utilizing SBERT, streamlit, tweepy and FastAPI
Semantic Art Search – Explore art through meaning-driven search
Learning semantic embeddings from OSM data: A Pytorch implementation of the loc2vec general method outlined in: https://sentiance.com/loc2vec-learning-location-embeddings-w-triplet-loss-networks.
MCP server combining Qdrant vector search, Neo4j knowledge graphs, and Crawl4AI web intelligence with agentic RAG capabilities. FastMCP 2.0 architecture with enterprise security, monitoring, and Kubernetes deployment. AI/ML engineering powerhouse.
Python library that provides tools for ML inference, indexing, semantic search, clustering, classification and efficient batch processing.
Python Client SDK for CyborgDB: The Confidential Vector Database
The SEO Content Analyzer is a sophisticated Python script designed to perform in-depth semantic analysis of content for SEO purposes.
A comprehensive RAG FastAPI service that handles document uploads and retrievals, built with Python. Uses PyMuPDF for document processing, turbopuffer for vector storage, OpenAI for models, and cohere for reranking.
Nicolay is a digital history experiment that uses artificial intelligence to explore the speeches of Abraham Lincoln.
A Semantic A* Pathfinding agent that navigates Wikipedia using high-dimensional vector space. Built with Python, BeautifulSoup4, and Sentence-Transformers to bridge unrelated concepts through semantic context rather than just keywords.
A resilient, fault‑tolerant telemetry analytics pipeline designed to validate, benchmark, and stress‑test high‑frequency sensor data streams under real‑world failure conditions. Includes chaos testing, DLQ repair, GPU‑accelerated ingestion, and end‑to‑end reliability validation for motorsport‑grade telemetry environments.
Add a description, image, and links to the vector-embeddings topic page so that developers can more easily learn about it.
To associate your repository with the vector-embeddings topic, visit your repo's landing page and select "manage topics."