AI Search & Reasoning Pipeline
Multi-signal search ranking that combines semantic similarity, keyword relevance, and LLM scoring into a single ranked result without sacrificing latency.
- Dual-search: Pinecone vector search + Supabase full-text search fused with Reciprocal Rank Fusion (RRF)
- Multi-LLM routing via LiteLLM — OpenAI, Gemini, and Voyage AI behind a single interface
- RAG pipeline with context caching, token counting (tiktoken), and prompt compression
- Intent classification and query reformulation before retrieval
- A/B testing framework for ranking algorithm variants