Crawl the web. Index it (BM25 + HNSW dense). Search with hybrid
retrieval, MMR diversification, time-decay. Answer and research with
cited sources via any OpenAI-compatible LLM. Everything in one Go binary —
cosift.
bm25 (lexical) · dense (HNSW cosine over per-passage embeddings) · hybrid (RRF-fuse the two)
hyde appends an LLM-hypothesized passage to the query · paraphrase fans out N rewrites and RRF-fuses
Cross-encoder reorder of the candidate pool. HTTP (Cohere / Voyage / Jina / TEI) or LLM listwise.
Maximal Marginal Relevance — λ ∈ [0,1] trades relevance against diversity from already-selected sources.
Exponential time-decay half-life in days — exp(-ln 2 · age / H). Hits without a publish date are unchanged.
The retrieval label is the composed pipeline, e.g.
bm25+dense:rrf+rerank:cohere+mmr:0.70+decay:30d. Each knob composes
on every retrieval/synth endpoint (/search, /answer,
/research, /find_similar).
Direct API access: /docs · /openapi.json ·
curl '/search?q=…&retriever=hybrid&mmr=0.7&decay=30'
Multi-turn chat with streaming answers and cited sources. RAG over the live index.
Open chat →