Live Pipeline
Watch it run.
Every request flows through composable Python nodes — no magic, no black boxes.
Each node is plain Python. Swap, extend, or debug any step — no black boxes.
Architecture
Six composable layers.
Use any. Use all.
The Problem
Existing frameworks are bloated mazes.
You deserve better.
50+ dependencies. 200 MB install. For what?
Most LLM frameworks pull in half of PyPI. Every import is a surprise. SynapseKit needs only numpy and rank-bm25.
Async was bolted on. Not designed in.
Partial async support is unpredictable and fragile. SynapseKit is async/await native at every layer — no sync wrapper surprises.
Cost tracking sold separately as a SaaS product.
Observability shouldn't require a subscription. SynapseKit tracks cost, tokens, and latency out of the box — locally.
The Solution
SynapseKit does the same in 10 lines.
Plain Python. No magic classes. No global state. Just functions you can read, debug, and extend.
Full async/await throughout — no sync/async mismatch
Token-level streaming from every provider
Swap model or provider in one line
Cost tracking on every call, no SaaS needed
No hidden chains. Every step is plain Python.
Why SynapseKit
Every choice made deliberately.
Comparison
How we stack up.
| Feature | SynapseKit | LangChain | LlamaIndex |
|---|---|---|---|
| Hard dependencies | 2 | 50+ | 20+ |
| Install size | ~5 MB | ~200 MB+ | ~100 MB+ |
| Async-native | ✅ Default | ⚠️ Partial | ⚠️ Partial |
| Streaming | ✅ Default | ⚠️ Varies | ⚠️ Varies |
| Cost tracking | ✅ Built-in | ❌ SaaS add-on | ❌ No |
| Evaluation / EvalCI | ✅ CLI + GitHub Action | ❌ SaaS add-on | ✅ Built-in |
| Graph workflows | ✅ Built-in | ✅ Separate package | ❌ No |
| Agent federation | ✅ Built-in | ❌ No | ❌ No |
| Reasoning LLMs | ✅ Unified adapter | ⚠️ Manual | ⚠️ Manual |
| License | Apache 2.0 | MIT | MIT |
Capabilities
Everything you need. Nothing you don't.
53
Loaders
11
Vector Stores
3
Retrieval Modes
- →
53 loaders — PDF, Word, YouTube, S3, Notion, HubSpot, BigQuery, Salesforce, Airtable, Obsidian, and more
- →
11 vector stores — Chroma, Pinecone, Weaviate, pgvector, Redis, MongoDB Atlas, SQLiteVec, and more
- →
Hybrid search — semantic vector search + multi-hop Knowledge Graph retrieval in one call
- →
Built-in RAG evaluation — cost/benefit tracking, Slack alert sinks, per-call scoring
- →
Anthropic prompt caching via SmartContextManager — cuts costs on repeated context
EvalCI
Ship with confidence.
Gate quality on every PR.
EvalCI is a GitHub Action that runs your LLM evaluation suite on every pull request — before anything merges. Catch regressions automatically, not manually.
- ✓Define eval cases with @eval_case decorator
- ✓Compare every PR against a baseline model output
- ✓Block merges when quality drops below threshold
- ✓Track factual accuracy, faithfulness, cost per query
Ecosystem
Your entire stack, already supported.
33 LLM providers behind one unified API. Swap without rewriting a line.
Load from anywhere. Get Documents everywhere.
Every loader returns the same Document object — whether it's a PDF, a YouTube video, a Salesforce export, or a BigQuery table. Your pipeline never needs to change.
Start local. Go prod. Zero rewrites.
Chroma for your laptop, Pinecone for production, pgvector for your existing Postgres — all behind one interface. Change one line, not your entire codebase.
One decorator. Real-world actions.
Decorate any function with @tool and your agent can call it. Browser automation, SQL queries, GitHub PRs, Slack messages — all wired up and production-tested.
Agents that remember across sessions.
Episodic memory stores what happened. Semantic memory stores what matters. Both work across SQLite, Redis, and Postgres — start in-memory, scale to Redis in one line.
Install
Start in seconds.
Install only what you need. Extras are truly optional.
Full options → installation docs
Documentation