Engram · Fast path

Engram

The fast path in front of your expensive calls.

Engram is a lightweight retrieval layer that sits between a query and the expensive work behind it: an LLM call, a database hit, an external API. When a query matches something Engram already holds, it returns that answer immediately and the costly call never happens. It is keyword-indexed, learns which answers actually get used, and is small and transparent by design.

Open source · Apache-2.0 · available now

View on GitHub How it works

The problem

The same question, paid for twice.

Conversational systems ask the same things over and over, and every repeat pays full price: another inference, another query, another round trip.

The usual fix is a cache, but a naive cache matches exact strings and breaks on the first reworded question. A vector store answers every near-miss with an opaque similarity score you cannot read and cannot tune by hand.

You want the speed of a cache with retrieval you can actually reason about.

What it does

It answers before the expensive call runs.

Engram sits in front of expensive computation. A sufficient match returns stored knowledge and skips the downstream call entirely.

Matches on keyword overlap, so retrieval is fast and predictable, with lemmatization, stemming, and synonym fallbacks that catch reworded questions an exact-match cache would miss.
Learns from use. Hit-rate tracking records which stored answers actually get retrieved, a signal that sharpens retrieval over time and surfaces dead weight to prune.
Separates the permanent from the disposable. Protected statements stay put; evictable ones age out under a capacity bound, so durable facts survive while transient ones clear.
Carries a conversation. Sessions expand context across turns, so a follow-up like what is its population resolves against what came before.
Persists as plain JSON. Save, load, inspect, and move state without ceremony.

Why it’s different

Retrieval you can read.

A vector store hides its judgment inside an embedding distance you have to trust. Engram is deliberately legible: the reason it returned an answer is something you can read, predict, and tune by hand.

Legible

Keyword retrieval you can read and tune, not a black-box embedding distance.

Learns from use

Sharpens on what actually gets retrieved, rather than guessing at relevance.

Stays small

A library and a command-line tool you run yourself, not a service to stand up.

You own it

It runs where you run.

Engram is a library and a CLI, not a service you have to host. State persists as plain JSON you can inspect, version, and move. Nothing leaves your environment, and there is nothing to stand up. It is Apache-2.0 licensed, so you can read the source, fork it, and ship it.

Available now · Apache-2.0

Put it in front of your next call.

Engram is open source and usable as a library or from the command line. Read the source, run it yourself, and skip the calls you have already paid for.

View on GitHub