Does Agent Brain send my code or memories to a cloud service?

No. Agent Brain is self-hosted. Core memory and recall operations run on your machine or infrastructure.

Which agents can use it?

Agent Brain works with supported coding tools and local integrations.

The semantic cache checks whether a similar query has already been answered.
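As a rough illustration of that check, here is a minimal sketch and not Agent Brain's actual implementation. It assumes queries have already been turned into embedding vectors by some model and that similarity is measured with cosine similarity. The names `SemanticCache`, `store`, and `lookup` are hypothetical and chosen only for this example.

```python
import math
from dataclasses import dataclass, field


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class SemanticCache:
    """Hypothetical in-memory cache keyed by query embeddings."""
    threshold: float = 0.85  # mirrors the documented default threshold
    entries: list = field(default_factory=list)  # list of (embedding, answer) pairs

    def store(self, embedding, answer):
        """Remember an answer under the embedding of the query that produced it."""
        self.entries.append((list(embedding), answer))

    def lookup(self, embedding):
        """Return (answer, score) for the closest entry at or above the threshold, else None."""
        best_answer, best_score = None, -1.0
        for cached_embedding, answer in self.entries:
            score = cosine_similarity(embedding, cached_embedding)
            if score > best_score:
                best_answer, best_score = answer, score
        if best_score >= self.threshold:
            return best_answer, best_score
        return None


# Toy 2-dimensional vectors stand in for real embeddings.
cache = SemanticCache()
cache.store([1.0, 0.0], "cached context about authentication")
hit = cache.lookup([0.9, 0.1])   # close in direction to the stored vector
miss = cache.lookup([0.0, 1.0])  # orthogonal to the stored vector
```

In this sketch a lookup that falls below the threshold returns `None`, which leaves the caller free to run the full query and store the result afterwards.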

When a query clears the similarity threshold, a client can:

AI coding workflows often repeat the same questions with different wording. A semantic cache helps by:

Examples:

These can all point to the same cached context when the underlying meaning is close enough.

The similarity threshold is configured in the Agent Brain environment. The README documents CACHE_SIMILARITY_THRESHOLD=0.85 as the default.
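For a script or client that wants to mirror this setting, the sketch below reads the variable and falls back to the documented default. How Agent Brain itself parses or validates the value is not stated here, so the helper name `load_similarity_threshold` and the 0.0 to 1.0 range check are assumptions made for illustration.

```python
import os

DEFAULT_THRESHOLD = 0.85  # default documented in the README


def load_similarity_threshold(env=None):
    """Read CACHE_SIMILARITY_THRESHOLD from a mapping (os.environ by default).

    Falls back to the documented default when the variable is unset or blank.
    The 0.0-1.0 range check is an assumption, not documented behavior.
    """
    env = os.environ if env is None else env
    raw = env.get("CACHE_SIMILARITY_THRESHOLD")
    if raw is None or raw.strip() == "":
        return DEFAULT_THRESHOLD
    value = float(raw)  # raises ValueError for non-numeric input
    if not 0.0 <= value <= 1.0:
        raise ValueError(
            f"CACHE_SIMILARITY_THRESHOLD must be between 0.0 and 1.0, got {value}"
        )
    return value


# Passing a plain dict keeps the example independent of the real environment.
default_value = load_similarity_threshold({})
custom_value = load_similarity_threshold({"CACHE_SIMILARITY_THRESHOLD": "0.9"})
```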

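Tradeoff: a higher threshold yields fewer cache hits but closer matches; a lower threshold yields more hits but a greater chance of returning context that does not fit the query.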

Use the desktop Cache Monitor to watch: