POST /v1/memories/search
Hybrid semantic search scores memories using both vector embeddings and full-text search, then fuses the two ranked lists with RRF to produce a single ordered result set. This gives strong recall across a wide variety of query styles — natural language, short keywords, and mixed phrasing — and is the recommended default for most agent retrieval workflows.Request
POST /v1/memories/search
The search query. Can be a natural language question, a short keyword phrase, or any text you want to match against stored memory content.
Restricts search to memories that belong to the specified scope. At least one of the following keys must be present:
Maximum number of memories to return. Results are ordered by descending relevance score.
Example Request
Response
A ranked list of memory objects, ordered by descending relevance score.
The total number of memories in scope that matched the query, before the
limit was applied.Example Response
POST /v1/memories/search/fast
Fast search skips the keyword-scoring pass and queries only the vector index, returning the nearest neighbours by embedding similarity. This reduces query latency compared to hybrid search. Use it when your application is latency-sensitive and your queries are well-represented by semantic similarity alone (e.g., dense natural language questions rather than sparse keyword lookups).Request
The request body is identical to/v1/memories/search. The same query, scope, and optional limit fields apply.
Example Request
Response
The response format is identical to/v1/memories/search: a memories array of ranked memory objects and a total count.
- Hybrid search
- Fast search
Use
POST /v1/memories/search when you want the highest relevance accuracy. Best for standard agent retrieval, RAG pipelines, and mixed natural-language and keyword queries.