Filtering

Use filters to restrict search results to vectors that match specific conditions. All filter conditions are combined with logical AND: a vector must satisfy every condition to be returned.

Operators

Operator	Description	Example
$eq	Exact match	{“status”: {“$eq”: “published”}}
$in	Match any value in a list	{“tags”: {“$in”: [“ai”, “ml”]}}
$range	Numeric range (inclusive on both ends)	{“score”: {“$range”: [70, 95]}}

Python


# $eq — exact match
results = index.query(
    vector=[...], top_k=5,
    filter=[{"category": {"$eq": "tech"}}]
)
 
# $in — match any value in a list
results = index.query(
    vector=[...], top_k=5,
    filter=[{"tags": {"$in": ["ai", "ml", "nlp"]}}]
)
 
# $range — numeric range (inclusive)
results = index.query(
    vector=[...], top_k=5,
    filter=[{"score": {"$range": [80, 100]}}]
)
 
# combined — all conditions must be satisfied (logical AND)
results = index.query(
    vector=[...], top_k=5,
    filter=[
        {"category": {"$eq": "tech"}},
        {"score": {"$range": [80, 100]}}
    ]
)

TypeScript


// $eq
const results = await index.query({
  vector: [...], topK: 5,
  filter: [{ category: { $eq: 'tech' } }],
});
 
// $in
const results = await index.query({
  vector: [...], topK: 5,
  filter: [{ tags: { $in: ['ai', 'ml', 'nlp'] } }],
});
 
// $range
const results = await index.query({
  vector: [...], topK: 5,
  filter: [{ score: { $range: [80, 100] } }],
});
 
// combined
const results = await index.query({
  vector: [...], topK: 5,
  filter: [
    { category: { $eq: 'tech' } },
    { score: { $range: [80, 100] } },
  ],
});

Java


import io.endee.client.types.QueryOptions;
import io.endee.client.types.QueryResult;
 
// $eq — exact match
List<QueryResult> results = index.query(
    QueryOptions.builder()
        .vector(new double[384]).topK(5)
        .filter(List.of(Map.of("category", Map.of("$eq", "tech"))))
        .build()
);
 
// $in — match any value in a list
List<QueryResult> results = index.query(
    QueryOptions.builder()
        .vector(new double[384]).topK(5)
        .filter(List.of(Map.of("tags", Map.of("$in", List.of("ai", "ml", "nlp")))))
        .build()
);
 
// $range — numeric range (inclusive)
List<QueryResult> results = index.query(
    QueryOptions.builder()
        .vector(new double[384]).topK(5)
        .filter(List.of(Map.of("score", Map.of("$range", List.of(80, 100)))))
        .build()
);
 
// combined — all conditions must be satisfied (logical AND)
List<QueryResult> results = index.query(
    QueryOptions.builder()
        .vector(new double[384]).topK(5)
        .filter(List.of(
            Map.of("category", Map.of("$eq", "tech")),
            Map.of("score", Map.of("$range", List.of(80, 100)))
        ))
        .build()
);

Notes:

Operators are case-sensitive
Multiple conditions must all be satisfied (logical AND)

Filter Tuning

When using filtered queries, two optional parameters let you tune the trade-off between search speed and recall.

Prefilter Cardinality Threshold

Controls when the search strategy switches from HNSW filtered search to brute-force prefiltering.

When very few vectors match your filter, HNSW may struggle to find enough valid candidates through graph traversal. In that case, scanning the matched subset directly (prefiltering) is faster and more accurate.

Default: 10,000
Valid range: 1,000 – 1,000,000
Raising the threshold → prefiltering kicks in more often (favors exhaustive scan)
Lowering the threshold → HNSW graph search is used more (favors speed on large datasets)

Filter Boost Percentage

When using HNSW filtered search, candidates explored during graph traversal that fail the filter are discarded, which can leave you with fewer results than top_k. This parameter expands the internal candidate pool before filtering is applied to compensate.

Default: 0 (no boost)
Maximum: 100 (doubles the candidate pool)

Python


results = index.query(
    vector=[...],
    top_k=10,
    filter=[{"category": {"$eq": "rare"}}],
    prefilter_cardinality_threshold=5000,
    filter_boost_percentage=25,
)

TypeScript


const results = await index.query({
  vector: [...],
  topK: 10,
  filter: [{ category: { $eq: 'rare' } }],
  prefilterCardinalityThreshold: 5000,
  filterBoostPercentage: 25,
});

Java


List<QueryResult> results = index.query(
    QueryOptions.builder()
        .vector(new double[384])
        .topK(10)
        .filter(List.of(Map.of("category", Map.of("$eq", "rare"))))
        .prefilterCardinalityThreshold(5000)
        .filterBoostPercentage(25)
        .build()
);

Start with the defaults. If filtered queries return fewer results than expected, increase filter_boost_percentage. If filtered queries are slow on selective filters, lower the cardinality threshold.

Updating Filters

You can update the filter fields of existing vectors by providing their IDs and a new filter object.

Parameter	Required	Description
ID	Yes	ID of the vector to update
Filter	Yes	New filter object (replaces the existing filters entirely)

Python


index.update_filters([
    {"id": "doc1", "filter": {"category": "science", "year": 2024}},
    {"id": "doc2", "filter": {"category": "tech"}},
])

TypeScript


await index.updateFilters([
  { id: 'doc1', filter: { category: 'science', year: 2024 } },
  { id: 'doc2', filter: { category: 'tech' } },
]);

Java


import io.endee.client.types.UpdateFilterParams;
 
index.updateFilters(List.of(
    new UpdateFilterParams("doc1", Map.<String, Object>of("category", "science", "year", 2024)),
    new UpdateFilterParams("doc2", Map.of("category", "tech"))
));

Filter updates are destructive replacements. Any filter keys not included in the new filter object will be removed from the vector. There is no partial-merge option.