Vector Search
Use Vector Search from the SDK to enable AI integration, semantic search, and the use of RAG frameworks.
Vector Search has been available in Couchbase Capella Operational and self-managed Couchbase Server since version 7.6, using the Couchbase Search Service. Version 8.0 introduces vector queries using Global Secondary Indexes (GSI) in the Query Service, with a choice of a fast Hyperscale index or a Composite index that combines scalar predicates with semantic search.
For fast and scalable vector queries, use one of these two GSI options, detailed in the next section. If you do not need the speed and scale of vector queries with GSI, or you want to combine vector search with geo-spatial, range, and traditional fuzzy text searches, then consider Vector Search With the Search Service.
Vector Search With the Query Service and GSI
From the SDK, a vector query using GSI is the same as any other query. However, you will need to build one or more indexes.
Prerequisites
- Couchbase Server 8.0.0 or newer, or a recent Capella instance.
- Your chosen vector index: Hyperscale or Composite.
Refer to the Use Vector Indexes for AI Applications pages for a full discussion of using Vector Indexes with vector queries. In particular, you will need to create a Vector Index before running the queries below.
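As an illustration of that prerequisite, the sketch below creates a Hyperscale Vector Index with a SQL++ statement run through the SDK's query API. The index name, field name, and WITH parameters here are assumptions for this sketch only; the exact DDL and recommended settings are covered on the Use Vector Indexes for AI Applications pages.
# Illustrative sketch only: the DDL below is an assumption -- the exact
# CREATE VECTOR INDEX syntax, dimension, similarity metric, and other WITH
# settings are documented on the Use Vector Indexes for AI Applications pages.
create_index_statement = """
    CREATE VECTOR INDEX rgb_questions_vec_idx
    ON `vector-sample`.`color`.`rgb-questions`(question_vector VECTOR)
    WITH {"dimension": 1536, "similarity": "cosine", "description": "IVF,SQ8"}
"""

# Run the DDL; execute() runs the statement without streaming rows back.
cluster.query(create_index_statement).execute()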
Examples
The Use Vector Indexes for AI Applications pages contain examples using both Hyperscale and Composite indexes.
Here is the Hyperscale Index example, wrapped inside the Python SDK Query API.
from couchbase.options import QueryOptions

result = cluster.query(
    "SELECT d.id, d.question, d.wanted_similar_color_from_search, " +
    "  ARRAY_CONCAT( " +
    "    d.couchbase_search_query.knn[0].vector[0:4], " +
    "    ['...'] " +
    "  ) AS vector " +
    "FROM `vector-sample`.`color`.`rgb-questions` AS d " +
    "WHERE d.id = '#87CEEB';",
    QueryOptions(metrics=True))

for row in result.rows():
    print(f"Found match: {row}")
Parameterizing the query, as with regular queries, allows the prepared query plan to be reused. This is generally more efficient, unless each query needs individual tuning.
result = cluster.query(
    "SELECT d.id, d.question, d.wanted_similar_color_from_search, " +
    "  ARRAY_CONCAT( " +
    "    d.couchbase_search_query.knn[0].vector[0:4], " +
    "    ['...'] " +
    "  ) AS vector " +
    "FROM `vector-sample`.`color`.`rgb-questions` AS d " +
    "WHERE d.id = $id;",
    QueryOptions(named_parameters={'id': '#87CEEB'}))

for row in result.rows():
    print(f"Found match: {row}")
Vector Search With the Search Service
Vector search is also implemented using Search indexes, and can be combined with traditional full-text search queries. Vector embeddings can be supplied as an array of floats or as a base64-encoded string.
Prerequisites
Couchbase Server 7.6.0 (7.6.2 for base64-encoded vectors) — or a Capella Operational cluster.
Examples
Single vector query
In this first example we are performing a single vector query:
# NOTE: new imports needed for vector search
import couchbase.search as search
from couchbase.vector_search import VectorQuery, VectorSearch

vector_search = VectorSearch.from_vector_query(
    VectorQuery('vector_field', query_vector))
request = search.SearchRequest.create(vector_search)
result = scope.search('vector-index', request)
Let’s break this down.
We create a SearchRequest, which can contain a traditional FTS SearchQuery and/or the new VectorSearch.
Here we are just using the latter.
The VectorSearch allows us to run one or more VectorQuery objects.
The VectorQuery itself takes the name of the document field that contains embedded vectors ("vector_field" here), plus the actual query vector as a list of floats (or a base64-encoded string).
(Note that Couchbase itself is not involved in generating the vectors; these will come from an external source such as an embeddings API.)
Finally we execute the SearchRequest against the FTS index "vector-index", which has previously been set up to vector-index the "vector_field" field.
This happens to be a scoped index, so we are using scope.search().
If it were a global index, we would use cluster.search() instead (see [Scoped vs Global Indexes]).
It returns the same SearchResult detailed on the Search page.
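For comparison, here is a minimal sketch of the same request run at cluster level against a global Search index. The index name "global-vector-index" is illustrative only; only the search entry point changes.
# Same request as above, but executed against a (hypothetical) global
# Search index at cluster level rather than a scoped one.
request = search.SearchRequest.create(
    VectorSearch.from_vector_query(VectorQuery('vector_field', query_vector)))
result = cluster.search('global-vector-index', request)

for row in result.rows():
    print(f"id: {row.id}, score: {row.score}")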
Pre-Filters
From Couchbase Server 7.6.4 — and in Capella Operational clusters — pre-filtering with similarity search is available. This is a non-vector query that the server executes first to get an intermediate result. Then it executes the vector query on the intermediate result to get the final result.
prefilter = search.MatchQuery('primary', field='color_wheel_pos')

vector_query = VectorQuery.create('vector_field',
                                  query_vector,
                                  prefilter=prefilter)
vector_search = VectorSearch.from_vector_query(vector_query)
request = search.SearchRequest.create(vector_search)
result = scope.search('vector-index', request)
Here is the signature of the VectorQuery constructor (VectorQuery.create() accepts the same arguments):
def __init__(self,
             field_name: str,
             vector: Union[List[float], str],
             num_candidates: Optional[int] = None,
             boost: Optional[float] = None,
             prefilter: Optional[SearchQuery] = None
             ) -> None:
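As the vector parameter's type hint shows, the query vector can be passed either as a list of floats or, from Server 7.6.2, as a base64-encoded string. A minimal sketch, where query_vector_b64 is an assumed variable holding a string already produced by your embeddings pipeline:
# query_vector_b64 is assumed to be a base64-encoded string from your
# embeddings pipeline (Couchbase Server 7.6.2 or later); the expected
# encoding is described in the Search Service documentation.
vector_query = VectorQuery.create('vector_field', query_vector_b64)
request = search.SearchRequest.create(
    VectorSearch.from_vector_query(vector_query))
result = scope.search('vector-index', request)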
If no prefilter is specified, the server executes the vector query on all indexed documents.
The prefilter is passed as a named argument to VectorQuery.create():
VectorQuery.create(field_name,
                   vector,
                   num_candidates=num_candidates,
                   prefilter=search.MatchQuery('primary', field='color_wheel_pos'))
The prefilter can be any Search Query — from a simple match, as above, to a string query:
prefilter = search.QueryStringQuery('+description:sea -color_hex:fff5ee')

vector_query = VectorQuery.create('vector_field',
                                  query_vector,
                                  num_candidates=3,
                                  prefilter=prefilter)
vector_search = VectorSearch.from_vector_query(vector_query)
request = search.SearchRequest.create(vector_search)
result = scope.search('vector-index', request)
A SearchRequest can also contain just a traditional FTS query, with no vector search at all:
request = search.SearchRequest.create(search.MatchAllQuery())
result = scope.search('travel-sample-index', request)
See the API reference.
Multiple vector queries
You can run multiple vector queries together:
request = search.SearchRequest.create(VectorSearch([
    VectorQuery.create('vector_field',
                       query_vector,
                       num_candidates=2,
                       boost=0.3),
    VectorQuery.create('vector_field',
                       another_query_vector,
                       num_candidates=5,
                       boost=0.7)
]))
result = scope.search('vector-index', request)
How the results are combined (ANDed or ORed) can be controlled with vector_query_combination in VectorSearchOptions.
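For example, to require documents to match both vector queries, set the combination to AND. A minimal sketch, assuming VectorSearchOptions is importable from couchbase.options, VectorQueryCombination from couchbase.vector_search, and that VectorSearch accepts the options as its second argument; check the API reference if these differ.
# Assumed import locations and constructor shape -- see the API reference.
from couchbase.options import VectorSearchOptions
from couchbase.vector_search import VectorQueryCombination

request = search.SearchRequest.create(VectorSearch(
    [VectorQuery.create('vector_field', query_vector, num_candidates=2),
     VectorQuery.create('vector_field', another_query_vector, num_candidates=5)],
    VectorSearchOptions(vector_query_combination=VectorQueryCombination.AND)))
result = scope.search('vector-index', request)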
Combining FTS and vector queries
request = (search.SearchRequest.create(search.MatchAllQuery())
           .with_vector_search(VectorSearch.from_vector_query(
               VectorQuery('vector_field', query_vector))))
result = scope.search('vector-and-fts-index', request)
How the results are combined (ANDed or ORed) can be controlled with vector_query_combination in VectorSearchOptions.
Further Reading
Vector Query
- Vector Query for AI Apps docs (self-managed Couchbase Server).
- Vector GSI Query is just like any other query from the SDK to the Query Service: Python SDK API Reference, Query Service page.