Sizing Guidelines

Evaluate the overall performance and capacity goals that you have for Couchbase, and use that information to determine the necessary resources that you’ll need in your deployment.

The most common and important questions you need to ask when deploying a new Couchbase Server cluster are how many nodes you need, and what size they need to be.

With the increasing number of Couchbase services and the flexibility of the Couchbase Data Platform, the answer to this question can be challenging. This guide aims to help you better size your deployment.

If you want detailed recommendations for your specific deployment, you can contact Couchbase Support.

The sizing recommendations and calculations discussed in this guide are based on an analysis of performance data and common use-cases.

General Considerations

The sizing of your Couchbase Server cluster is critical to its overall stability and performance. While there are some basic system requirements to run Couchbase Server, you still need to evaluate the overall performance and capacity requirements for your workload and dataset, and then divide that into the hardware and resources you have available.

Your application wants the majority of reads to come out of the cache, and to have the I/O capacity to handle the writes. There needs to be enough capacity in all areas to support everything the system is doing while maintaining the required level of performance.

Multi-Dimensional Scaling

Couchbase Services allow you to access and maintain your data. You can deploy, maintain, and provision these services independently of each other. This independent service model allows you to take advantage of Multi-Dimensional Scaling.

Multi-Dimensional Scaling lets you fine-tune your cluster for optimal handling of changing workload-requirements, for each individual Couchbase Service.

Every Service has different demands on hardware resources. Multi-Dimensional Scaling plays an important role when sizing your Couchbase cluster, both pre and post-deployment. For example, core Data Service operations can often benefit from scaling out smaller commodity nodes. Low latency operations with the Query Service might see a greater benefit from scaling up hardware resources on a given node.

For more information about the nature and resource demands of each Couchbase Service, see Services.

About Couchbase Server Resources

This guide discusses four types of resources that you should consider when sizing a Couchbase Server cluster node:

CPU

CPU controls the number of cores and the clock speed required to run your workload.

RAM

RAM is often the most crucial area to size. Cached documents provide low-latency reads and consistently high throughput.

Your RAM represents the main memory you allocate to Couchbase Server. Determine your allocation based on the following factors:

How much free RAM is available beyond your OS and other applications.
How much data you want to store in main memory.
How much latency you expect from your Data, Indexing, and Query Service performance.

Some components that require RAM are:

All index storage types which need sufficient memory quota allocation for proper functioning.
The Search Service.

Table 1. Minimum RAM Quota for Couchbase Server Components
Component	Minimum RAM
Data Service	256 MB
Index Service (Standard Global Secondary)	256 MB
Indexing Service (Memory-Optimized)	256 MB minimum, 1024 MB and above recommended
Search Service (Full-Text Search)	256 MB minimum; 2048 MB and above recommended
Query Service	The Query Service does not require a RAM allocation.
Eventing Service	256 MB
Analytics Service	1024 MB

Storage (disk space)

Requirements for your disk subsystem are:

Disk size — Specifies the disk storage space needed to hold your entire dataset.
Disk I/O — Combines your sustained read/write rate, database file compaction, and any other operations that requires disk access.

To better support Couchbase Server, keep in mind the following:

Disk space continues to grow if fragmentation ratio keeps climbing. To mitigate this, add enough buffer in your disk space to store all of the data. Monitor your cluster’s fragmentation ratio in the Couchbase Server Web Console and trigger compaction processes as needed.
Couchbase recommends using Solid State Drives (SSD) when possible. An SSD gives much better performance than a Hard Disk Drive (HDD) when it comes to disk throughput and latency.

Network

Enough network bandwidth is vital to the performance of Couchbase Server. A reliable high-speed network for intra-cluster and inter-cluster communications has a huge effect on overall performance and scalability of Couchbase Server.

Most deployments can achieve optimal performance with 1 Gbps interconnects, but some may need 10 Gbps.

Sizing Data Service Nodes

Data Service nodes handle data service operations, such as create/read/update/delete (CRUD). The following sizing information applies to both the Couchstore and Magma storage engines.

Couchbase recommends reviewing the differences between the available storage engines before attempting to size the Data Service nodes in your cluster. For information, see Storage Engines.

It’s important to keep use-cases and application workloads in mind since different application workloads have different resource requirements. For example, if your working set needs to be fully in-memory, your cluster might need more RAM. If your application requires only 10% of data in-memory, you need disks with enough space to store all of the data, and that are fast enough for your read/write operations.

RAM Sizing for Data Service Nodes

You can start sizing the Data Service nodes by answering the following questions:

Is the application primarily using individual document access?
Do you plan to use XDCR?
What’s your working set size and what are your data operation throughput and latency requirements?

Answers to the above questions can help you better understand the capacity requirement of your cluster and provide a better estimation for sizing.

The following tables show an example use-case for sizing RAM:

Table 2. Input Variables for Sizing RAM
Input Variable	Value
`documents_num`	1,000,000
`ID_size`	100 bytes
`value_size`	10,000 bytes
`number_of_replicas`	1
`working_set_percentage`	20%

Table 3. Constants for Sizing RAM
Constants	Value
Type of Storage	SSD
`overhead_percentage`	25%
`metadata_per_document`	56 bytes
`high_water_mark`	85%

Based on the provided data, a rough sizing guideline formula would be:

Table 4. Guideline Formula for Sizing a Cluster
Variable	Calculation
`no_of_copies`	`1 + number_of_replicas`
`total_metadata`	`(documents_num) * (metadata_per_document + ID_size) * (no_of_copies)`
`total_dataset`	`(documents_num) * (value_size) * (no_of_copies)`
`working_set`	`total_dataset * (working_set_percentage)`
Cluster RAM quota required	`(total_metadata + working_set) * (1 + overhead_percentage) / (high_water_mark)`
Number of nodes	`Cluster RAM quota required / per_node_ram_quota`

Based on the above formula, these are the suggested sizing guidelines:

Table 5. Suggested Sizing Guideline
Variable	Calculation
`no_of_copies`	= 1 for original and 1 for replica
`total_metadata`	= 1,000,000 * (100 + 56) * (2) = 312,000,000 bytes
`total_dataset`	= 1,000,000 * (10,000) * (2) = 20,000,000,000 bytes
`working_set`	= 20,000,000,000 * (0.2) = 4,000,000,000 bytes
Cluster RAM quota required	= (312,000,000 + 4,000,000,000) * (1+0.25)/(0.85) = 6,341,176,470 bytes

This tells you that the RAM requirement for the whole cluster is 7 GB.

This amount is in addition to the RAM requirements for the operating system and any other software that runs on the cluster nodes.

Disk Sizing for Data Service Nodes

A key concept to remember about Couchbase Server’s data storage is that it’s an append-only system. When an application mutates or deletes a document, the old version of the document is not immediately removed from disk. Instead, Couchbase Server marks them as stale. They remain on disk until a compaction process runs that reclaims the disk space. When sizing disk space for your cluster, you take this behavior into account by applying an append-only multiplier to your data size.

When sizing disk space for the Data Service nodes, you first must determine the following information:

The total number of documents that you plan to store in the cluster. If this value constantly grows, consider the growth rate into the future when sizing.
The average size of each document.
Whether the documents can be compressed, and if they can, what compression ratio Couchbase Server can achieve. Couchbase Server always compresses documents when storing them on disk. See Compression for more information about compression in Couchbase Server. Documents containing JSON data or binaries can be compressed. Binary data that’s already compressed (such as compressed images or videos) cannot be compressed further.

Couchbase Server uses the Snappy compression algorithm, which prioritizes speed while still providing reasonable compression. You can estimate the compression ratio Couchbase Server can achieve for your data by compressing a sample set of documents using a snappy-based command line tool such as snzip. Otherwise, you can choose to use an estimated compression ratio of 0.7 for JSON documents.
The number of replicas for your buckets. See Intra-Cluster Replication for more information about replicas.
The number of documents that you plan to delete each day. This number includes both the number of documents directly deleted by your applications and those that expire due to TTL (time to live) settings. See Expiration for more information about document expiration.

This value is important because in the short term, deletions actually take a bit more disk space rather than less. Because of Couchbase Server’s append-only system, the deleted documents remain on disk until a compaction process runs. Also, Couchbase Server creates a tombstone record for each deleted document. This record consumes a small amount of additional disk space.
The metadata purge interval you’ll use. This purge process removes tombstones that records the deletion of documents. The default purge interval is 3 days. For more information about the purge interval, see Metadata Purge Interval.
Which storage engine your cluster will use. The storage engine affects the append-only multiplier that you use when sizing disk space. See Storage Engines for more information

To determine the amount of storage you need in your cluster:

Calculate the size of the dataset by multiplying the total number of documents by the average document size. If the documents are compressible, also multiply by the estimated compression ratio:

\[S_{\mathrm{dataset}} = \text{# of documents} \times \text{avg. document size} \times \text{compression ratio}\]
Calculate the total metadata size by multiplying the total number of documents by 56 bytes (the average metadata size per document):

\[S_{\mathrm{metadata}} = \text{# of documents} \times 56\]
Calculate the key storage overhead by multiplying the total number of documents by the average key size.

\[S_{\mathrm{keys}} = \text{# of documents} \times \text{avg. key size}\]
Calculate the tombstone space in bytes using the following formula:

\[\begin{equation} \begin{split} S_{\mathrm{tombstones}} = & ( \text{avg. key size} + 60 ) \times \text{purge frequency in days} \\ & \times ( \text{# of replicas} + 1 ) \times \text{# documents deleted per day} \end{split} \end{equation}\]
Calculate the total disk space required using the following formula:

\[\begin{equation} \begin{split} \text{total disk space} = & ( ( S_{\mathrm{dataset}} \times (\text{# replicas} + 1) \\ & + S_{\mathrm{metadata}} + S_{\mathrm{keys}} ) \times F_{\text{append-multiplier}} ) + S_{\mathrm{tombstones}} \end{split} \end{equation}\]

Where $F_{\text{append-multiplier}}$ is the append-only multiplier. This value depends on the storage engine you use:
- For Couchstore storage engine, use an append-only multiplier of 3.
- For Magma storage engine, use an append-only multiplier of 2.2.

For example, suppose you’re planning a cluster with the following characteristics:

Total number of documents: 1,000,000
The average document size: 10,000 bytes.
The documents contain JSON data that have an estimated compression ratio of 0.7.
Average key size: 32 bytes.
Number of replicas: 1
Number of documents deleted per day: 5,000
Purge frequency in days: 3
Storage engine: Magma

Using the formulas above, you can calculate the total disk space required as follows:

Calculate the dataset:

\[S_{\mathrm{dataset}} = 1,000,000 \times 10,000 \times 0.7 = 7,000,000,000 \text{bytes}\]
Calculate the total metadata size:

\[S_{\mathrm{metadata}} = 1,000,000 \times 56 = 56,000,000 \text{bytes}\]
Calculate the total key size:

\[S_{\mathrm{keys}} = 1,000,000 \times 32 = 32,000,000 \text{bytes}\]
Calculate the tombstone space:

\[S_{\mathrm{tombstones}} = (32 + 60) \times 3 \times (1 + 1) \times 5,000 = 2,760,000 \text{bytes}\]
Calculate the total disk space:

\[\begin{equation} \begin{split} \text{total disk space} = & ( 7,000,000,000 \times (1 + 1) \\ & + 56,000,000 + 32,000,000 ) \\ & \times 2.2 \\ & + 2,760,000 \\ & = 30,996,360,000 \text{bytes} \end{split} \end{equation}\]

Therefore, for the cluster in this example, you need at least 31 GB of disk space to store your data.

CPU Overhead

When sizing, you must account for raw CPU overhead when using a high number of buckets.

Your best practice is to allocate 0.2 cores per bucket on each node to maintain operational stability. This overhead does not account for any front-end workloads. You should allocate additional CPU cores for these workloads.
For more information about monitoring CPU usage and System Limits, see Monitor.

Sizing Index Service Nodes

To create and maintain secondary indexes and perform index scans for SQL++ queries, you need to size your Index Service nodes.

Similar to the nodes that run the Data Service, answer the following questions to take care of your application needs:

What is the length of your document keys?
Which fields need to be indexed?
Will you be using simple or compound indexes?
What is the minimum, maximum, or average value size of the indexed fields?
How many indexes do you need?
How many documents need to be indexed?
What is the working set percentage of index required memory?

Answers to these questions can help you better understand the capacity requirement of your cluster, and provide a better estimation for sizing.

The following is an example use-case for sizing RAM for the Index service:

Use the following sizing guide to compute the memory requirement for each individual index and to determine the total RAM quota required for the Index Service.

Table 6. Input Variables for Sizing RAM
Input Variable	Value
`num_entries` (Number of index entries)	10,000,000
`ID_size` (Size of DocumentID)	30 bytes
`index_entry_size` (Size of secondary key)	50 bytes
`working_set_percentage` (Nitro, Plasma, ForestDB)	100%, 20%, 20%

Table 7. Constants for Sizing RAM
Constants	Value
`overhead_percentage`	25%
`metadata_back_index` (Nitro, Plasma, ForestDB)	46, 46, 40 bytes
`metadata_main_index` (Nitro, Plasma, ForestDB)	74, 74, 70 bytes
`metadata_per_entry` (Nitro, Plasma, ForestDB)	`metadata_back_index` + `metadata_main_index`

Based on the provided data, a rough sizing guideline formula would be:

Table 8. Guideline Formula for Sizing a Cluster
Variable	Calculation
`total_index_data(secondary index)` (Nitro)	`(num_entries) * (metadata_per_entry + ID_size + index_entry_size)`
`total_index_data(secondary index)` (Plasma, ForestDB)	`(num_entries) * (metadata_per_entry + ID_size + index_entry_size) * 2`
`total_index_data(primary index)` (Nitro, Plasma, ForestDB)	`(num_entries) * (metadata_main_index + ID_size + index_entry_size)`
`index_memory_required(100% resident)` (memdb)	`total_index_data * (1 + overhead_percentage)`
`index_memory_required(20% resident)` (Plasma, ForestDB)	`total_index_data * (1 + overhead_percentage) * working_set`

Based on the above formula, these are the suggested sizing guidelines:

Table 9. Suggested Sizing Guideline
Variable	Calculation
`total_index_data(secondary index)` (Nitro)	(10000000) * (120 + 30 + 50) = 2000000000 bytes
`total_index_data(secondary index)` (Plasma)	(10000000) * (120 + 30 + 50) * 2 = 4000000000 bytes
`total_index_data(secondary index)` (ForestDB)	(10000000) * (80 + 30 + 50) * 2 = 3200000000 bytes
`index_memory_required(100% resident)` (Nitro)	(2000000000) * (1 + 0.25) = 2500000000 bytes
`index_memory_required(20% resident)` (Plasma)	(2000000000) * (1 + 0.25) * 0.2 = 1000000000 bytes
`index_memory_required(20% resident)` (ForestDB)	(3200000000) * (1 + 0.25) * 0.2 = 800000000 bytes

The previous example shows the memory requirement of a secondary index with 10M index entries, each with a 50 bytes secondary key and a 30 bytes DocumentID. The memory usage requirements are 2.5 GB (Nitro, 100% resident), 1 GB (plasma, 20% resident), 800 MB (ForestDB, 20% resident).

The storage engine used in the sizing calculation corresponds to the storage mode chosen for Index Service as explained in the table below.

Table 10. Storage engine and storage mode
Storage Engine	Storage Mode
Standard GSI (Community Edition)	ForestDB
Standard GSI(Enterprise Edition)	Plasma
Memory-Optimized (Enterprise Edition)	Nitro

Sizing Search Service Nodes

Search Service nodes manage Search indexes and serve your Search queries.

Basic Search indexes are lists of all the unique terms that appear in the documents on your cluster. For each term, the Search index also contains a list of the documents where that term appears, known as an inverted index. These lists inside a Search index can cause the Search index to be larger or smaller than your original dataset, depending on the complexity of your data. For more information about the structure of a Search index, see Search Index Architecture.

Specific options in your Search index configuration can also increase its size, such as Store, Include in _all field, and Include Term Vectors. For more information about what options can increase index size and storage requirements, see Child Field Options.

In general, when sizing nodes for a deployment that uses the Search Service, you need to determine the number of vCPUs and the amount of RAM that will support your workload.

Calculating Node Requirements

To size the Search Service nodes in your cluster, you need the following information:

The number of documents you need to include in your Search index or indexes.
The average size of the documents that need to be included in your Search index, in KB.
A sample document or documents that show the structure of your data.
The specific queries per second (QPS) target you need from the Search Service.

You should also consider your replication, recovery, and high availability needs.

With all this information, you can work with Couchbase Support to get the most accurate sizing for your Search workload.

If you want to try sizing your cluster yourself, you can use some of the following guidelines to size your vCPUS and RAM, using averages and estimates from other Search deployments.

To size your cluster for a geospatial search or Vector Search workload, or to get the best sizing results for any workload, contact Couchbase Support.

vCPUS

A heavy QPS workload requires more vCPUs. If your workload requires a high QPS, this is the most important part of your sizing for the Search Service.

For example, if your target QPS is 30,000 and your queries are less complex, divide your total QPS target by 200 to get your required vCPUs:

\[30,0000_{\mathrm{QPS}} \div 200_{\mathrm{Mid}} = 150_{\mathrm{vCPUs}}\]

The formula gives a target of 150 vCPUs for a mid range workload with a less complex query.

If your queries were more complex, but the QPS target was the same, the calculation changes to use a value of 150 and a result of 200 vCPUs:

\[30,0000_{\mathrm{QPS}} \div 150_{\mathrm{Low}} = 200_{\mathrm{vCPUs}}\]

You can then divide your result by the vCPU configuration you want to use to calculate the number of nodes you need:

\[\lceil 150_{\mathrm{vCPUs}} \div 32_{\mathrm{vCPUs Per Node}} \rceil = 5_{\mathrm{Nodes}}\]

Based on the formula, if you wanted to use nodes with 32 vCPUs and reach a target QPS of 30,000 with less complex queries, you would need 5 nodes in your deployment.

RAM

In general, you should allocate 65% of the RAM on a node in your cluster where you want to run the Search Service. A Search node needs more RAM if you:

Are storing field values or using doc values.
Have analyzed text fields.
Want to use more complex queries than keyword matches.

To calculate a more precise estimate for the required RAM for the Search Service, you need to:

Calculate Your Per Doc Index Bytes
Calculate Your Total Index GB
Add Your Replication Factor
Calculate Your Total Required RAM

Calculate Your Per Doc Index Bytes

Use the following formula first to calculate the number of bytes per document in your Search index:

\[\begin{equation} \begin{split} \text{Per Doc Index Bytes} = ( ( W \cdot 1024 \cdot \text{f_text} \cdot \text{m_text} ) + ( W \cdot 1024 \cdot \text{f_kw} \cdot \text{m_kw} ) + B ) \times (1 + D) \end{split} \end{equation}\]

You need to know the following variables for the formula:

Variable Description

Variable	Description
\(W\)	The average size of your JSON documents, in KB.
\({\text{f_text}}\)	A measure of the analyzed text from your JSON documents. You can omit this value if you’re using primarily keyword searches and do not have longer-form text fields that require an analyzer. You can use the following value ranges based on the kind of analyzed text you have in your index: Product descriptions, titles and body snippets, support ticket descriptions: `0.10-0.20` Long note fields, email bodies, articles, knowledge-base content: `0.20-0.40` Log files, message streams, event payloads with large message fields: `0.40-0.70` If you’re not sure about the size and complexity of the text fields in your documents and how they match to the example ranges, use a value of `0.25` to get a rough estimate. To get the most accurate values for \({\text{f_text}}\) and your RAM sizing calculations, contact Couchbase Support.
\({\text{m_text}}\)	A multiplier for calculating how the bytes in your documents translate into your Search index for analyzed text fields. For a good planning range, try a value between `0.12-0.35`, increasing based on the complexity of your analyzed text fields. To get the most accurate values for \({\text{m_text}}\) and your RAM sizing calculations, contact Couchbase Support.
\({\text{f_kw}}\)	A measure of the keywords from your JSON documents. For a good planning range for a keyword search use case or a filter-heavy workload, use a value of `0.10`. To get the most accurate values for \({\text{f_kw}}\) and your RAM sizing calculations, contact Couchbase Support.
\({\text{m_kw}}\)	A multiplier for calculating how the bytes in your documents translate into your Search index for keywords. For a good planning range, try a value between `0.10-0.18`. To get the most accurate values for \({\text{m_kw}}\) and your RAM sizing calculations, contact Couchbase Support.
\(B\)	The number of bytes needed for storing field values for your documents, if store is enabled for a child field mapping. If you’re not storing any field values in your Search index, set this value to `0`.
\(D\)	The additional overhead from adding doc values to your Search index from a child field mapping. Use a value from `0-1`. If you’re not using doc values in your Search index, set this value to `0`.

$W$

The average size of your JSON documents, in KB.

${\text{f_text}}$

A measure of the analyzed text from your JSON documents.

You can omit this value if you’re using primarily keyword searches and do not have longer-form text fields that require an analyzer.

You can use the following value ranges based on the kind of analyzed text you have in your index:

Product descriptions, titles and body snippets, support ticket descriptions: 0.10-0.20
Long note fields, email bodies, articles, knowledge-base content: 0.20-0.40
Log files, message streams, event payloads with large message fields: 0.40-0.70

If you’re not sure about the size and complexity of the text fields in your documents and how they match to the example ranges, use a value of 0.25 to get a rough estimate.

To get the most accurate values for ${\text{f_text}}$ and your RAM sizing calculations, contact Couchbase Support.

${\text{m_text}}$

A multiplier for calculating how the bytes in your documents translate into your Search index for analyzed text fields.

For a good planning range, try a value between 0.12-0.35, increasing based on the complexity of your analyzed text fields.

To get the most accurate values for ${\text{m_text}}$ and your RAM sizing calculations, contact Couchbase Support.

${\text{f_kw}}$

A measure of the keywords from your JSON documents.

For a good planning range for a keyword search use case or a filter-heavy workload, use a value of 0.10.

To get the most accurate values for ${\text{f_kw}}$ and your RAM sizing calculations, contact Couchbase Support.

${\text{m_kw}}$

A multiplier for calculating how the bytes in your documents translate into your Search index for keywords.

For a good planning range, try a value between 0.10-0.18.

To get the most accurate values for ${\text{m_kw}}$ and your RAM sizing calculations, contact Couchbase Support.

$B$

The number of bytes needed for storing field values for your documents, if store is enabled for a child field mapping.

If you’re not storing any field values in your Search index, set this value to 0.

$D$

The additional overhead from adding doc values to your Search index from a child field mapping.

Use a value from 0-1. If you’re not using doc values in your Search index, set this value to 0.

If you want to add numeric and geospatial fields to your sizing estimate, change the formula to the following:

\$\text{Per Doc Index Bytes} = ( ( W \cdot 1024 \cdot \text{f_text} \cdot \text{m_text} ) + ( W \cdot 1024 \cdot \text{f_kw} \cdot \text{m_kw} )\$
\$+ ( W \cdot 1024 \cdot 0.02_\text{f_numeric} \cdot 2.0_\text{m_numeric} )\$
\$+ ( W \cdot 1024 \cdot 0.002_\text{f_geo} \cdot 2.0_\text{m_geo} )+ B ) \times (1 + D)\$

The values provided in the preceding formula for ${\text{f_numeric}}$, ${\text{m_numeric}}$, ${\text{f_geo}}$ and ${\text{m_geo}}$ are reasonable defaults for most numeric and geospatial search workloads.

To get the most accurate values for ${\text{f_numeric}}$, ${\text{m_numeric}}$, ${\text{f_geo}}$ and ${\text{m_geo}}$ and accurately size the RAM for your workload, contact Couchbase Support.

Calculate Your Total Index GB

After you have calculated your ${\text{Per Doc Index Bytes}}$, calculate the total GB needed for your Search index, where:

$N$ is the total number of JSON documents you want to include in your Search index.
$S$ is a measure of your system overhead. For a rough estimate, use a value of $0.10$.

Use the following formula:

\[\begin{equation} \begin{split} \text{Total Index GB} = \frac{(N \times \text{Per Doc Index Bytes})}{10^{9}} \times (1 + S) \end{split} \end{equation}\]

Add Your Replication Factor

If you want to add replicas to your Search index, you need to factor that into your ${\text{Total Index GB}}$.

Use the following formula:

\[\begin{equation} \begin{split} \text{Total Index GB With Replicas} = \text{Total Index GB} \times (\text{Number Of Replicas} + 1) \end{split} \end{equation}\]

Calculate Your Total Required RAM

Then, you can calculate the total RAM required on a node for your use case with the following formula:

\[\begin{equation} \begin{split} \text{Total Node RAM} = \text{Total Index GB With Replicas} \times 0.65 \end{split} \end{equation}\]

Search Node Sizing Examples

You’ll get the most accurate results by going through sizing with Couchbase Support, but you can use the following examples for a sizing estimate for a Search workload:

High QPS and Keyword-Only Searches
Lower QPS with Higher Storage and a Larger Index

High QPS and Keyword-Only Searches

The following sizing scenario assumes a high QPS target, a CPU-bound configuration, and a keyword-only workload for a compact Search index.

This example uses the following variables:

Number of Documents	Per Doc Index Bytes	QPS Target	System Overhead	Replica Factor
194,000,000	258.05	87,000	0.10	2 (1 replica + 1)

Number of Documents

Per Doc Index Bytes

QPS Target

System Overhead

Replica Factor

194,000,000

258.05

87,000

0.10

2 (1 replica + 1)

Based on these variables, the required vCPUs could be either:

$580$, using a value of $150$ in the vCPU calculation.
$435$, using a value of $200$ in the vCPU calculation.

The Total Index GB With Replicas is $110.13 \text{ GB}$.

The vCPUs matter the most in this workload.

Your recommended node configurations could be any of the following:

	Number of Nodes	Number of vCPUs	RAM
Higher QPS	14	32	128 GB
Higher QPS	7	64	256 GB
Lower QPS	18	32	128 GB
Lower QPS	9	64	256 GB

Lower QPS with Higher Storage and a Larger Index

The following sizing scenario assumes a comparatively lower QPS target, a storage-bound configuration, and a larger Search index.

This example uses the following variables:

Number of Documents	Per Doc Index Bytes	QPS Target	System Overhead	Replica Factor
500,000,000	344.86 (For faceting, sorting, and more complex queries)	12,000	0.10	2 (1 replica + 1)

Number of Documents

Per Doc Index Bytes

QPS Target

System Overhead

Replica Factor

500,000,000

344.86 (For faceting, sorting, and more complex queries)

12,000

0.10

2 (1 replica + 1)

Based on these variables, the required vCPUs would be $60$, based on the more complex queries needing a higher QPS per vCPU and using a value of $200$ in the calculation.

If you wanted to use nodes with 32 vCPUs, you would need 2 nodes.

The Total Index GB With Replicas is $379.34 \text{ GB}$.

Each of the 2 nodes would need $379.34 \text{ GB} \times 0.65 = 123.28 \text{ GB}$ of RAM.

As a result, the best configuration for this workload should be 2 nodes with 32 vCPUs and 128 GB of RAM.

Sizing Query Service Nodes

A node that runs the Query Service executes queries for your application needs.

Since the Query Service does not need to persist data to disk, there are minimal resource requirements for disk space and disk I/O. You only need to consider CPU and memory.

Answer the following questions to help size the Query Service nodes on your cluster:

What types of queries do you need to run?
Do you need to run stale=ok or stale=false queries?
Are the queries simple or complex? For example, do you need to use JOINs?
What are the throughput and latency requirements for your queries?

Different queries have different resource requirements. A simple query might return results within milliseconds while a complex query may require several seconds.

The formula used to calculate the number of queries that’s processed simultaneously is CPU_cores * 4. The formula used to calculate the maximum queue-length for queries is CPU_cores * 256. If you reach either limit, the system rejects additional queries with a 503 error.

Sizing Analytics Service Nodes

The Analytics engine is a full-fledged parallel query processor that supports parallel joins, aggregations, and sorting for JSON data.

The Analytics Service is dependent on the Data Service and requires the Data service to be running on at least one of the cluster nodes.

Data space

Make sure that the data space for your Analytics Service nodes takes into account metadata replicas. The Analytics Service only replicates the metadata and not the actual data. There’s a small overhead for metadata replicas as metadata is generally small.
When evaluating a query, the Analytics engine uses temporary disk space. The type of query you want to run determines the required amount of temporary disk space.

For example, queries with heavy JOINs, aggregates, windowing, or additional predicates require more temporary disk space. Typically, the temporary disk space can be 2x the data space.
The percent of data shadowed, which is dependent on your use case.
When you load data from the Data Service into the Analytics Service, you can apply a filter to reduce both the loaded data size and the Analytics Service storage requirements proportionally.

Disk Types and Partitioning

During query execution, the Analytics query engine concurrently reads and processes data from all partitions. The Input/Output Operations per Second (IOPS) of the physical disk that hosts the data partitions plays a major role in determining the query execution time. Modern storage devices such as SSDs have much higher IOPS and can deal better with concurrent reads than HDDs. A single data partition underutilizes high IOPS devices.

To simplify setup for nodes with a single modern storage device, the Analytics Service creates multiple data partitions on the same storage device. It does this only when you specify a single Analytics disk path during node initialization. The Analytics Service determines the number of partitions using the following formula:

Maximum partitions to create = Min((Analytics Memory in MB / 1024), 16)
Actual created partitions = Min(node virtual cores, Maximum partitions to create)

For example, if a node has 8 virtual cores and the Analytics Service has at least 8 GB of memory, the system creates 8 data partitions on that node. Similarly, for a node with 32 virtual cores and 16 GB memory, the system creates 16 partitions, the maximum for automatic partitioning.

Index Considerations

The size of a secondary index is around the total size of indexed fields in the Analytics collection. For example, if a collection has 20 fields and only 1 of those fields appears in the secondary index, the secondary index size is ~1/20 of the collection size.

Sizing Eventing Service Nodes

Eventing is a compute-oriented service. By default, the Eventing Service has 1 worker and each worker has 2 threads of execution. You can scale the Eventing Service both vertically by adding more workers or horizontally by adding more nodes. The Eventing Service partitions the vBuckets across the number of available nodes.

CPU

Eventing runs arbitrary JavaScript code. This flexibility makes it difficult to define a precise sizing formula. You cannot define a precise formula unless you know the function designs, their KV operations, query operations, cURL operations, and the expected mutation rate.

For example, if you process 100K mutations per second and only match 1 out of 1000 patterns, then perform some intense computation on the matched 100 items in your Eventing Function, you need 100X less compute than if you performed the intense computation on each mutation.

Eventing also can perform I/O to external REST endpoints through a synchronous HTTP/S cURL call. Eventing typically blocks on I/O and requires little CPU. Achieving high throughput to overcome bandwidth requires additional workers and cores.

Use 8 vCPUs or 4 physical cores to run Eventing Functions.

RAM

For more information about how to size your Eventing memory quota, see Eventing Service Memory Quota.

Eventing Storage Collection (previously Metadata Bucket)

Each Eventing function stores fewer than 2048 documents in its Eventing storage collection. If timers are not used or if the active timers count does not exceed the per-function document limit, store the Eventing storage collection in a 100 MB bucket.

Using timers requires additional storage for each active timer. Each active timer requires 800 bytes, plus the size of the passed context, which represents the state supplied to the function at future execution.

A 200-byte context results in 1 KB of storage per active timer. 100,000 active timers require 100 MB of additional bucket space.

As a best practice, keep this collection fully resident in-memory to make sure you have constant availability.

All Eventing functions use this collection.

Sizing Backup Service Nodes

The hardware requirements for running a backup cluster are as follows:

Table 11. Hardware requirements
	Minimum	Recommended
CPUs	4 CPU cores	16 CPU cores
Memory	8 GiB	16 GiB

Sizing for Replication (XDCR)

Before setting up a replication, you must make sure your cluster is appropriately configured and provisioned.

Your cluster must be properly sized to be able to handle new XDCR streams.

For example, XDCR needs 1-2 additional CPU cores per stream. In some cases, it also requires additional RAM and network resources. If a cluster is not sized to handle both the existing workload and the new XDCR streams, the performance of both XDCR and the cluster overall might be negatively impacted.

For information about preparing your cluster for replication, see Prepare for XDCR.