The Analytics Service provides a parallel data-management capability, allowing the running of complex analytical queries.
The Analytics Service helps analyze JSON data in Couchbase in real time, without the need to extract, transform, and load (ETL) the underlying operational data into a separate system. The Analytics Service leverages massively parallel processing (MPP) architecture to deliver analytics and insights at the speed of transactions. Couchbase Analytics is best suited for running large, complex queries involving data aggregations, typically on large amounts of data.
The Analytics Service allows you to develop insight-driven applications easily and quickly. There are two focus areas: operational analytics and near real-time analytics. Operational analytics uses data analysis and business intelligence to improve efficiency and streamline everyday operations in real time.
Deploying the Analytics Service
The Analytics Service enables you to create shadow copies of the data you would like to analyze. When shadow collections are created, they are connected to the Data Service, and any changes in the operational data are reflected in the Analytics Service in near-real time, using the Database Change Protocol. This continuous data ingestion allows operational and analytic queries to run concurrently without impacting one another.
Due to the large scale and duration of operations it is likely to perform, the Analytics Service should be run alone, on its node or nodes in the cluster, with no other Couchbase Service running on the Analytics nodes.
For the practical steps required to initialize or join a cluster, and to deploy services, see Create a Cluster. For information on how to run analytic queries, see the Introduction to Couchbase Analytics.