Aggregation Pipeline in mongodb in sharded collection

Question

Just referring to Mongodb aggregation link https://docs.mongodb.com/v3.2/aggregation/#aggregation-pipeline and it mentions that

 "The aggregation pipeline can operate on a sharded collection."

Please lemme know If database is sharded then all the collections in the database will be sharded. Also Please confirm that if sharded the aggregate query will be run in many servers, and delivery the results fast. If so how the aggregation query functions.

Regards

Kris

hyades · Accepted Answer

A sharded cluster always has a Primary Shard, and one or more secondary shards.

Please lemme know If database is sharded then all the collections in the database will be sharded

No, by default none of your collections will be sharded. All such collections stay wholly on the primary shard. To shard a collection, use the shardCollection command

Also Please confirm that if sharded the aggregate query will be run in many servers, and delivery the results fast.

One important thing while defining a collection in a sharded environment is the shard key. You should ensure that choose a good shard key which is responsible for distribution of data across the shards. Thus, if you choose a good shard key, you can expect a better performance than a non-sharded environment.

If so how the aggregation query functions.

The aggregation query is split by the $match to different shards depending on where the docs are present, and are finally merged together on a shard. A good read is https://docs.mongodb.com/v3.2/core/aggregation-pipeline-sharded-collections/

Aggregation Pipeline in mongodb in sharded collection

Answers (2)

Related Questions