Business Intelligence Platform: Tutorial Using MongoDB Aggregation Pipeline
Using data to answer interesting questions is what researchers are busy doing in today’s data driven world. Given huge volumes of data, the challenge of processing and analyzing it is a big one; particularly for statisticians or data analysts who…
MongoDB Connector for Hadoop
Purpose The MongoDB Connector for Hadoop is a library which allows MongoDB (or backup files in its data format, BSON) to be used as an input source, or output destination, for Hadoop MapReduce tasks. It is designed to allow greater…
Chunk Size in MongoDB
When the first mongos connects to a set of config servers, it initializes the sharded cluster with a default chunk size of 64 megabytes. This default chunk size works well for most deployments; however, if you notice that automatic…
[MongoDB]: Tag aware sharding
MongoDB supports tagging a range of shard key values to associate that range with a shard or group of shards. Those shards receive all inserts within the tagged range. The balancer obeys tagged range associations, which enables the following deployment…
MongoDB: Change opLog Size
What is opLog ? Just as Transaction Log in MS SQL Server or Binary log in MySQL, all those transactions on MongoDB RS will be written into a separate file, called opLog. This opLog will have a fixed size, which…