What are hashed shard keys versus ranged shard keys, and their respective use cases?-MongoDB-php.cn

Table of Contents

Hashed Shard Keys: Even Distribution, Random Access

Ranged Shard Keys: Ordered Data, Targeted Queries

Choosing Between Them: Know Your Query Patterns

Home

Database

MongoDB

What are hashed shard keys versus ranged shard keys, and their respective use cases?

Robert Michael Kim

Jul 18, 2025 am 02:13 AM

哈希分片键范围分片键

Choosing a hash shard key or a range shard key depends on the query mode and data distribution requirements. The hash shard key achieves uniform data distribution through a hash algorithm, which is suitable for scenarios with high write load and avoiding hot spots, but the range query efficiency is low; 1. Suitable for applications with write extensions and no obvious range query. Range shard keys are based on key-value sequential distribution of data, suitable for scenarios where range queries (such as time intervals) are frequently performed; 2. Support efficient data subset scanning, but may lead to uneven data distribution and hot issues. 3. If the application mainly uses insert and has a small range query, select the hash shard key; if range filtering is often performed, select the range shard key. In addition, composite shard keys can also be considered to take into account multiple access modes.

What are hashed shard keys versus ranged shard keys, and their respective use cases?

When deciding between hashed shard keys and ranged shard keys in a sharded database like MongoDB, the main difference lies in how data is distributed across shards — and that has a big impact on performance and query patterns.

Hashed Shard Keys: Even Distribution, Random Access

A hashed shard key uses a hash of the actual key value to determine which shard a document goes to. This ensures an even distribution of data across all available shards, especially when the original key has a sequential nature (like timestamps or auto-incrementing IDs).

Use cases:

When you want to avoid write hotspots.
For workloads with high insert or update volume.
If your queries don't usually target specific ranges of the key.

One thing to note is that while writes are spread out nicely, range-based queries (eg, "find all records from last week") may end up hitting every shard, which can be slower than with a ranged key.

Ranged Shard Keys: Ordered Data, Targeted Queries

Ranged shard keys distributed data based on the natural order of the key values. Documents with similar key values end up close together on the same or neary shards.

Use cases:

When most queries target a specific range (eg, time-based queries).
If you need efficient scans over a subset of data.
When chunk migrations stay manageable due to predictable growth.

This setup works well for time-series data where you often query recent entries. But it can lead to uneven distribution if your data grows mostly at one end (like ever-increasing timestamps), which can cause hotspots.

Choosing Between Them: Know Your Query Patterns

Here's what to think about:

Write scaling : Hashed keys help balance inserts across shards.
Query efficiency : Ranged keys allow more targeted queries.
Data growth pattern : Increased values can overload a single chunk with ranged keys.
Hotspots : Sequential inserts with ranged keys can create bottlenecks.

If your app does a lot of inserts and doesn't rely heavily on range queries, go with hashed. If your queries often filter by a range (like date ranges or numeric ranges), then a ranged shard key might give better performance.

It's also possible to use a compound shard key — combining both hashed and other fields — but that's more advanced and depends heavily on your specific access patterns.

Basically that's it.

The above is the detailed content of What are hashed shard keys versus ranged shard keys, and their respective use cases?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undress AI Tool

Undress images for free

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

RimWorld Odyssey Temperature Guide for Ships and Gravtech

4 weeks ago By Jack chen

RimWorld Odyssey How to Fish

1 months ago By Jack chen

What are the transaction limits for a foreign user on Alipay?

1 months ago By 下次还敢

Kimi K2: The Most Powerful Open-Source Agentic Model

1 months ago By Jack chen

Can I have two Alipay accounts?

1 months ago By 下次还敢

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Laravel Tutorial

1602

PHP Tutorial

1506

276

Related knowledge

What are the limitations of MongoDB's free tier offerings (e.g., on Atlas)? Jul 21, 2025 am 01:20 AM

MongoDBAtlas' free hierarchy has many limitations in performance, availability, usage restrictions and storage, and is not suitable for production environments. First, the M0 cluster shared CPU resources it provides, with only 512MB of memory and up to 2GB of storage, making it difficult to support real-time performance or data growth; secondly, the lack of high-availability architectures such as multi-node replica sets and automatic failover, which may lead to service interruption during maintenance or failure; further, hourly read and write operations are limited, the number of connections and bandwidth are also limited, and the current limit can be triggered; finally, the backup function is limited, and the storage limit is easily exhausted due to indexing or file storage, so it is only suitable for demonstration or small personal projects.

What is the difference between updateOne(), updateMany(), and replaceOne() methods? Jul 15, 2025 am 12:04 AM

The main difference between updateOne(), updateMany() and replaceOne() in MongoDB is the update scope and method. ① updateOne() only updates part of the fields of the first matching document, which is suitable for scenes where only one record is modified; ② updateMany() updates part of all matching documents, which is suitable for scenes where multiple records are updated in batches; ③ replaceOne() completely replaces the first matching document, which is suitable for scenes where the overall content of the document is required without retaining the original structure. The three are applicable to different data operation requirements and are selected according to the update range and operation granularity.

Can you explain the purpose and use cases for TTL (Time-To-Live) indexes? Jul 12, 2025 am 01:25 AM

TTLindexesautomaticallydeleteoutdateddataafterasettime.Theyworkondatefields,usingabackgroundprocesstoremoveexpireddocuments,idealforsessions,logs,andcaches.Tosetoneup,createanindexonatimestampfieldwithexpireAfterSeconds.Limitationsincludeimprecisedel

What are the considerations for data migration from a relational database to MongoDB? Jul 12, 2025 am 12:45 AM

Migrating relational databases to MongoDB requires focusing on data model design, consistency control and performance optimization. First, convert the table structure into a nested or referenced document structure according to the query pattern, and use nesting to reduce association operations are preferred; second, appropriate redundant data is appropriate to improve query efficiency, and judge whether to use transaction or application layer compensation mechanisms based on business needs; finally, reasonably create indexes, plan sharding strategies, and select appropriate tools to migrate in stages to ensure data consistency and system stability.

What are roles and privileges in MongoDB's Role-Based Access Control (RBAC) system? Jul 13, 2025 am 12:01 AM

MongoDB's RBAC manages database access through role assignment permissions. Its core mechanism is to assign the role of a predefined set of permissions to the user, thereby determining the operations and scope it can perform. Roles are like positions, such as "read-only" or "administrator", built-in roles meet common needs, and custom roles can also be created. Permissions are composed of operations (such as insert, find) and resources (such as collections, databases), such as allowing queries to be executed on a specific collection. Commonly used built-in roles include read, readWrite, dbAdmin, userAdmin and clusterAdmin. When creating a user, you need to specify the role and its scope of action. For example, Jane can have read and write rights in the sales library, and inve

What are read preferences, and how do they control query routing in a replica set? Jul 13, 2025 am 12:26 AM

MongoDB's read preference determines how to route the application's read operations to the replica set members. All read operations are sent to the master node by default, but different read preferences can be configured according to requirements to optimize performance and data consistency. The main modes include primary (only read the master node, ensuring the latest data), primaryPreferred (priority master node, use secondary node when not available), secondary (secondary node only, suitable for offloading the primary node load), secondaryPreferred (priority secondary node, use primary node when there is no secondary), and nearest (select the node with the lowest network latency, regardless of primary or secondary). These patterns affect query routing methods, and are driven

MongoDB Atlas vs. Self-Hosted MongoDB: Which is Right for You? Jul 30, 2025 am 12:50 AM

MongoDBAtlas is more suitable for small teams or organizations lacking DBA resources because it provides fully automatic management, rapid deployment and built-in security; 2. Atlas is transparent and easy to budget in the early stage, but it may be higher than self-hosting after large usage. Although the infrastructure fee is low, it needs to be included in the labor and risk costs; 3. In terms of security and compliance, Atlas defaults to enterprise-level configuration and automatically updates, and self-hosting needs to be implemented manually and is prone to errors; 4. Select from hosting when it requires strong control, customization needs or data sovereignty restrictions, otherwise most teams should choose Atlas first to focus on product development rather than operation and maintenance. This choice is the most time-saving and reliable and supports flexible adjustments in the future.

What are transactions in MongoDB, and how do they provide ACID properties for multi-document operations? Jul 31, 2025 am 06:25 AM

MongoDBintroducedmulti-documenttransactionsinversion4.0,enablingatomicoperationsacrosscollectionsforstrongconsistency.Transactionsallowmultipleread/writeoperationstobegroupedasasingleunit,eitherallsucceedingorfailingtogether.Theyaresupportedinreplica

See all articles