Home  >  Article  >  Database  >  What are clustered indexes, non-clustered indexes, joint indexes and unique indexes in MySQL

What are clustered indexes, non-clustered indexes, joint indexes and unique indexes in MySQL

WBOY
WBOYforward
2023-05-26 16:37:472251browse

    1. Index type

    Indices can be divided into B-Tree indexes and hash indexes according to the underlying implementation. Most of the time we use B-Tree indexes. -Tree index, because of its good performance and features, is more suitable for building high-concurrency systems.

    Divided according to the storage method of the index, the index can be divided into clustered index and non-clustered index. The leaf nodes of a non-clustered index only contain all fields and primary key IDs, while the leaf nodes of a clustered index contain complete record rows.

    According to clustered index and non-clustered index, it can be further divided into ordinary index, covering index, unique index and joint index.

    2. Clustered index and non-clustered index

    Clustered index is also called clustered index. It is not actually a separate index type, but a data storage method. The leaf nodes of the clustered index store all column information of a row of records. In other words, the leaf node of the clustered index contains a complete record row.

    Non-clustered index is also called auxiliary index and ordinary index. Its leaf nodes only contain one primary key value. To find records through non-clustered index, you must first find the primary key, and then go to the clustered index through the primary key. Find the corresponding record row, this process is called table return.

    For example, a data table containing user names and ages, assuming that the primary key is the user ID, the structure of the clustered index is (orange represents the id, green is the pointer to the child node):

    What are clustered indexes, non-clustered indexes, joint indexes and unique indexes in MySQL

    In the leaf nodes, in order to highlight the records, (id, name, age) is distinguished. In fact, they are connected together. They form a line. The record as a whole.

    The structure of a non-clustered index (with age as the index) is:

    What are clustered indexes, non-clustered indexes, joint indexes and unique indexes in MySQL

    In addition to the age field itself, the leaf nodes of the node , only contains the primary key ID of the current record, and does not contain the information of the complete record. You need to query the clustered index through the ID number to obtain the entire row of record data.

    In InnoDB, each table must have a clustered index, which will be created based on the primary key by default. If there is no primary key in the table, InnoDB will select a suitable column as the clustered index. If no suitable column is found, a hidden column DB_ROW_ID will be used as the clustered index.

    3. Covering Index

    Because the non-clustered index does not contain complete data information, searching for complete data records requires table return, so one query operation actually requires two index queries. . If every index query needs to be run twice to get the result, then this will inevitably lead to a loss of efficiency, because if you can reduce the query by one, you should reduce it by one.

    Take the age index above as an example. It is a non-clustered index. If I want to query the user's ID by age, I execute the following statement:

    1

    select id from userinfo where age = 10;

    Is it still necessary to return the table in this case? Because I only need the value of the id, I can already get the id through the age index. If I still go back to the table once, wouldn't it be a useless operation? In fact it is not needed. When the auxiliary index already contains all the information required for the query, the table return operation can be avoided in the index query. This is a covering index.

    4. Joint index

    Joint index refers to an index created on multiple columns at the same time. After creating a joint index, the leaf node will contain the value of each index column at the same time, and it will be based on multiple columns at the same time. Column sorting, this sorting is similar to what we understand as dictionary order.

    For example, the index structure created for the above names and ages at the same time:

    What are clustered indexes, non-clustered indexes, joint indexes and unique indexes in MySQL

    (name, age)Both It's an abbreviation, and I can't think of a dozen names.

    Each leaf node saves all index columns at the same time. In addition, it still only contains the primary key id.

    Leftmost Prefix Matching Principle

    When an index is created for multiple columns, the index cannot be used as long as it contains the columns for which the index was created. The use of the index must follow the leftmost prefix matching principle.

    Assuming that an index is created for column (A, B, C), then only the following scenarios can use the index:

    • ##For column (A, B, C)/( Querying A, C) or (A, B) will match the index, and the index cannot be used for (C, A) or (B, C).

    • Wildcards can only be used in the form of LIKE 'val%', not LIKE '%VAL%', which will cause a full table scan.

    • Index columns cannot be operated. For example, WHERE A 1 = 5 will cause index failure.

    • Index columns cannot contain range value queries, such as LIKE/BETWEEN/>/
    • Index columns cannot contain NULL values.

    Index pushdown

    The new version of MySQL (5.6 or above) introduces an index pushdown mechanism: during the index traversal process, the index pushdown can be Fields are judged first, and records that do not meet the conditions are directly filtered out to reduce the number of table returns.

    For example, create a joint index for (name, age) in the above table. The query logic under normal circumstances is:

    • Find the corresponding primary key ID through name

    • Match the age condition based on the column of the id record

    This approach will lead to many unnecessary table returns. For example, (Zhang San, 10 ) and (Zhang San, 15). Now we want to query the record of (Zhang San, 20). When querying, first locate all primary key IDs that meet the conditions through Zhang San, and then traverse the rows that meet the conditions in the clustered index to see if there are records that match age = 20. In actual situations, there are no records that meet the conditions, so this table return process can be regarded as a futile move.

    The main function of index pushdown is to improve this. In the joint index, first filter out records that do not need to be returned to the table by name and age, and then return to the table to query the index to reduce the number of table returns.

    5. Unique index

    The unique index is an index that does not allow the same index value. The system checks whether there are duplicate key values ​​when creating the index. Every time the index is updated or added This is checked when logging. The primary key index is the unique index.

    The above is the detailed content of What are clustered indexes, non-clustered indexes, joint indexes and unique indexes in MySQL. For more information, please follow other related articles on the PHP Chinese website!

    Statement:
    This article is reproduced at:yisu.com. If there is any infringement, please contact admin@php.cn delete