Basic concepts about MYSQL indexes: bitsCN.com
1. what is an index?
Indexes are used to quickly search for records with specific values. all MySQL indexes are saved as B-trees. If no index exists, MySQL must scan all the records of the entire table from the first record until the required records are found. The more records in the table, the higher the operation cost. If an index has been created on the column used as a search condition, MySQL can quickly obtain the location of the target record without scanning any records. If the table has 1000 million records, the index search records should be at least 100 times faster than the sequential scan records.
Suppose we have created a table named "people:
Create table people (peopleid smallint not null, name CHAR (50) not null );
Then, we randomly insert 1000 different name values into the people table. There is no clear order for the name column in the data file. If we create an index for the name column, MySQL will sort the name column in the index. for each item in the index, mySQL internally stores the "pointer" of the location where the actual record is located in a data file ". Therefore, if we want to find the peopleid of the record whose name is equal to "Mike" (the SQL command is "SELECT peopleid FROM people WHERE name = Mike ;"), mySQL can search for the "Mike" value in the name index, and then directly go to the corresponding row in the data file to return the peopleid (999) of the row accurately ). In this process, MySQL only needs to process one row to return results. If there is no index for the "name" column, MySQL will scan all records in the data file, that is, 1000 records! Obviously, the less records that need to be processed by MySQL, the faster it can complete the task.
II. index types
MySQL provides multiple index types:
Common index:
This is the most basic index type, and it does not have uniqueness or the like **. Common indexes can be created in the following ways:
CREATE an INDEX, for example, CREATE INDEX <索引的名字> ON tablename (column list );
Modify a TABLE, such as alter table tablename add index [INDEX name] (column list );
Specify an INDEX when creating a TABLE, for example, create table tablename ([...], INDEX [INDEX name] (column list ));
Unique index:
This index is basically the same as the previous "normal index", but there is a difference: all values of the index column can only appear once, that is, they must be unique. You can create a unique index in the following ways:
CREATE an INDEX, for example, CREATE UNIQUE INDEX <索引的名字> ON tablename (column list );
Modify a TABLE, such as alter table tablename add unique [index name] (column list );
Specify an index when creating a TABLE, for example, create table tablename ([...], UNIQUE [index name] (column list ));
Primary key:
A primary key is a unique index, but it must be specified as a "primary key ". If you have used columns of the AUTO_INCREMENT type, you may already be familiar with primary keys and other concepts. The primary key is generally specified during TABLE creation, for example, "create table tablename ([...], primary key (column list ));". However, we can also ADD a primary key by modifying the TABLE, for example, "alter table tablename add primary key (column list );". Each table can have only one primary key.
Full text index:
MySQL supports full-text indexing and full-text retrieval from version 3.23.23. In MySQL, the full-text index type is FULLTEXT. Full-TEXT indexes can be created on VARCHAR or TEXT columns. It can be created using the create table command or the alter table or create index command. For large-scale datasets, using the alter table (or create index) command to CREATE a full-text INDEX is faster than inserting a record into an empty TABLE with a full-text INDEX. Full-text indexing is not covered in the following discussions. For more information, see MySQL documentation.
III. single column index and multi-column index
An index can be a single-column index or multiple-column index. The following is an example to illustrate the differences between the two indexes. Suppose there is a people table:
Create table people (peopleid smallint not null AUTO_INCREMENT, firstname CHAR (50) not null, lastname CHAR (50) not null, age smallint not null, townid smallint not null, primary key (peopleid ));
The following figure shows the data we inserted into the people table:
In this data segment, there are four people named "Mikes" (two named Sullivans and two named McConnells), two 17-year-olds, and one named Joe Smith.
This table is mainly used to return the corresponding peopleid based on the specified user name, name, and age. For example, you can search for the leid of a user named Mike Sullivan AND a 17-year-old user (the SQL command is SELECT peopleid FROM people WHERE firstname = Mike AND lastname = Sullivan AND age = 17 ;). Because we don't want MySQL to scan the entire table every time it executes a query, we need to consider using indexes here.
First, we can consider creating an index on a single column, such as the firstname, lastname, or age column. If we create an INDEX for the firstname column (alter table people add index firstname (firstname);), MySQL will use this INDEX to quickly search for records whose firstname is Mike, then, search for other conditions on the intermediate result set: it first excluded the records whose lastname is not equal to "Sullivan", and then excluded those records whose age is not equal to 17. When the record meets all search criteria, MySQL returns the final search result.
Because the index of the firstname column is created, MySQL improves the efficiency much compared to the full scan of the execution table, however, we require that the number of records scanned by MySQL still far exceeds what is actually needed. Although we can delete the index on the firstname column and then create the index on the lastname or age column, it seems that no matter which column is created, the search efficiency is still similar.
To improve search efficiency, we need to consider using multi-column indexes. If you create a multi-column index for the columns firstname, lastname, and age
The SQL statement only needs to be retrieved once to find the correct result! The following is an SQL command to create this multi-column index:
Alter table people add index fname_lname_age (firstname, lastname, age );
Because the index file is saved in B-tree format, MySQL can immediately convert it to the appropriate firstname, then to the appropriate lastname, and finally to the appropriate age. Without scanning any record of the data file, MySQL finds the target record correctly!
So, if you create a single column index on the columns firstname, lastname, and age respectively, will the result be the same as creating a multiple column index of firstname, lastname, and age? The answer is no. the two are completely different. When we execute a query, MySQL can only use one index. If you have three single-column indexes, MySQL will try to select the ** strictest index. However, even if it is the ** most rigorous single-column index, its ** capability is certainly far lower than the multiple-column index of the three columns firstname, lastname, and age.
4. leftmost prefix
Multi-column index has another advantage, which is embodied by the concept of Leftmost Prefixing. Continue to consider the previous example. now we have a multiple-column index on the firstname, lastname, and age columns. We call this index fname_lname_age. When the search condition is a combination of the following columns, MySQL uses the fname_lname_age index:
Firstname, lastname, agefirstname, lastnamefirstname
On the other hand, it is equivalent to the index created on the combination of columns (firstname, lastname, age), (firstname, lastname), and (firstname. All of the following queries can use this fname_lname_age index:
SELECT peopleid FROM people WHERE firstname = Mike AND lastname = Sullivan AND age = 17; SELECT peopleid FROM people WHERE firstname = Mike AND lastname = Sullivan; SELECT peopleid FROM people WHERE firstname = Mike; the following queries cannot use the index at all: SELECT peopleid FROM people WHERE lastname = Sullivan; SELECT peopleid FROM people WHERE age = 17; SELECT peopleid FROM people WHERE lastname = Sullivan AND age = 17;
5. select an index column
In the performance optimization process, selecting the columns to create an index is one of the most important steps. You can consider using two types of indexes: columns that appear in the WHERE clause and columns that appear in the join clause. See the following query:
SELECT age # Do not use the index FROM people WHERE firstname = Mike # consider using the index AND lastname = Sullivan # consider using the index
This query is slightly different from the previous query, but it is still a simple query. Because age is referenced in the SELECT part, MySQL does not use it for ** column selection operations. Therefore, it is unnecessary to create an index for the age column for this query. The following is a more complex example:
SELECT people. age, ## do not use the index town. name ## FROM people left join town ONpeople. townid = town. townid # Use Index WHERE firstname = Mike # Use Index AND lastname = Sullivan # Use Index
As in the preceding example, because firstname and lastname appear in the WHERE clause, the two columns still need to create indexes. In addition, because the townid of the town table is listed in the join clause, we need to consider creating an index for this column. So can we simply think that every column in The WHERE clause and join clause should be indexed? This is almost the case, but not completely. We must also consider the operator types for column comparison. MySQL uses bitsCN.com only for the following operators.