MySQL Nested-Loop Join algorithm learning

Last Update:2018-04-27 Source: Internet

Author: User

After working with MySQL for more than two years, I have found that many people say the MySQL optimizer is inferior to Oracle's. To some extent that is true. However, MySQL has only reached version 5.7, while Oracle has developed as far as 12c. Today I looked at MySQL's join algorithms. As of 5.7, Hash Join is still not supported and only Nested-Loop Join is available, so let me summarize what I learned today.

The basic algorithm implementation of Nested-Loop Join is as follows:

for each row in t1 matching range {
  for each row in t2 matching reference key {
    for each row in t3 {
      if row satisfies join conditions, send to client
    }
  }
}

This code is very simple; although I do not write much code, I can still follow it. It assumes three tables, t1, t2, and t3, whose access types in the EXPLAIN plan are range, ref, and ALL respectively, so at the execution-plan level t3 gets a full table scan. Today I also came across a neat SQL optimization technique that applies here: STRAIGHT_JOIN. As mentioned before, one way to optimize a join is to shrink the result set of the driving table, and the pseudocode shows why: a driving table with a small result set does reduce the number of loop iterations.
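To make the loop structure concrete, here is a minimal Python sketch of the pseudocode above. It is only an illustration, not MySQL's implementation: the function name nested_loop_join, the predicates on12 and on23, and the sample rows are all made up for this example, and the "ref" lookup on t2 is simulated with a plain loop rather than an index.

```python
def nested_loop_join(t1, t2, t3, on12, on23):
    """Naive three-table nested-loop join (illustrative sketch only).

    Returns the joined rows and the number of full passes made over t3.
    """
    result = []
    t3_scans = 0
    for r1 in t1:                      # outer loop: the driving table
        for r2 in t2:                  # stand-in for the "ref" lookup on t2
            if not on12(r1, r2):
                continue
            t3_scans += 1              # one full pass over t3 per (r1, r2) pair
            for r3 in t3:              # type=ALL: every row of t3 is read
                if on23(r2, r3):
                    result.append((r1, r2, r3))
    return result, t3_scans


# Hypothetical sample data
t1 = [{"id": 1}, {"id": 2}]
t2 = [{"t1_id": 1, "k": "a"}, {"t1_id": 2, "k": "b"}, {"t1_id": 2, "k": "c"}]
t3 = [{"k": "a"}, {"k": "c"}, {"k": "x"}]

on12 = lambda r1, r2: r1["id"] == r2["t1_id"]
on23 = lambda r2, r3: r2["k"] == r3["k"]

rows, t3_scans = nested_loop_join(t1, t2, t3, on12, on23)

# Narrowing the driving table to a single row means fewer passes over t3.
_, t3_scans_small = nested_loop_join(t1[:1], t2, t3, on12, on23)
```

With this data the full join reads t3 three times, once per matching (t1, t2) pair, while the run with a one-row driving table reads it only once. That is the effect the paragraph above describes.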

Building on this algorithm, MySQL introduced the Block Nested-Loop Join algorithm. It is essentially the same as the algorithm above, except that rows from the outer loops are buffered before the inner table is read. The pseudocode is as follows:

for each row in t1 matching range {
  for each row in t2 matching reference key {
    store used columns from t1, t2 in join buffer
    if buffer is full {
      for each row in t3 {
        for each t1, t2 combination in join buffer {
          if row satisfies join conditions, send to client
        }
      }
      empty buffer
    }
  }
}

if buffer is not empty {
  for each row in t3 {
    for each t1, t2 combination in join buffer {
      if row satisfies join conditions, send to client
    }
  }
}

This algorithm caches rows from the outer loops in the join buffer, and each row read in the inner loop is compared against all the rows in the buffer. This reduces the number of times the inner table has to be read, which improves efficiency. The official documentation gives an example that I do not fully understand: if 10 rows are cached in the buffer and the buffer is passed to the inner loop, every row read in the inner loop is compared against all 10 rows in the buffer. The original text is as follows:

For example, if 10 rows are read into a buffer and the buffer is passed to the next inner loop, each row read in the inner loop can be compared against all 10 rows in the buffer.
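The Block Nested-Loop pseudocode above can be sketched in Python in the same way. Again this is only an illustration under stated assumptions: the buffer is sized in rows (the hypothetical buffer_rows parameter) rather than in bytes as join_buffer_size is, whole rows are stored instead of just the used columns, and all names and sample data are invented for the example.

```python
def block_nested_loop_join(t1, t2, t3, on12, on23, buffer_rows):
    """Block nested-loop join sketch: buffer (t1, t2) pairs, then scan t3 once per batch.

    buffer_rows is a hypothetical capacity in rows; MySQL sizes the buffer in bytes.
    """
    result = []
    t3_scans = 0
    buffer = []

    def flush():
        nonlocal t3_scans
        t3_scans += 1                        # one pass over t3 serves the whole batch
        for r3 in t3:
            for r1, r2 in buffer:            # compare this t3 row with every buffered pair
                if on23(r2, r3):
                    result.append((r1, r2, r3))
        buffer.clear()                       # "empty buffer"

    for r1 in t1:
        for r2 in t2:
            if on12(r1, r2):
                buffer.append((r1, r2))      # "store used columns from t1, t2"
                if len(buffer) == buffer_rows:
                    flush()                  # "if buffer is full"
    if buffer:                               # "if buffer is not empty"
        flush()
    return result, t3_scans


# Hypothetical sample data
t1 = [{"id": 1}, {"id": 2}]
t2 = [{"t1_id": 1, "k": "a"}, {"t1_id": 2, "k": "b"}, {"t1_id": 2, "k": "c"}]
t3 = [{"k": "a"}, {"k": "c"}, {"k": "x"}]

on12 = lambda r1, r2: r1["id"] == r2["t1_id"]
on23 = lambda r2, r3: r2["k"] == r3["k"]

rows_small, scans_small = block_nested_loop_join(t1, t2, t3, on12, on23, buffer_rows=2)
rows_large, scans_large = block_nested_loop_join(t1, t2, t3, on12, on23, buffer_rows=10)
```

There are three matching (t1, t2) pairs here. With room for two pairs the buffer fills once and is flushed again at the end, so t3 is read twice; with room for ten pairs a single pass over t3 is enough. Both runs return the same two joined rows.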

If S is the size of each stored t1, t2 combination in the join buffer, and C is the number of such combinations, the number of times table t3 is scanned is:

  (S * C)/join_buffer_size + 1

According to this formula, the larger join_buffer_size is, the fewer scans are needed. Performance is best once join_buffer_size is large enough to hold all of the preceding row combinations; increasing it beyond that point has no further effect.
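As a quick check of the formula, here is a small Python calculation. The helper name t3_scan_count and the figures for S and C are hypothetical, and reading the division as integer division is an assumption made for this sketch.

```python
def t3_scan_count(s_bytes, c_combinations, join_buffer_size):
    """Evaluate (S * C) / join_buffer_size + 1, using integer division (an assumption)."""
    return (s_bytes * c_combinations) // join_buffer_size + 1


S = 64        # hypothetical size in bytes of one stored t1, t2 combination
C = 10_000    # hypothetical number of combinations

scans_256k = t3_scan_count(S, C, 256 * 1024)
scans_1m = t3_scan_count(S, C, 1024 * 1024)
scans_4m = t3_scan_count(S, C, 4 * 1024 * 1024)
```

With these numbers the combinations take 640,000 bytes in total. A 256 KB buffer gives 3 scans of t3, a 1 MB buffer gives 1, and a 4 MB buffer still gives 1, which matches the point that growing the buffer past the size of all combinations brings no further gain.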
