Topic Center

Contact Sales

Home > Developer > MySQL

Summary of Nested-loop join algorithm in Mysql _mysql

Last Update:2017-01-19 Source: Internet

Author: User

Tags hash

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

Unconsciously played two years of MySQL, found that a lot of people say that MySQL compared to Oracle, the optimizer does a poor job, in fact, in a way, indeed, But after all, MySQL only to 5.7 version, Oracle has developed to 12c, today I looked at the MySQL connection algorithm, well, now still do not support hash join, only nested-loop join, then today summed up my learning experience it.

Nested-loop Join basic algorithm implementation, pseudo code is this:

For each row in T1 matching range {For each
 row in T2 matching reference key {for each
  row in T3 {
   if row SA Tisfies join conditions,
   Send to Client
  }
 }

This code is very simple, although I do not write code, but I can understand. Here, suppose there are three tables, T1, T2, T3, this code, which shows the range, ref, and all in the explain plan, is shown in the SQL Execution plan layer, T3 will perform a full table scan, and I saw a very evil optimization SQL method in this place today, straight-join:http://hidba.ga/2014/09/26/join-query-in-mysql/, which mentions the concept of a driver table, then the driver table is the T3 table in Pseudocode, Bovenri said MySQL will automatically select the smallest result set as the driver table, as the algorithm analysis, so select the driver table is really the least cost. So it's also mentioned here that by narrowing the drive table result set for connection optimization, the resulting set of smaller drive tables can actually reduce the number of loops.

Of course, MySQL itself on the basis of this algorithm, the evolution of the block Nested-loop join algorithm, in fact, basically and the above algorithm is no different, pseudo code as follows:

For each row in T1 matching range {
 for each row in T2 matching reference key {
  store used columns to T1, T2 in Join buffer
  If buffer is full {
   for each row in T3 {
    for each T1, T2 combination in join buffer {
     if row s Atisfies join conditions,
     Send to Client
    }
   }
   empty buffer
  }
 }

if buffer isn't Empty {For each
 row in T3 {for each
  T1, T2 combination in join buffer {
   If row satisfies join conditions,
   Send to Client
  }
 }

In this algorithm, the data in the outer loop is cached in the join buffer, and the data in the table round buffer in the inner loop is compared to reduce the number of cycles, which can increase the efficiency. The official website has a example, I a bit does not understand: If has 10 rows to be cached in the buffer, these 10 lines are passed to the inner loop, all the inner loop's line will compare with this 10 rows in the buffer. The original text is like this:

For example, if ten rows are read into a buffer and the ' buffer is ' passed to the next inner loop, each row read in the inner Loop can is compared against all rows in the buffer
If s refers to T1, the size of the T2 combination in the cache, and C is the number of these combinations in buffer, the number of T3 tables scanned should be:

(S * C)/join_buffer_size + 1

According to this formula, the larger the join_buffer_size, the smaller the number of scans, if the join_buffer_size to be able to cache all the previous row combinations, then it is the best performance, then the increase will have no effect.

In the case of indexing, MySQL will try to use the index Nested-loop join algorithm, in some cases, may join the column is no index, then MySQL's choice is definitely not the first introduction of the simple Nested-loop join algorithm , because the algorithm is too rough to look straight. Complex SQL with larger data may not run out of results for years, if you don't believe it, it's too young too. Or inside can give you some SQL run to see.

The disadvantage of the simple nested-loop join algorithm is that it scans the inner table too many times, resulting in too large a scan record. The improvement of the block Nested-loop join algorithm compared to simple nested-loop join is that it can reduce the number of scans in the inner table, and even like the hash join algorithm, it only needs to scan the table once.

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

Related Keywords:

join query in php mysql nested while loop in c how to join tables in mysql how to join two queries in mysql inner join in php mysql example join query in php mysql example mysql join query example in php

MySQL batch update and batch update different values for mult... 01-13

The solution of no package Mysql-server available error when ... 05-28

The efficiency of MySQL nested query and connection query 11-16

MySQL row-level lock, table-level lock, page-level lock Detai... 12-17

MySQL Case statement (with instance) 04-01

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

What's Trending

Top 10 Tags

datastax versions naming convention zookeeper client class definition md5 microsoft sql server 2005 data structures exception handling error handling

Top 10 Keywords

microsoft download center down wordpress address url site address url wordpress address url windows installer 4 0 download 302 not found web address url definition site address url wordpress db2 integer mac os installation step by step pdf abbreviation for return

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Summary of Nested-loop join algorithm in Mysql _mysql

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support