Simple rewriting and optimization of MySQL subqueries

Last Update:2015-05-10 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

DBAs or developers who have used oracle or other relational databases have such experience. In subqueries, the database has been optimized and can choose to drive table execution, we are then porting this experience to the mysql database, but unfortunately, mysql may be disappointing in processing subqueries, this problem occurs in our production system:

select i_id, sum(i_sell) as i_sellfrom table_datawhere i_id in (select i_id from table_data where Gmt_create >= '2011-10-07 00:00:00′)group by i_id;

(Note: the business logic of SQL can be used as an example: first, we can query the sales volume of the 100 new books sold on the 10-07, and then query the sales volume of the 100 new books sold throughout the year ).

The performance problem of this SQL statement lies in the weakness of the mysql optimizer in processing subqueries. The mysql optimizer will rewrite subqueries when processing subqueries. In general, we want to complete the subquery results from the inside out, and then use the subquery to drive the external query tables to complete the query; however, mysql will first scan all the data in the External table, and each data entry will be uploaded to the subquery for association with the subquery. If the External table is large, performance issues will occur;

For the above query, because the table table_data contains million data, and there are a large number of duplicate data records in the subquery, this requires nearly connections, due to a large number of associations, this SQL statement has not been executed for several hours, so we need to rewrite the SQL statement:

SELECT t2.i_id, SUM(t2.i_sell) AS soldFROM (SELECT distinct i_id FROM table_dataWHERE gmt_create >= '2011-10-07 00:00:00′) t1, table_data t2WHERE t1.i_id = t2.i_id GROUP BY t2.i_id;

We change the subquery to join, and add distinct to the subquery to reduce the number of times t1 is associated with t2;

After the transformation, the SQL Execution time is reduced to less than Ms.

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Simple rewriting and optimization of MySQL subqueries

Contact Us

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support