SQL Server Performance Tuning Execution plan (execution plan) tuning

Last Update:2018-06-17 Source: Internet

Author: User

Tags one table

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

There are three join strategies for SQL Server: Hash join,merge join,nested Loop join.

hash Join: Used to handle data that is not ordered/not indexed, it creates a hash table in memory of the data (the associated key) on both sides of the Join. For example, with the following query statement, the associated two tables are not indexed, and the execution plan is displayed as a hash Join.

[SQL]

SELECT
sh.*
From
Salesordheaderdemo as sh
JOIN
Salesorddetaildemo as SD
On
Sh. Salesorderid=sd. SalesOrderID
GO

Merge Join: Used to process indexed data, which is lighter than hash join. We index the associated columns of the previous two tables, and then again the above query, the execution plan changes to the merge Join

[SQL]

CREATE UNIQUE CLUSTERED INDEX idx_salesorderheaderdemo_salesorderid on salesordheaderdemo (SalesOrderID) /c4>
GO
CREATE UNIQUE CLUSTERED INDEX idx_salesdetail_salesorderlid on salesorddetaildemo (SalesOrderID, Salesorderdetailid)
GO

Nested Loop Join: On the basis of meeting the merge join, if there is less data on one side, SQL Server takes the one with less data as an outer loop and the other as an internal loop to complete the join process. Continuing with the previous example, adding a where statement to the query statement to reduce the amount of data on the Join side, the execution plan appears as a nested Loop Join.

[SQL]

SELECT
sh.*
From
Salesordheaderdemo as sh
JOIN
Salesorddetaildemo as SD
On
Sh. Salesorderid=sd. SalesOrderID
WHERE
Sh. salesorderid=43659

Improvements in the execution plan (Table/index scan)

On many occasions we need to extract a small amount of data from a table that contains a lot of data, so scan should be avoided because scan processing will traverse each line, which is time consuming. Let's look at an example:

[SQL]

SELECT
Sh. SalesOrderID
From
Salesordheaderdemo as sh
JOIN
Salesorddetaildemo as SD
On
Sh. Salesorderid=sd. SalesOrderID
WHERE
Sh. orderdate=' 2005-07-01 00:00:00.000 '
GO

The red circle in the diagram marks the table scan, and the execution plan is intelligently recommended for indexing. Let's first try to build an index on the Salesordheader table:

[SQL]

CREATE UNIQUE CLUSTERED INDEX idx_salesorderheaderdemo_salesorderid on salesordheaderdemo (SalesOrderID) /c5>
GO

Then execute the same query statement again, and the execution plan becomes the following:

Table Scan changes to index scan and continues to index another table:

[SQL]

CREATE UNIQUE CLUSTERED INDEX idx_salesdetail_salesorderlid on salesorddetaildemo (SalesOrderID, Salesorderdetailid)
GO

The following changes have occurred in the execution plan:

While it is not possible to say that Scan is worse than seek, seek is a better choice for most occasions (especially when finding small amounts of data in many data). For example, if you have a table of billions of data and you want to take 100 of them, you should make sure that you use Seek, but if you need to take out most of the data (say 95%), Scan might be better. (a more authoritative article gives the threshold value of 30%, which is more efficient when fetching more than 30% data, whereas Seek is better)

In addition, you may notice that both tables are indexed but one table is represented as Clustered Index scan in the execution plan, and the other is Clustered index seek, and we are not looking for two Clustered index seek? This is because the previous table does not have an assertion (predicate), and the latter table asserts the SalesOrderID with the on keyword.

Key Lookup in the execution plan

For the following example, we first set up two different indexes on the same table:

[SQL]

CREATE UNIQUE CLUSTERED INDEX idx_salesdetail_salesorderlid on salesorddetaildemo (SalesOrderID, Salesorderdetailid)
GO
CREATE nonclustered INDEX idx_non_clust_salesorddetaildemo_modifieddate on salesorddetaildemo (ModifiedDate)
GO

Execute the following query:

[SQL]

SELECT
ModifiedDate
From Salesorddetaildemo
WHERE modifieddate=' 2005-07-01 00:00:00.000 '
GO

Execution plan For example, he used the non-clustered index, which we previously established on the ModifiedDate field, to become an index Seek treatment.

Let's change the query statement by adding two additional fields to the SELECT:

[SQL]

SELECT
ModifiedDate,
SalesOrderID,
Salesorderdetailid
From Salesorddetaildemo
WHERE modifieddate=' 2005-07-01 00:00:00.000 '
GO

The execution plan, like, basically unchanged:

The above selected field is not belong to non-clustered index is Clustered index, if you add a few other fields?

[SQL]

SELECT
ModifiedDate,
SalesOrderID,
Salesorderdetailid,
ProductID,
UnitPrice
From Salesorddetaildemo
WHERE modifieddate=' 2005-07-01 00:00:00.000 '
GO

Baby, execute the plan a little more. Two processing (Key Lookup, Nested Loop):

Key lookup is a heavy processing, and we can avoid key lookup by using the keyword with to specify the use of Clustered Index.

[SQL]

SELECT
ModifiedDate,
SalesOrderID,
Salesorderdetailid,
ProductID,
UnitPrice
From Salesorddetaildemo with (index=idx_salesdetail_salesorderlid)
WHERE modifieddate=' 2005-07-01 00:00:00.000 '
GO

The execution plan turns into a Clustered Index Scan:

Previously mentioned Scan does not seem to be a good deal, then the dwarf in a higher, using SET STATISTICS IO on to compare:

[SQL]

SET STATISTICS IO on
GO
SELECT
ModifiedDate,
SalesOrderID,
Salesorderdetailid,
ProductID,
UnitPrice
From Salesorddetaildemo
WHERE modifieddate=' 2005-07-01 00:00:00.000 '
GO
SELECT
ModifiedDate,
SalesOrderID,
Salesorderdetailid,
ProductID,
UnitPrice
From Salesorddetaildemo with (index=idx_salesdetail_salesorderlid)
WHERE modifieddate=' 2005-07-01 00:00:00.000 '
GO
SELECT
ModifiedDate,
SalesOrderID,
Salesorderdetailid,
ProductID,
UnitPrice
From Salesorddetaildemo with (index=idx_non_clust_salesorddetaildemo_modifieddate)
WHERE modifieddate=' 2005-07-01 00:00:00.000 '
GO

Compared with the clustered index query performance is the worst, and the SET STATISTICS IO output Data clustered index query on logical reads spent more time.

Looks like the non-clustered index + Key lookup execution plan is good, but if you can avoid Key lookup is perfect, let's revise non-clustered index to include it in the index with the Include keyword His fields are:

[SQL]

DROP INDEX idx_non_clust_salesorddetaildemo_modifieddate on salesorddetaildemo
GO
CREATE nonclustered INDEX idx_non_clust_salesorddetaildemo_modifieddate on salesorddetaildemo (ModifiedDate)
INCLUDE
(
ProductID,
UnitPrice
)
GO
--Clear the cache, only for the development environment!
DBCC Freeproccache
DBCC dropcleanbuffers
GO

Execute the previous query again:

[SQL]

SELECT
ModifiedDate,
SalesOrderID,
Salesorderdetailid,
ProductID,
UnitPrice
From Salesorddetaildemo
WHERE modifieddate=' 2005-07-01 00:00:00.000 '
GO

This is perfect because our query fields are included in the index, so the execution plan is eventually optimized to index Seek.

SQL Server Performance Tuning Execution plan (execution plan) tuning

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More