SQL Server Query optimization method

Source: Internet
Author: User

SQL Server Query optimization method

The reasons for the slow query are many, the following are common

1, no index or no index (this is the most common problem of slow query, is the defect of program design)
2, I/O throughput is small, forming a bottleneck effect
3. No computed column created causes query not to be optimized
4. Insufficient memory
5. Slow network speed
6, the amount of data queried is too large (can use multiple queries, other methods to reduce the amount of data)
7, lock or deadlock (this is also the most common problem of slow query, is the defect of program design)
8, sp_lock,sp_who, the activity of the user view, the reason is to read and write competitive resources.
9. Return unnecessary rows and columns
10, query statement is not good, no optimization

You can refine the query by using the following methods

1, put the data, logs, indexes on different I/O devices, increase the read speed, previously can be tempdb should be placed on the RAID0, SQL2000 is not supported. The larger the amount of data (size), the more important it is to increase I/O.
2. Vertical and horizontal partition table, reduce the size of the table (Sp_spaceuse)
3. Upgrading Hardware
4, according to the query criteria, index, optimize the index, optimize access mode, limit the data volume of the result set. Note that the fill factor is appropriate (preferably using the default value of 0). The index should be as small as possible, using a Lie Jian index with a small number of bytes (refer to the creation of the index), do not Jianjian a single index on a limited number of values such as the Gender field
5, improve speed;
6, expand the memory of the server. Configure virtual Memory: The virtual memory size should be configured based on the services that are running concurrently on the computer. Consider setting the virtual memory size to 1.5 times times the physical memory installed on your computer. If you have additional full-text search features installed and you plan to run the Microsoft Search service to perform full-text indexing and querying, consider: Configure the virtual memory size to be at least 3 times times the physical memory installed on the computer. Configure the SQL Server max server memory server configuration option to 1.5 times times the physical memory (half of the virtual memory size setting).
7. Increase the number of server CPUs, but it is important to understand that parallel processing of serial processing requires resources such as memory. The use of parallel or string travel is the MSSQL automatic evaluation option. A single task is decomposed into multiple tasks and can be run on the processor. For example, delays in the sorting, connection, scanning and groupby sentence execution, SQL Server based on the load of the system to determine the optimal level of parallelism, complex need to consume a large number of CPU queries are most suitable for parallel processing. However, the update operation Update,insert,delete cannot be processed in parallel.
8, if you use like to query, simple to use index is not, but the full-text index consumption space. Like ' a% ' uses the index like '%a ' when querying with like '%a% ' without an index, the query time is proportional to the total length of the field value, so the char type is not used, but varchar. The full-text index is long for the value of the field.
9, DBServer and applicationserver separation, OLTP and OLAP separation
10. A distributed partitioned view can be used to implement a federation of database servers. A consortium is a set of servers that are managed separately, but they work together to share the processing load of the system. This mechanism of forming a federation of database servers through partitioned data can expand a set of servers to support the processing needs of large, multi-tiered web sites. For more information, see Designing federated database servers. (Refer to SQL Help file ' partitioned view ')

A, before implementing a partitioned view, you must first horizontally partition the table
b, after creating the member table, define a distributed partitioned view on each member server, and each view has the same name. This enables queries that reference the distributed partitioned view name to run on any member server. The system operates as if each member server has a copy of the original table, but there is only one member table and one distributed partitioned view on each server. The location of the data is transparent to the application.

11. Rebuild Index Dbccreindex,dbccindexdefrag, shrink data and log dbccshrinkdb, Dbccshrinkfile. Set the auto-shrink log. For large databases do not set the database autogrow, it will degrade the performance of the server. There's a lot of emphasis on T-SQL, and here's a list of common points: first, the DBMS processes the query plan:
1. Lexical and grammatical checking of query statements
2. Query optimizer to submit statements to the DBMS
3 optimization of optimized algebra and access paths
4. Generate query plan by precompiled module
5, and then at the appropriate time to submit to the system processing execution
6, finally return the execution result to the user second, look at the SQL Server data storage structure: A page size of 8K (8060) bytes, 8 pages for a disk area, according to B-Tree storage.
12. The difference between commit and rollback rollback: roll back all things. Commit: Commit the current thing. There is no need to write things in dynamic SQL, if you want to write on the outside such as: Begintranexec (@s) CommitTrans or write dynamic SQL as a function or stored procedure.
13, in the query SELECT statement using the WHERE clause to limit the number of rows returned, avoid table scan, if the return of unnecessary data, wasted the server's I/O resources, aggravating the burden of the network to reduce performance. If the table is large, locks the table during the table scan and prevents other joins from accessing the table, with serious consequences.
14. The SQL Comment Statement has no effect on execution
15, as far as possible without using the cursor, it occupies a large number of resources. If you need to execute row-by-row, try to use non-cursor technology, such as: In the client loop, with temporary tables, table variables, subqueries, with case statements and so on. Cursors can be categorized according to the extraction options it supports: forward-only the rows must be fetched in the order from the first row to the last row. Fetchnext is the only allowed fetch operation and is also the default. Scrollable can randomly fetch any row anywhere in the cursor. The technique of cursors becomes very powerful under SQL2000, and his purpose is to support loops.
There are four concurrency options:
READ_ONLY: The cursor is not allowed to locate updates (update), and there is no lock in the row that makes up the result set.
optimisticwithvalues: Optimistic concurrency control is a standard part of transaction control theory. Optimistic concurrency control is used in situations where there is only a small chance for a second user to update a row in the interval between opening the cursor and updating the row. When a cursor is opened with this option, there is no lock to control the rows in it, which will help maximize its processing power. If the user attempts to modify a row, the current value of this row is compared with the value obtained when the row was last fetched. If any value changes, the server will know that the other person has updated the row and will return an error. If the value is the same, the server executes the modification. Select this concurrency option optimisticwithrowversioning: This optimistic concurrency control option is based on row versioning. With row versioning, the table must have some version identifier that the server can use to determine whether the row has changed after it has been read into the cursor.
in SQL Server, this performance is provided by the timestamp data type, which is a binary number that represents the relative order of changes in the database. Each database has a global current timestamp value: @ @DBTS. Each time a row with a timestamp column is changed in any way, SQL Server stores the current @ @DBTS value in the timestamp column, and then increases the value of the @ @DBTS. If a table has a timestamp column, the timestamp is recorded at the row level. The server can compare the current timestamp value of a row with the timestamp value stored at the last fetch to determine whether the row has been updated. The server does not have to compare the values of all columns, just compare the timestamp columns. If an application requires optimistic concurrency based on row versioning for tables that do not have timestamp columns, Reise considers optimistic concurrency control based on numeric values.
scrolllocks This option for pessimistic concurrency control. In pessimistic concurrency control, when a row of a database is read into a cursor result set, the application attempts to lock the database row. When a server cursor is used, an update lock is placed on the row when it is read into the cursor. If a cursor is opened within a transaction, the transaction update lock is persisted until the transaction is committed or rolled back, and the cursor lock is dropped when the next row is fetched. If you open a cursor outside of a transaction, the lock is discarded when the next row is fetched. Therefore, each time a user needs full pessimistic concurrency control, the cursor should open within the transaction. An update lock prevents any other task from acquiring an update lock or exclusive lock, preventing other tasks from updating the row.
However, updating a lock does not prevent a shared lock, so it does not prevent other tasks from reading the row unless the second task also requires a read with an update lock. Scroll locks These cursor concurrency options can generate scroll locks based on the lock hints specified in the SELECT statement defined by the cursor. The scroll lock is fetched on each line at fetch and remains until the next fetch or the cursor closes, whichever occurs first. The next time the fetch occurs, the server acquires a scroll lock for the row in the new fetch and releases the last scroll lock to fetch rows. A scroll lock is independent of the transaction lock and can be persisted after a commit or rollback operation. If the option to close the cursor at commit is off, the commit statement does not close any open cursors, and the scroll lock is persisted to the commit to maintain isolation of the extracted data. The type of scroll lock acquired depends on the cursor concurrency option and the lock hint in the cursor SELECT statement.
lock Prompt read-only optimistic numeric optimistic row versioning lock silent unlocked unlocked unlock unlocked nolock unlocked unlocked unlocked holdlock shared share share update updlock error update update TABLOCKX error unlocked unlocked update other unlocked unlocked not locked updates * Specifying the NOLOCK hint will make the table with the hint specified in cursor read-only.
16, use Profiler to track the query, get the time required to query, find out the problem of SQL; optimizing indexes with the index optimizer
17, pay attention to the difference between union and UnionAll. UnionAll Good
18, pay attention to using distinct, do not use when not necessary, it will make the query slower than the union. Duplicate records are not a problem in the query.
19. Do not return rows or columns that are not required when querying
20, use sp_configure ' querygovernorcostlimit ' or setquery_governor_cost_limit to limit the resources that the query consumes. When an estimate query consumes more resources than the limit, the server automatically cancels the query and kills it before the query. Setlocktime Setting the lock time
21, use selecttop100/10percent to limit the number of rows returned by the user or Setrowcount to restrict the rows of the Operation
22, before SQL2000, generally do not use the following words: "ISNULL", "<>", "! =", "!>", "!<", "not", "notexists", "Notin", "Notlike", and " Like '%500 ', because they do not go index is all a table scan. Also do not add functions, such as convert,substring, in the WHERE clause, if you must use a function, create a computed column and then create an index instead. You can also work around: wheresubstring (firstname,1,1) = ' m ' Instead of Wherefirstnamelike ' m% ' (index Scan), be sure to separate the function from the column name. And the index cannot be built too much and too large. Notin will scan the table multiple times, using exists, notexists,in,leftouterjoin to replace, especially the left connection, and exists is faster than in, The slowest is the not operation. If the value of the column is empty, the previous index does not work, and now 2000 of the optimizer can handle it. The same is isnull, "not", "notexists", "notin" can optimize her, and "<>" and so still can not be optimized, not used to index.
23. Use Queryanalyzer to view the SQL statement's query plan and evaluate whether the analysis is optimized for SQL. The average 20% of the code occupies 80% of the resources, and the focus of our optimization is these slow places.
24, if you use in or OR and so on to find the query does not go index, using the display declaration specified index: select*frompersonmember (index=ix_title) Whereprocessidin (' Male ', ' female ')
25, will need to query the results of pre-calculated to put in the table, query time and then select. This is the most important means before SQL7.0. For example, the hospital's hospitalization fee calculation.
26, MIN () and Max () can use the appropriate index
27, the database has a principle is the code close to the data is better, so the preference to default, in turn, rules,triggers,constraint (constraint such as external health main jian Checkunique ..., the maximum length of data type, etc. are constraints), Procedure. This not only makes maintenance work small, it writes programs with high quality, and executes faster.
28, if you want to insert a large binary value into the image column, using stored procedures, do not use inline insert to insert (do not know whether Java). Because the application first converts the binary value to a string (twice times its size), the server receives the character and then converts it to a binary value. The stored procedure does not have these actions: Method: Createprocedurep_insertasinsertintotable ( Fimage) VALUES (@image), call this stored procedure in the foreground to pass in binary parameters, so processing speed significantly improved.
29, between at some point faster than in, between can quickly find the range based on the index. The difference is visible with the query optimizer. Select*fromchineseresumewheretitlein (' Male ', ' female ') Select*fromchineseresumewherebetween ' men ' and ' women ' are the same. Because in will be compared several times, it is sometimes slower.
30, it is necessary to create indexes on global or local temporary tables, sometimes to improve speed, but not necessarily, because the index also consumes a lot of resources. His creation is the same as the actual table.
31, do not build things that do not work, such as generating reports, wasting resources. Use it only when it is necessary to use things.
32. Words with or can be decomposed into multiple queries, and multiple queries are connected through union. Their speed is only related to whether the index is used or not, and if the query requires a federated index, it is more efficient to execute with UnionAll. The words of multiple or are not used in the index, and the form of Union is then tried to match the index. A key question is whether to use the index.
33, minimize the use of views, its low efficiency. The view operation is slower than the direct table operation and can be replaced by StoredProcedure. In particular, instead of nesting views, nested views add to the difficulty of finding the original data. We look at the nature of the view: It is a well-optimized SQL that has generated query planning on the server. When retrieving data for a single table, do not use a view that points to multiple tables, either directly from the table or only the view that contains the table, otherwise the unnecessary overhead is increased and the query is disturbed. In order to speed up the query of the view, MSSQL adds the function of the view index.
34, do not need to use distinct and the time-out, these actions can be changed in the client execution. They add extra overhead. This is the same truth as union and UnionAll. Selecttop20ad.companyname,comid,position,ad.referenceid,worklocation,convert (varchar), ad.postDate,120) Aspostdate1,workyear,degreedescriptionfromjobcn_query.dbo.companyad_queryadwherereferenceidin (' JCNAD00329667 ', ' JCNAD132168 ', ' JCNAD00337748 ', ' JCNAD00338345 ', ' JCNAD00333138 ', ' JCNAD00303570 ', ' JCNAD00303569 ', ' JCNAD00303568 ', ' JCNAD00306698 ', ' JCNAD00231935 ', ' JCNAD00231933 ', ' JCNAD00254567 ', ' JCNAD00254585 ', ' JCNAD00254608 ', ' JCNAD00254607 ' ', ' JCNAD00258524 ', ' JCNAD00332133 ', ' JCNAD00268618 ', ' JCNAD00279196 ', ' JCNAD00268613 ') Orderbypostdatedesc
35, in the face value of the list, will appear the most frequent values on the front, the least appear in the last face, reduce the number of judgments
36, when using Selectinto, it will lock the system table (sysobjects,sysindexes, etc.), blocking the access of other connections. Create a temporary table with a declaration statement instead of Selectinto.droptablet_lxhbegintranselect*intot_lxhfromchineseresumewherename= ' XYZ '-- Commit in another connection select*fromsysobjects can see that selectinto locks the system table, and CreateTable locks the system table (whether it is a temporary table or a system table). So don't use it in things!!! In this case, use a real table, or a temporary table variable, if it is a temporary table that you want to use frequently.
37, generally in the groupby have a sentence before you can eliminate the redundant lines, so try not to use them to do the work of the culling line. Their order of execution should be optimal: the WHERE clause of the SELECT selects all the appropriate rows, GroupBy is used to group the statistical rows, and the having words are used to exclude redundant groupings. So groupby a having the overhead of small, query fast. For large data rows to group and have a very consuming resource. If the purpose of GroupBy does not include calculation, just grouping, then use distinct faster
38, one update multiple records score multiple updates each time a fast, that is, batch processing good
39, the use of temporary tables, as far as possible to use the result set and table class variables to replace it, table type of variable than temporary table good
40, under SQL2000, the calculated field can be indexed, the conditions to be met are as follows:
A, the expression of the calculated field is determined
B, cannot be used in the Text,ntext,image data type
C, the following options must be formulated Ansi_nulls=on,ansi_paddings=on,.......
41, try to put the data processing work on the server, reduce the network overhead, such as the use of stored procedures. Stored procedures are compiled, optimized, and organized into an execution plan, and stored in a database of SQL statements, is a collection of control flow language, the speed of course fast. Dynamic SQL, which is executed repeatedly, can use temporary stored procedures that are placed in tempdb (temporary tables). Previously, because SQL Server did not support complex math calculations, it was forced to put this work on top of other tiers and increase the overhead of the network. SQL2000 supports UDFs, which now supports complex mathematical calculations, the return value of functions is not too large, which is expensive. A user-defined function that executes like a cursor consumes a large amount of resources, if a large result is returned with a stored procedure
42. Do not use the same function repeatedly in a sentence, wasting resources, putting the result in a variable and then calling faster
43, SelectCount (*) efficiency teaching low, as far as possible to adapt his writing, and exists fast. Also note the difference: SelectCount (fieldofnull) fromtable and SelectCount ( Fieldofnotnull) The return value of fromtable is different.
44, when the server memory enough, the number of configuration threads = The maximum number of connections +5, so as to maximize the efficiency; otherwise, the thread pool of SQL Server is enabled using the number of configuration threads < Maximum number of connections, or if the number = maximum number of connections is +5, severely compromising the performance of the servers.
45, in a certain order to access your table. If you lock table A and then lock table B, you must lock them in this order in all stored procedures. If you (inadvertently) lock table B in a stored procedure, and then lock Table A, this could result in a deadlock. Deadlocks are hard to find if the lock sequence is not designed in advance
46. Monitor the load memory:pagefaults/sec counter of the corresponding hardware through Sqlserverperformancemonitor if the value is occasionally higher, it indicates that the thread is competing for memory. If it continues to be high, then memory can be a bottleneck. Process:
1.%dpctime refers to the percentage of the processor used in the deferred program invocation (DPC) to receive and provide services during the sample interval. (DPC is running at a lower interval than the standard interval priority). Because DPC is performed in privileged mode, the percentage of DPC time is part of the percentage of privileged time. These times are calculated separately and are not part of the total number of interval calculations. This total shows the average busy time as a percentage of the instance time.
2,%processortime counter if the value of this parameter continues to exceed 95%, the bottleneck is the CPU. Consider adding a processor or swapping it for a faster one.
3.%privilegedtime refers to the percentage of non-idle processor time used for privileged mode. (Privileged mode is a processing mode designed for operating system components and manipulating hardware drivers.) It allows direct access to hardware and all memory. Another mode is User mode, which is a limited processing mode designed for application, environment sub-system and integer sub-system. The operating system translates the application thread into privileged mode to access the operating system services). The% of privileged time includes the time to service the interruption and DPC. A high privilege time ratio can be caused by a large number of intervals that failed devices produce. This counter displays the average busy time as part of the sample time.
4,%usertime represents CPU-consuming database operations, such as sorting, execution aggregatefunctions, and so on. If the value is high, consider increasing the index, using a simple table join, and horizontally splitting the large table to reduce the value. Physicaldisk:curretndiskqueuelength counter This value should be no more than 1.5~2 times the number of disks. To improve performance, you can increase the disk. Sqlserver:cachehitratio counter the higher the value the better. If it lasts below 80%, you should consider increasing the memory. Note that the value of this parameter is incremented since SQL Server was started, so the value will not reflect the current value of the system after a period of time has elapsed.
47, analysis selectemp_nameformemployeewheresalary>3000 If the salary is a float type in this statement, the optimizer optimizes it to convert (float,3000), Since 3000 is an integer, we should use 3000.0 in programming instead of waiting for the DBMS to be transformed by the runtime. Conversions of the same character and integer data.

Http://www.cnblogs.com/yanghaibo/articles/1698788.html

SQL Server Query optimization method

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.