AboutDB2 databaseThe explanation of the main misunderstanding of Performance Understanding is more suitable for beginners who are just getting started to learn, and provides a good reference for everyone to master the knowledge of DB2 database, in this way, we have mastered the correct knowledge and avoided detours in the future, hoping to help you.
Main misunderstandings about DB2 database performance:
Logical design should always be fully mapped to Physical Design
Actual: the physical design in the DB2 database design should be as close as possible to the logical structure, but physical design changes made for performance cannot be ignored because they do not come from the logical design.
Put everything in a buffer pool (BP0) for DB2 to manage
Actually: As described in the DB2 manual and elsewhere, you have to (10000 4 k pages or less) with very limited memory ), you don't have time to manage it, and you didn't consider performance. It is recommended that you do not place anything except the DB2 catalog and Directory into BP0.
DSNDB07 is 100% ordered
Actually: DSNDB07 is never in the 100% order, because there is a random activity on the page in the working file. The activity may be as high as 45%, but usually ranges from 3% to 10%.
VARCHAR should always be placed at the end of the row
Actually: This is always a problem. If the table is always read and has very few updates, this will reduce the CPU load, but in other cases, this is the worst, even if the table is compressed. It should be placed at the end only when it is updated frequently, but not usually.
The program should be encoded in a way that follows the logical process
Actual: pseudo code or a logical process diagram does not need to consider performance-related encoding methods. This is very dramatic in the OLTP transaction code.
Most processes are not performed in SQL
Reality: in fact, the opposite is often true. SQL is a rich language that can process most processes. In fact, the biggest difficulty is that SQL is often used as an I/O processor rather than a collection processor.
The code and reference table should be used together with the referential integrity (RI) declared by DB2
Actually, RI should not be used as a shortcut for editing validity. This is usually something else, but should be used in real parent-child relationship.
A table can contain at most one or two indexes.
Actual: The table should have multiple indexes according to performance requirements.
Non-partitioned indexes (NPI) should not be used, especially in large tables.
Actually: this is related to countless problems, which can be overcome in general, but NPI is necessary for proper access and performance.
Large tables should be split
Actually: A table with too much data means performance degradation, which is a legacy concern. This understanding has been eliminated when more than 6 billion rows of data exist in some tables.
DB2 is good by default.
Actually: the default version is generally not the best. They are changed based on different versions. For example, bind the CURRENTDATA parameter.
Do not use Negation in SQL WHERE predicates.
Actually: Another rule is not clearly explained. Only when the predicate is a negative, the SQL access path may use an unnecessary tablespace scan. However, in most other cases, redundant filtering should be completed in the DB2 engine, which is better.
I can determine whether the access path is good only by explaining
Actual: EXPLAIN does not display the order of the queried blocks to be executed, does not tell you the first or second-stage predicates, and does not tell you how long a block will be executed. Basically, EXPLAIN only exports some data to a table, and then carries out more explanations based on other information. There are some tools to help with this process (such as Visual Explain), but if all the facts are not considered, this method will only bring disadvantages.
Do not make the EDM pool too large to avoid Paging
Actually: the EDM pool usually improves performance through paging (here paging refers to extended storage, rather than disks) instead of becoming smaller and continues to rebuild the internal structure due to page replacement and other factors.
Expansion is not related to anything else.
Actual: When did it start? In the future, if the world is full of SAN or ESS, it will be similar. The impact of expansion has become small because of the new disk cache controller, but there are still some additional checks and processing to manage them.
Relationship division is not used in DB2
Actually: The Relationship division has been used in many systems in the past and can be effectively implemented by database designers and program developers. In the current business intelligence (BI) and market systems, it can be used in each individual program several times.
Bind all packages to two plans: one batch processing and one online
Actually: this is a bad statement when introducing the DB2 package. There are many reasons to say that this understanding is wrong.
Unauthorized reading is not good.
Actually: unauthorized reading is not a four-character word, but it is a very good performance enhancement and can be used in more places than often understood.
No lock issues in the absence of timeout and deadlock
Reality: in fact, there is no problem that does not mean there is no need to pay attention to performance issues. Frequent locking is not considered a problem, because the focus is mainly on the adjustment measurement of the response (counting the number of deadlocks or Timeouts), rather than the subsequent adjustment (Monitoring lock wait time ).
ESA data compression is always good
Reality: When compression can be used in many places, it may cause problems. In each case, you must determine whether to use it before compression. This is not optional, but must be used or not at the high level.