Notes on "PostgreSQL Database Kernel Analysis" (the book is not an easy read; it is mostly prose descriptions of data structures, concepts, and processes)

Last Update:2014-11-10 Source: Internet

Author: User

Tags fsm psql

Contents

1 System Overview
2 System Structure
3 Storage Management
4 Index
5 Query Compilation
6 Query Execution
7 Transaction Processing and Concurrency Control
8 Database Security
9 Appendix A: Development and Debugging with Eclipse

System Overview

Initialize the database: ./initdb --no-locale -D ./data
Start the server: ./pg_ctl start -D ./data
Database commands: initdb createuser dropuser createdb dropdb pg_dump pg_restore pg_ctl vacuumdb psql
psql meta-commands: \? \o \l \q \c \dt \d \di \i (SQL);

System Structure

Main system catalogs and their dependencies
1. pg_namespace (nspname, nspowner, nspacl)
2. pg_tablespace (spcname, spcowner, spclocation, spcacl)
3. pg_database
4. pg_class
5. pg_type
6. pg_attribute
7. pg_index
System views: pg_cursors pg_group pg_indexes pg_locks pg_roles pg_rules ...
Database cluster
1. Table/index files: split into segments once they exceed 1 GB (filenode.1, ...)
2. If an attribute may store large values, the table has an associated TOAST table
3. Subdirectories and files in PGDATA: PG_VERSION base global pg_clog pg_tblspc ...
4. postgres.bki
5. Execution flow of initdb
6. System databases: template1 template0 postgres
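
The 1 GB segment split noted in the database cluster list can be illustrated with a little arithmetic. The following is a minimal sketch, assuming the default build settings of an 8 KB block and 131072 blocks per segment; `locate_block` is a hypothetical helper written for these notes, not a PostgreSQL function.

```python
from typing import Tuple

BLCKSZ = 8192          # assumed default page size in bytes
RELSEG_SIZE = 131072   # assumed default blocks per segment (131072 * 8 KB = 1 GB)


def locate_block(filenode: int, blkno: int) -> Tuple[str, int]:
    """Return (segment file name, byte offset in that file) for a block number."""
    segno = blkno // RELSEG_SIZE
    offset = (blkno % RELSEG_SIZE) * BLCKSZ
    # The first segment has no suffix; later ones are filenode.1, filenode.2, ...
    name = str(filenode) if segno == 0 else f"{filenode}.{segno}"
    return name, offset


if __name__ == "__main__":
    print(locate_block(16384, 0))
    print(locate_block(16384, 131073))
```

The file node number 16384 is only an example value; the point is that the block number alone determines both the segment file and the offset inside it.
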
Process structure: postmaster postgres syslogger pgstat autovacuum bgwriter walwriter pgarch
1. Postmaster
  1. MemoryContext
  2. GUC configuration parameters
  3. Signal handling: SIGHUP_handler, pmdie, reaper (cleans up exited child processes)
  4. Starting the worker processes
2. Worker processes
  1. WalWriter: segment numbers start from 0 and cannot be reused
  2. PgArch (WAL archiving): calls the shell command directly?
3. Postgres
4. exec_simple_query

Storage Management

External storage management: table files, free space, virtual file descriptors (VFD), large data
1. 8.2+: visibility map (VM), free space map (FSM)
Heap file: a table file whose tuples are not related to one another; kinds: {ordinary, temporary, sequence, TOAST}
1. Physical structure: PageHeaderData, linp<n> ..., free space, ... tuple<n>, special space
2. HOT (heap-only tuple) technique
  1. Each version of a tuple has a corresponding index entry ==> ... mark as deleted
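
The page layout listed under the heap file item (header, line pointers, free space, tuples, special space) can be pictured with a toy model. This is a sketch under simplifying assumptions: a 24-byte header, 4-byte line pointers, and no alignment padding. The class and method names are made up for illustration and do not mirror the real C functions.

```python
class Page:
    """Toy slotted page: line pointers grow up from the header, tuples grow down from the end."""

    HEADER_SIZE = 24   # assumed size of the page header
    LINP_SIZE = 4      # assumed size of one line pointer

    def __init__(self, size=8192, special=0):
        self.size = size
        self.lower = self.HEADER_SIZE   # end of the line pointer array
        self.upper = size - special     # start of the tuple data area
        self.linp = []                  # one (offset, length) pair per tuple
        self.data = bytearray(size)

    def free_space(self):
        # The hole between the line pointer array and the tuple data.
        return self.upper - self.lower

    def add_tuple(self, payload: bytes):
        """Store a tuple and return its 1-based item number, or None if the page is full."""
        if len(payload) + self.LINP_SIZE > self.free_space():
            return None
        self.upper -= len(payload)
        self.data[self.upper:self.upper + len(payload)] = payload
        self.linp.append((self.upper, len(payload)))
        self.lower += self.LINP_SIZE
        return len(self.linp)

    def get_tuple(self, offnum):
        off, length = self.linp[offnum - 1]
        return bytes(self.data[off:off + length])
```

Because an index entry refers to a tuple through its block number and item number, the tuple bytes can be moved inside the page as long as the line pointer is updated, which is the property the HOT item relies on.
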
Disk management (smgr)
1. MdfdVec: vfd, segno, chain
VFD mechanism
1. LRU pool (VfdCache)
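
The idea behind the VFD LRU pool is that a backend may hold many "virtual" file handles while keeping only a limited number of real OS descriptors open. The sketch below is a simplified model of that idea, not the actual fd.c logic: the class name reuses `VfdCache` from the notes, but the methods, the `max_open` parameter, and the explicit-offset read are assumptions made for illustration.

```python
import os
from collections import OrderedDict


class VfdCache:
    """Toy virtual file descriptor pool: many virtual handles, few real descriptors."""

    def __init__(self, max_open=2):
        self.max_open = max_open
        self.paths = []                 # virtual fd -> file path
        self.open_fds = OrderedDict()   # virtual fd -> real fd, least recently used first

    def open(self, path):
        # Only remember the path; the real file is opened lazily on first use.
        self.paths.append(path)
        return len(self.paths) - 1

    def _real_fd(self, vfd):
        if vfd in self.open_fds:
            self.open_fds.move_to_end(vfd)          # mark as most recently used
            return self.open_fds[vfd]
        if len(self.open_fds) >= self.max_open:
            _, victim = self.open_fds.popitem(last=False)   # evict the LRU entry
            os.close(victim)
        fd = os.open(self.paths[vfd], os.O_RDONLY)
        self.open_fds[vfd] = fd
        return fd

    def read(self, vfd, nbytes, offset=0):
        fd = self._real_fd(vfd)
        os.lseek(fd, offset, os.SEEK_SET)
        return os.read(fd, nbytes)

    def close_all(self):
        for fd in self.open_fds.values():
            os.close(fd)
        self.open_fds.clear()
```

Callers only ever see the virtual number, so a descriptor that was closed to make room is reopened transparently the next time it is used.
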
FSM
1. p. 66: fp_next_slot
2. fsm_search: max-heap binary tree?
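
The question about a max-heap binary tree can be made concrete with a small model: leaves hold the free space of individual heap pages, and every inner node holds the maximum of its two children, so the root tells at once whether any page has enough room. This is a simplified sketch; it always searches from the root, whereas the notes mention an fp_next_slot hint that this model ignores. All names here are hypothetical.

```python
class FsmPage:
    """Toy FSM page: a complete binary tree in an array, each inner node = max of its children."""

    def __init__(self, leaf_values):
        n = 1
        while n < len(leaf_values):
            n *= 2
        self.nleaves = n
        # nodes[0] is the root; the leaves occupy nodes[n-1 : 2n-1].
        self.nodes = [0] * (2 * n - 1)
        for i, v in enumerate(leaf_values):
            self.nodes[n - 1 + i] = v
        for i in range(n - 2, -1, -1):
            self.nodes[i] = max(self.nodes[2 * i + 1], self.nodes[2 * i + 2])

    def set_avail(self, slot, value):
        """Update one leaf and propagate the new maximum up to the root."""
        i = self.nleaves - 1 + slot
        self.nodes[i] = value
        while i > 0:
            i = (i - 1) // 2
            self.nodes[i] = max(self.nodes[2 * i + 1], self.nodes[2 * i + 2])

    def search_avail(self, min_value):
        """Return a leaf slot with at least min_value free, or -1 if none exists."""
        if self.nodes[0] < min_value:
            return -1
        i = 0
        while i < self.nleaves - 1:       # descend until a leaf is reached
            left = 2 * i + 1
            i = left if self.nodes[left] >= min_value else left + 1
        return i - (self.nleaves - 1)
```

Both the update and the search touch one node per tree level, so they cost O(log n) in the number of leaves.
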
VM: serves as a hint to speed up VACUUM
Large data:
1. TOAST: stores variable-length data? For types such as varchar, once the value exceeds about 2 KB; two storage mechanisms: out-of-line storage and compression
2. Large objects (LOB)
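
The two TOAST mechanisms (compression and out-of-line storage) can be sketched as a decision made on an oversized value. This is only an illustration under loud assumptions: `zlib` stands in for PostgreSQL's own compressor, the 2000-byte threshold and chunk size are round numbers chosen to match the "about 2 KB" in the notes, and the real system decides per tuple and per column storage strategy rather than per value. `toast_value` is a hypothetical name.

```python
import zlib

THRESHOLD = 2000     # illustrative stand-in for the ~2 KB limit in the notes
CHUNK_SIZE = 2000    # illustrative size of one out-of-line chunk


def toast_value(value: bytes):
    """Return (kind, stored) where kind is 'plain', 'compressed', or 'external'."""
    if len(value) <= THRESHOLD:
        return "plain", value
    packed = zlib.compress(value)   # assumption: zlib replaces the real compressor
    if len(packed) < len(value):
        kind, candidate = "compressed", packed
    else:
        kind, candidate = "plain", value
    if len(candidate) <= THRESHOLD:
        return kind, candidate
    # Still too large: cut it into chunks, as rows of a TOAST table would hold it.
    chunks = [candidate[i:i + CHUNK_SIZE] for i in range(0, len(candidate), CHUNK_SIZE)]
    return "external", chunks
```

A highly repetitive value usually ends up compressed in place, while data that does not compress well is moved out of line in chunks.
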
Memory management
1. MemoryContext: AllocSet
2. Cache: SysCache/RelCache
3. Buffer pool
4. IPC
Table operations and tuple operations
1. Synchronized scan (multiple concurrent scans share the buffer pool)
VACUUM mechanism
1. Lazy: marks the space of dead tuples as reusable
2. Full
ResourceOwner resource tracking

I find the book's description of this part very confusing.

Index

Index methods
1. Partial index? CREATE INDEX idx ON student (name) WHERE (id > 1 AND id < 255);
2. Expression index? CREATE INDEX idx ON student (lower(name));
pg_am: each tuple records the access functions (pg_proc.oid) provided by that index type?
B-tree index
1. Every non-rightmost node has a high key
2. BTWriteState: records information for the whole index build
3. A BTPageState is generated for each level; its btps_next points to the parent level (?)
4. Fill factor: ... WITH (FILLFACTOR = 70);
5. Index scan
Hash index
1. Four kinds of pages: meta (page 0), bucket, overflow (elements of a bucket), bitmap (usage of the former two)
GiST
1. Consistent(E, q) Union(P) Same(E1, E2) Penalty(E1, E2) PickSplit(P) Compress(E) Decompress(E)
2. GISTInsertStack?
GIN
1. compare, extractValue, extractQuery, consistent (like equals in a Hashtable?), comparePartial
TSearch2

Query Compilation

Query analysis
Query rewriting
Query planning: list of query trees + list of execution plans
1. During path generation, the size, paths, and cost of each intermediate relation are estimated.
  1. Dynamic programming (DP), genetic algorithm (GA)
  2. Access paths for base relations
  3. Index scan paths
  4. TID (the physical address of the tuple?)
2. Generate an optimized plan for MIN/MAX aggregates where possible
3. Generate an ordinary plan
  1. Scans: sequential/index
  2. Joins: nested loop, hash, merge
  3. Others: Append, Result, Material
4. Generate the complete plan (+ aggregation/sort)
5. Clean up the plan tree
Cost estimation
Genetic algorithm
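
The dynamic programming approach to join ordering mentioned in the planning list can be sketched on a toy cost model: the best plan for every set of relations is built from the best plans of its two halves. The cost formula (sum of intermediate result sizes), the selectivity dictionary, and the function name `best_join_order` are all assumptions made for illustration; the real planner's cost model is far richer.

```python
from itertools import combinations


def best_join_order(rows, selectivity):
    """rows: {relation: row count}; selectivity: {frozenset({a, b}): fraction}.

    Returns (cost, output rows, plan) for joining all relations, where the
    toy cost is the sum of the sizes of all intermediate join results.
    """
    rels = sorted(rows)
    # best[set of relations] = (cost, output rows, plan tree)
    best = {frozenset([r]): (0.0, float(rows[r]), r) for r in rels}
    for size in range(2, len(rels) + 1):
        for subset in map(frozenset, combinations(rels, size)):
            for k in range(1, size):
                for left in map(frozenset, combinations(sorted(subset), k)):
                    right = subset - left
                    lcost, lrows, lplan = best[left]
                    rcost, rrows, rplan = best[right]
                    # Combine the selectivities of all predicates spanning both sides.
                    sel = 1.0
                    for a in left:
                        for b in right:
                            sel *= selectivity.get(frozenset((a, b)), 1.0)
                    out_rows = lrows * rrows * sel
                    cost = lcost + rcost + out_rows
                    if subset not in best or cost < best[subset][0]:
                        best[subset] = (cost, out_rows, (lplan, rplan))
    return best[frozenset(rels)]


if __name__ == "__main__":
    rows = {"a": 1000, "b": 100, "c": 10}
    sel = {frozenset("ab"): 0.01, frozenset("bc"): 0.1}
    print(best_join_order(rows, sel))
```

The number of subsets grows exponentially with the number of relations, which is why a heuristic such as the genetic algorithm is used as a fallback for queries that join many tables.
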

Query Execution

Non-optimizable statements
Optimizable statements
Plan nodes
1. Control: Result Append BitmapAnd/BitmapOr RecursiveUnion
2. Scan: SeqScan IndexScan BitmapHeapScan BitmapIndexScan TidScan SubqueryScan FunctionScan ValuesScan CteScan WorkTableScan
3. Materialization: Material Sort Group Agg Unique Hash SetOp Limit WindowAgg
4. Join: types (inner, left/right/full outer, semi, anti), operations
Other sub-functions
1. Tuple operations
2. Expression evaluation
3. Projection
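
The way plan nodes of the kinds listed above cooperate can be sketched as a pull-based pipeline: each node asks its child for the next tuple on demand. The generator functions below are a loose analogy to a few of the node types (scan, materialization, join), written for these notes; rows are plain dictionaries, and none of the names correspond to real executor functions.

```python
from itertools import islice


def seq_scan(table):
    """Scan node: yield every tuple of an in-memory table."""
    for row in table:
        yield row


def filter_node(child, predicate):
    """Qualification check: pass on only the tuples that satisfy the predicate."""
    for row in child:
        if predicate(row):
            yield row


def sort_node(child, key):
    """Materializing node: consumes all input before emitting the first tuple."""
    yield from sorted(child, key=key)


def limit_node(child, count):
    """Stop pulling from the child after `count` tuples."""
    yield from islice(child, count)


def nested_loop_join(outer, make_inner, join_pred):
    """Inner join: rescan the inner side once for every outer tuple."""
    for o in outer:
        for i in make_inner():
            if join_pred(o, i):
                yield {**o, **i}


if __name__ == "__main__":
    students = [{"id": 3, "name": "c"}, {"id": 1, "name": "a"}, {"id": 2, "name": "b"}]
    plan = limit_node(sort_node(seq_scan(students), key=lambda r: r["id"]), 2)
    for row in plan:
        print(row)
```

Composing the functions builds a tree, and iterating over the root drives the whole tree, which is why a Limit node at the top can stop a scan early unless a materializing node such as Sort sits in between.
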

Transaction Processing and Concurrency Control

TBlockState
2PC
Three kinds of locks
1. SpinLock
2. LWLock
3. RegularLock
Lock management mechanism
Deadlock
1. Wait-for graph (WFG)
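
Deadlock detection on a wait-for graph comes down to finding a cycle: an edge from T1 to T2 means T1 is waiting for a lock held by T2. The following is a minimal sketch using a depth-first search; the function name and the dictionary representation are assumptions for illustration, and the real detector is more involved than this.

```python
def find_deadlock(waits_for):
    """waits_for: {txn: set of txns it waits on}. Return one cycle as a list, or None."""
    WHITE, GREY, BLACK = 0, 1, 2    # unvisited, on the current path, finished
    color = {}
    stack = []

    def visit(t):
        color[t] = GREY
        stack.append(t)
        for u in sorted(waits_for.get(t, ())):
            c = color.get(u, WHITE)
            if c == GREY:
                # u is already on the current path, so the path from u to t is a cycle.
                return stack[stack.index(u):]
            if c == WHITE:
                cycle = visit(u)
                if cycle:
                    return cycle
        stack.pop()
        color[t] = BLACK
        return None

    for t in sorted(waits_for):
        if color.get(t, WHITE) == WHITE:
            cycle = visit(t)
            if cycle:
                return cycle
    return None
```

Once a cycle is found, aborting any one transaction on it breaks the deadlock; which one to pick is a policy decision outside this sketch.
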
MVCC (the book's explanation here is not clear enough)
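
Since the MVCC explanation is flagged as unclear, a heavily simplified visibility check may help: every tuple version records the transaction that created it and, if any, the one that deleted it, and a snapshot decides which of those transactions count as "already happened". This sketch deliberately ignores a transaction's own changes, command IDs, hint bits, and XID wraparound; the class and function names are hypothetical.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class Snapshot:
    xmax: int                      # first transaction ID not yet assigned at snapshot time
    in_progress: FrozenSet[int]    # transactions still running at snapshot time


@dataclass(frozen=True)
class TupleVersion:
    xmin: int                      # transaction that inserted this version
    xmax: Optional[int]            # transaction that deleted or updated it, if any
    value: str


def xid_visible(xid, snap, committed):
    """A transaction's effects are seen only if it committed before the snapshot."""
    if xid >= snap.xmax or xid in snap.in_progress:
        return False
    return xid in committed


def tuple_visible(tup, snap, committed):
    if not xid_visible(tup.xmin, snap, committed):
        return False                                    # the insert is not visible
    if tup.xmax is None:
        return True                                     # never deleted
    return not xid_visible(tup.xmax, snap, committed)   # the delete is not visible yet
```

For example, if transaction 20 replaces a row that transaction 10 inserted, a snapshot taken while 20 was still running keeps seeing the old version, and a snapshot taken after 20 committed sees only the new one.
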
Log management: XLOG/CLOG
1. SLRU buffer pool
2. SUBTRANS log manager?
3. MultiXact log manager: records combined transaction IDs?

Database Security

Appendix A: Development and Debugging with Eclipse
