轉載：postgresql分區與最佳化

最後更新：2016-04-11 來源：互聯網

上載者：User

創建阿里雲帳戶，並獲得超過 40 款產品的免費試用版；而企業帳戶則可以享有總值 $1200 的免費試用版。立即註冊！

標籤：


--對於分區表constraint_exclusion 這個參數需要配置為partition或onpostgres=# show constraint_exclusion ; constraint_exclusion ---------------------- partition   --建立父子表, 用於儲存分區資料create table t(id int primary key);create table t1(like t including all) inherits(t);create table t2(like t including all) inherits(t);create table t3(like t including all) inherits(t);create table t4(like t including all) inherits(t);--PostgreSQL的子表和子表之間的約束是沒有任何關係的, 所以也可以有重疊, 即非全域約束. alter table t1 add constraint ck_t1_1 check(id<0); alter table t2 add constraint ck_t2_1 check(id>=0 and id<100); alter table t3 add constraint ck_t3_1 check(id>=100 and id<200); alter table t4 add constraint ck_t4_1 check(id>=200);  --分區欄位傳入常量, 執行時掃描的是父表和約束對應的子表 :postgres=#  explain select * from t where id=10;                                 QUERY PLAN                                  ----------------------------------------------------------------------------- Append  (cost=0.00..8.17 rows=2 width=4)   ->  Seq Scan on t  (cost=0.00..0.00 rows=1 width=4)         Filter: (id = 10)   ->  Index Only Scan using t2_pkey on t2  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = 10)(5 rows)--分區欄位傳入常量, 執行時掃描的是父表和約束對應的子表;postgres=#  prepare p_test as select * from t where id=$1;PREPAREpostgres=# explain execute p_test(1);                                 QUERY PLAN                                  ----------------------------------------------------------------------------- Append  (cost=0.00..8.17 rows=2 width=4)   ->  Seq Scan on t  (cost=0.00..0.00 rows=1 width=4)         Filter: (id = 1)   ->  Index Only Scan using t2_pkey on t2  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = 1)(5 rows)--子句查詢, 執行時掃描的是父表和所有子表, 注意這裡使用的子查詢是子表的查詢, 理論上應該是掃描父表和該子表postgres=#  explain select * from t where id=(select id from t1 limit 1);                                 QUERY PLAN                                  ----------------------------------------------------------------------------- Append  (cost=0.01..32.70 rows=5 width=4)   InitPlan 1 (returns $0)     ->  Limit  (cost=0.00..0.01 rows=1 width=4)           ->  Seq Scan on t1 t1_1  (cost=0.00..34.00 rows=2400 width=4)   ->  Seq Scan on t  (cost=0.00..0.00 rows=1 width=4)         Filter: (id = $0)   ->  Index Only Scan using t1_pkey on t1  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = $0)   ->  Index Only Scan using t2_pkey on t2  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = $0)   ->  Index Only Scan using t3_pkey on t3  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = $0)   ->  Index Only Scan using t4_pkey on t4  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = $0)(14 rows)--綜上可知在對分區表進行查詢時最好使用字面常量，而不要使用子查詢之類複雜的sql--如果子表上約束刪除，則pg不得不把刪除約束的子表也加入到查詢中(即使子表可以忽略)alter table t4 drop constraint ck_t4_1;postgres=#  explain select * from t where id=10;                                    QUERY PLAN                                  ----------------------------------------------------------------------------- Append  (cost=0.00..16.34 rows=3 width=4)   ->  Seq Scan on t  (cost=0.00..0.00 rows=1 width=4)         Filter: (id = 10)   ->  Index Only Scan using t2_pkey on t2  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = 10)   ->  Index Only Scan using t4_pkey on t4  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = 10)(7 rows)--如果constraint_exclusion設定為off，pg不得不進行全表掃描postgres=# set constraint_exclusion=off;SETpostgres=#  explain select * from t where id=10;                                 QUERY PLAN                                  ----------------------------------------------------------------------------- Append  (cost=0.00..32.69 rows=5 width=4)   ->  Seq Scan on t  (cost=0.00..0.00 rows=1 width=4)         Filter: (id = 10)   ->  Index Only Scan using t1_pkey on t1  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = 10)   ->  Index Only Scan using t2_pkey on t2  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = 10)   ->  Index Only Scan using t3_pkey on t3  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = 10)   ->  Index Only Scan using t4_pkey on t4  (cost=0.15..8.17 rows=1 width=4)         Index Cond: (id = 10)(11 rows)--分區表上一般針對分區建立相對應的分區索引--建在父表的索引為全域索引，但如果你表沒有資料要查詢子表時，則分區表要進行全表掃描--父表建立的全域索引postgres=# \d+ p                                       Table "public.p"  Column   |              Type              | Modifiers | Storage | Stats target | Description -----------+--------------------------------+-----------+---------+--------------+------------- city_id   | integer                        | not null  | plain   |              |  logtime   | timestamp(0) without time zone | not null  | plain   |              |  peaktemp  | integer                        |           | plain   |              |  unitsales | integer                        |           | plain   |              | Indexes:    "idx_city_id" btree (city_id)    "idx_p_logtime" btree (logtime)Triggers:    delete_p_trigger BEFORE DELETE ON p FOR EACH ROW EXECUTE PROCEDURE p_delete_trigger()    insert_p_trigger BEFORE INSERT ON p FOR EACH ROW EXECUTE PROCEDURE p_insert_trigger()Child tables: p_201201,              p_201202,              p_201203,              p_201204,              p_201205,              p_201206,              p_201207,              p_201208,              p_201209,              p_201210,              p_201211,              p_201212,              p_defaultHas OIDs: no--分區沒有索引，不能使用父表索引postgres=# explain select * from p_201202 where city_id=2 and logtime=timestamp ‘2012-02-02 12:59:59‘;                                          QUERY PLAN                                          ---------------------------------------------------------------------------------------------- Seq Scan on p_201202  (cost=0.00..214.01 rows=2 width=20)   Filter: ((city_id = 2) AND (logtime = ‘2012-02-02 12:59:59‘::timestamp without time zone))(2 rows)--建立分區索引，可以使用分區索引postgres=# CREATE INDEX idx_p_201202_city_id ON p_201202 (city_id);CREATE INDEXpostgres=# explain select * from p_201202 where city_id=2 and logtime=timestamp ‘2012-02-02 12:59:59‘;                                      QUERY PLAN                                      -------------------------------------------------------------------------------------- Index Scan using idx_p_201202_city_id on p_201202  (cost=0.29..8.33 rows=2 width=20)   Index Cond: (city_id = 2)   Filter: (logtime = ‘2012-02-02 12:59:59‘::timestamp without time zone)--也可以指定只查詢父表的資料postgres=# select * from only p; city_id | logtime | peaktemp | unitsales ---------+---------+----------+-----------(0 rows)--如果一個分區表，父子表之間不再有繼承關係，則查詢父表時不再過濾到子表postgres=# alter table t3 no inherit t; ALTER TABLEpostgres=# explain select count(*) from t;                            QUERY PLAN                            ------------------------------------------------------------------ Aggregate  (cost=73.50..73.51 rows=1 width=0)   ->  Append  (cost=0.00..62.80 rows=4281 width=0)         ->  Seq Scan on t  (cost=0.00..0.00 rows=1 width=0)         ->  Seq Scan on t1  (cost=0.00..31.40 rows=2140 width=0)         ->  Seq Scan on t2  (cost=0.00..31.40 rows=2140 width=0)(5 rows)--再次添加繼承，查詢父表可以過濾到子表postgres=# alter table t3 inherit t;      ALTER TABLEpostgres=# explain select count(*) from t;                            QUERY PLAN                            ------------------------------------------------------------------ Aggregate  (cost=110.25..110.26 rows=1 width=0)   ->  Append  (cost=0.00..94.20 rows=6421 width=0)         ->  Seq Scan on t  (cost=0.00..0.00 rows=1 width=0)         ->  Seq Scan on t1  (cost=0.00..31.40 rows=2140 width=0)         ->  Seq Scan on t2  (cost=0.00..31.40 rows=2140 width=0)         ->  Seq Scan on t3  (cost=0.00..31.40 rows=2140 width=0)(6 rows)--以下為p表測試資料代碼CREATE TABLE p (    city_id         int not null,    logtime         timestamp(0) not null,    peaktemp        int,    unitsales       int);CREATE INDEX idx_p_logtime ON p (logtime);CREATE TABLE p_201201 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201202 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201203 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201204 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201205 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201206 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201207 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201208 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201209 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201210 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201211 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_201212 (LIKE p INCLUDING all) INHERITS (p);CREATE TABLE p_default (LIKE p INCLUDING all) INHERITS (p);CREATE OR REPLACE FUNCTION p_insert_trigger()RETURNS TRIGGER AS $$BEGIN    IF    ( NEW.logtime >= DATE ‘2012-01-01‘ AND NEW.logtime < DATE ‘2012-02-01‘ ) THEN        INSERT INTO p_201201 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-02-01‘ AND NEW.logtime < DATE ‘2012-03-01‘ ) THEN        INSERT INTO p_201202 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-03-01‘ AND NEW.logtime < DATE ‘2012-04-01‘ ) THEN        INSERT INTO p_201203 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-04-01‘ AND NEW.logtime < DATE ‘2012-05-01‘ ) THEN        INSERT INTO p_201204 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-05-01‘ AND NEW.logtime < DATE ‘2012-06-01‘ ) THEN        INSERT INTO p_201205 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-06-01‘ AND NEW.logtime < DATE ‘2012-07-01‘ ) THEN        INSERT INTO p_201206 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-07-01‘ AND NEW.logtime < DATE ‘2012-08-01‘ ) THEN        INSERT INTO p_201207 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-08-01‘ AND NEW.logtime < DATE ‘2012-09-01‘ ) THEN        INSERT INTO p_201208 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-09-01‘ AND NEW.logtime < DATE ‘2012-10-01‘ ) THEN        INSERT INTO p_201209 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-10-01‘ AND NEW.logtime < DATE ‘2012-11-01‘ ) THEN        INSERT INTO p_201210 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-11-01‘ AND NEW.logtime < DATE ‘2012-12-01‘ ) THEN        INSERT INTO p_201211 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2012-12-01‘ AND NEW.logtime < DATE ‘2013-01-01‘ ) THEN        INSERT INTO p_201212 VALUES (NEW.*);    ELSIF ( NEW.logtime >= DATE ‘2013-01-01‘ OR NEW.logtime < DATE ‘2012-01-01‘ ) THEN        INSERT INTO p_default VALUES (NEW.*);    ELSE        RAISE EXCEPTION ‘Date out of range.  Fix the p_insert_trigger() function!‘;    END IF;    RETURN NULL;END;$$ LANGUAGE plpgsql;CREATE OR REPLACE FUNCTION p_delete_trigger()RETURNS TRIGGER AS $$BEGIN    IF    ( OLD.logtime >= DATE ‘2012-01-01‘ AND OLD.logtime < DATE ‘2012-02-01‘ ) THEN        DELETE FROM p_201201 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-02-01‘ AND OLD.logtime < DATE ‘2012-03-01‘ ) THEN        DELETE FROM p_201202 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-03-01‘ AND OLD.logtime < DATE ‘2012-04-01‘ ) THEN        DELETE FROM p_201203 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-04-01‘ AND OLD.logtime < DATE ‘2012-05-01‘ ) THEN        DELETE FROM p_201204 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-05-01‘ AND OLD.logtime < DATE ‘2012-06-01‘ ) THEN        DELETE FROM p_201205 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-06-01‘ AND OLD.logtime < DATE ‘2012-07-01‘ ) THEN        DELETE FROM p_201206 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-07-01‘ AND OLD.logtime < DATE ‘2012-08-01‘ ) THEN        DELETE FROM p_201207 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-08-01‘ AND OLD.logtime < DATE ‘2012-09-01‘ ) THEN        DELETE FROM p_201208 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-09-01‘ AND OLD.logtime < DATE ‘2012-10-01‘ ) THEN        DELETE FROM p_201209 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-10-01‘ AND OLD.logtime < DATE ‘2012-11-01‘ ) THEN        DELETE FROM p_201210 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-11-01‘ AND OLD.logtime < DATE ‘2012-12-01‘ ) THEN        DELETE FROM p_201211 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2012-12-01‘ AND OLD.logtime < DATE ‘2013-01-01‘ ) THEN        DELETE FROM p_201212 WHERE logtime=OLD.logtime;    ELSIF ( OLD.logtime >= DATE ‘2013-01-01‘ OR OLD.logtime < DATE ‘2012-01-01‘ ) THEN        DELETE FROM p_default WHERE logtime=OLD.logtime;    ELSE        RAISE EXCEPTION ‘Date out of range.  Fix the p_insert_trigger() function!‘;    END IF;    RETURN NULL;END;$$ LANGUAGE plpgsql;CREATE TRIGGER insert_p_trigger    BEFORE INSERT ON p    FOR EACH ROW EXECUTE PROCEDURE p_insert_trigger();CREATE TRIGGER delete_p_trigger    BEFORE DELETE ON p    FOR EACH ROW EXECUTE PROCEDURE p_delete_trigger();INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (1, timestamp ‘2012-01-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (2, timestamp ‘2012-02-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (3, timestamp ‘2012-03-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (4, timestamp ‘2012-04-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (5, timestamp ‘2012-05-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (6, timestamp ‘2012-06-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (7, timestamp ‘2012-07-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (8, timestamp ‘2012-08-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (9, timestamp ‘2012-09-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (10, timestamp ‘2012-10-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (11, timestamp ‘2012-11-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (12, timestamp ‘2012-12-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (13, timestamp ‘2013-01-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) VALUES (14, timestamp ‘2011-12-02 12:59:59‘, 20, 10);INSERT INTO p (city_id, logtime, peaktemp, unitsales) select m, timestamp ‘2012-02-02 12:59:59‘, 20, 10 from generate_series(1,10000) m;explain select * from p_201202 where city_id=2 and logtime=timestamp ‘2012-02-02 12:59:59‘;

轉載：https://yq.aliyun.com/articles/2637?spm=5176.100240.searchblog.12.59Jibq#

轉載：postgresql分區與最佳化

本文章原先以中文撰寫並發佈於 aliyun.com，亦設英文版本，僅作資訊用途。本網站不對文章的準確性，完整性或可靠性或其任何翻譯作出任何明示或暗示的陳述或保證。如對該文章有任何疑慮或投訴，請傳送電郵至 info-contact@alibabacloud.com 並提供相關疑慮或投訴的詳細說明。職員會於 5 個工作天內與您聯絡，一經驗證之後，即會刪除該侵權內容。

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More