Russ

Reputation: 11335

How to avoid data fragmentation in an "INSERT once, UPDATE once" table?

I have a large number of tables that are "INSERT once", and then read-only after that, i.e. after the initial INSERT of a record, there are never any UPDATE or DELETE statements. Because of this, on-disk data fragmentation for these tables is minimal.

I'm now considering adding a needs_action boolean field to every table. This field will only ever get updated once, and it will be done on a slow/periodic basis. As a result of MVCC, when VACUUM comes along (on an even slower schedule) after the UPDATE, the table becomes quite fragmented: VACUUM clears out the originally inserted tuples, and the freed space gets backfilled by new insertions.

In short: the addition of this single "always updated once" field changes the table from being minimally fragmented by design to being highly fragmented by design.

Is there some method of effectively achieving this single needs_action record flagging that can avoid the resulting table fragmentation?
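For concreteness, here's a minimal sketch of the pattern in question (the table and column names are made up purely for illustration):

-- Hypothetical example table; names are illustrative only.
CREATE TABLE events (
    id           bigserial PRIMARY KEY,
    payload      text,
    needs_action boolean NOT NULL DEFAULT true
);

-- Partial index so the rows still needing action are cheap to find.
CREATE INDEX ON events (id) WHERE needs_action;

-- The single UPDATE each row ever receives, run periodically:
UPDATE events SET needs_action = false WHERE needs_action;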

<And now for some background/supplemental info...>


Some options considered so far...

At the risk of making this question massive (and therefore overlooked?), below are some options that have been considered so far:

  1. Just add the column to each table, do the UPDATE and don't worry about resulting fragmentation until it actually proves to be an issue.

    • I'm conscious of premature optimization here, but with some of the tables getting large (>1M rows, and even >1B), I'd rather get the design right up front.
  2. Make an independent tracking table (for every table) only containing A) the PK from the master table, and B) the needs_action flag. Create a record in the tracking table via an AFTER INSERT trigger on the master table

    • This will preserve "INSERT only" minimal fragmentation levels on the master table... at the expense of adding (significant?) up-front write overhead
    • putting the tracking tables in a separate schema would also neatly separate the functionality from the core tables
  3. Forcing the needs_action UPDATE to be a HOT update, so the new tuple version stays in the same page and no index entries need updating

    • Needing a partial index on WHERE needs_action = TRUE seems to rule out this option (HOT is disabled when a column used by an index changes), but maybe there's another way to find the flagged rows quickly?
  4. Using table fillfactor (at 50?) to leave space for the inevitable UPDATE

    • eg: set fillfactor to 50 to leave room for the UPDATE, and therefore keep it in the same page
    • But... since each row only ever gets one UPDATE, it seems like this would leave the table's packing fraction eternally at 50% and take twice the storage space? I don't 100% understand this option yet... still learning.
  5. Find a special/magical field/bit in the master table records that can be twiddled without MVCC impact.

    • This does not seem to exist in postgres. Even if it did, it would need to be indexed (or have some other quick finding mechanism akin to a WHERE needs_action = TRUE partial index)
    • Being able to optionally suppress MVCC operation on specific columns seems like it would be nice here (although surely fraught with peril)
  6. Storing needs_action outside of postgres (eg: as a <table_name>:needs_copying list of PKs in redis) to evade fragmentation due to MVCC.

    • I have concerns about keeping this atomic, though. Maybe using redis_fdw (or some other fdw?) in an AFTER INSERT trigger could keep it atomic? I need to learn more about fdw capabilities... all the fdws I can find seem to be read-only, though.
  7. Run a fancy view with background defrag/compacting, as described in this fantastic article

    • Seems like a bit much to do for all tables.
  8. Simply track the ids/PKs that need copying in a postgres table

    • just stash the ids that need action as records in a fast, inert table (eg: no PK), and DELETE the records when the action is completed
    • similar to RPUSHing to an offline redis list (but definitely ACID)
    • This seems like the best option at the moment (see the sketch after this list).
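As a rough sketch of option 8 (option 2 is the same idea with a needs_action flag kept alongside the PK), assuming a hypothetical master table events with a bigint id; all of the names below are made up:

-- Deliberately inert side table: no PK, no indexes, just ids to process.
CREATE TABLE events_pending (event_id bigint NOT NULL);

CREATE FUNCTION events_pending_track() RETURNS trigger AS $$
BEGIN
    INSERT INTO events_pending (event_id) VALUES (NEW.id);
    RETURN NULL;  -- the return value of an AFTER ROW trigger is ignored
END
$$ LANGUAGE plpgsql;

CREATE TRIGGER events_pending_track
    AFTER INSERT ON events
    FOR EACH ROW EXECUTE PROCEDURE events_pending_track();

-- The periodic worker reads a batch from events_pending, performs the
-- action, and then DELETEs the rows it has handled.

With this layout the master table never sees an UPDATE, so its pages stay densely packed; all of the MVCC churn is confined to the small events_pending table, which is cheap to VACUUM.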

Are there other options to consider?


More on the specific use case driving this...

I'm interested in how to avoid this fragmentation in the general case, but here's a bit more on the current use case:

  1. Read performance is much more important than write performance on all tables (but avoiding crazy slow writes is clearly desirable)
  2. Some tables will reach millions of rows. A few may hit billions of rows.
  3. SELECT queries will be across wide table ranges (not just recent data) and can range from single result records to 100k+
  4. Table design can be done from scratch... no need to worry about existing data
  5. PostgreSQL 9.6

Upvotes: 3

Views: 2108

Answers (2)

Laurenz Albe

Reputation: 247225

I would just lower the fillfactor below the default value of 100.

Depending on the size of the row, use a value like 80 or 90, so that the updated row versions will still fit into the same block. After the update, the old row version can be “pruned” and defragmented by the next transaction so that the space can be reused.

A value of 50 seems way too low. True, this would leave room for all rows in a block being updated at the same time, but that is not your use case, right?
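For example (a sketch, using a hypothetical table named events; the right value needs testing against real row sizes):

-- Reserve ~10% of each newly written page for the one-time UPDATE.
-- fillfactor only affects pages written from now on, not existing ones.
ALTER TABLE events SET (fillfactor = 90);

-- Rough check of how many updates stayed within the same page (HOT);
-- HOT applies only if needs_action is not used by any index.
SELECT n_tup_upd, n_tup_hot_upd
FROM pg_stat_user_tables
WHERE relname = 'events';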

Upvotes: 2

Roman Tkachuk

Reputation: 3266

You can try using inherited tables: keep freshly inserted rows in one child table and move each row to a second child table on its one-time update, so both children stay effectively append-only. This approach does not directly support a primary key on the parent table, but that can be worked around with triggers.

-- The parent table is what clients see; the children split rows by state.
CREATE TABLE data_parent (a int8, updated bool);
CREATE TABLE data_inserted (CHECK (NOT updated)) INHERITS (data_parent);
CREATE TABLE data_updated (CHECK (updated)) INHERITS (data_parent);



-- Redirect every INSERT on the parent into data_inserted.
CREATE FUNCTION d_insert () RETURNS TRIGGER AS $$
BEGIN
    NEW.updated = false;
    INSERT INTO data_inserted VALUES (NEW.*);

    RETURN NULL;  -- cancel the INSERT into the parent itself
END
$$ LANGUAGE plpgsql SECURITY DEFINER;

CREATE TRIGGER d_insert BEFORE INSERT ON data_parent FOR EACH ROW EXECUTE PROCEDURE d_insert();


-- On UPDATE of a row in data_inserted, move it to data_updated instead.
CREATE FUNCTION d_update () RETURNS TRIGGER AS $$
BEGIN
    NEW.updated = true;
    INSERT INTO data_updated VALUES (NEW.*);
    DELETE FROM data_inserted WHERE (data_inserted.*) = (OLD.*);

    RETURN NULL;  -- cancel the in-place UPDATE
END
$$ LANGUAGE plpgsql SECURITY DEFINER;

CREATE TRIGGER d_update BEFORE UPDATE ON data_inserted FOR EACH ROW EXECUTE PROCEDURE d_update();


-- GRANT INSERT ON data_parent to the regular user
-- REVOKE INSERT / UPDATE ON data_inserted / data_updated from the regular user


INSERT INTO data_parent (a) VALUES (1);


SELECT * FROM ONLY data_parent;
SELECT * FROM ONLY data_inserted;
SELECT * FROM ONLY data_updated;



INSERT 0 0
 a | updated 
---+---------
(0 rows)

 a | updated 
---+---------
 1 | f
(1 row)

 a | updated 
---+---------
(0 rows)


UPDATE data_parent SET updated = true;

SELECT * FROM ONLY data_parent;
SELECT * FROM ONLY data_inserted;
SELECT * FROM ONLY data_updated;


UPDATE 0
 a | updated 
---+---------
(0 rows)

 a | updated 
---+---------
(0 rows)

 a | updated 
---+---------
 1 | t
(1 row)

Upvotes: 1
