Royi Namir

Reputation: 148534

Best performance approach to history mechanism?

[Image: diagram of the DART database and the proposed DART_2005 history database]

We are going to create a history mechanism for the changes in our DB (DART in the picture) via triggers.

We have 600 tables.

For each record that is changed, the trigger will insert the deleted version into XXX.
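As a rough sketch, each trigger would look something like this (Table1, its columns, and the Table1_History target are made-up names; the history table stands in for XXX):

    -- Hypothetical example: on UPDATE/DELETE, copy the pre-change rows
    -- (the "deleted" pseudo-table) into a history table.
    CREATE TRIGGER trg_Table1_History
    ON dbo.Table1
    AFTER UPDATE, DELETE
    AS
    BEGIN
        SET NOCOUNT ON;
        INSERT INTO dbo.Table1_History (Id, Name, ChangedAt)
        SELECT d.Id, d.Name, GETDATE()
        FROM deleted AS d;
    END;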


Regarding XXX:

Option 1: clone each table in the DART DB, so that each table has a "sister" history table.

e.g.: Table1 will have Table1_History

Problems:


Option 2: create a new DB (DART_2005 in the picture) and put the history tables there.


Option 3: use a linked server hosting the DB that will contain the history tables.


Questions:

1) Which option gives the best performance? (I guess option 3 doesn't, but is it 1 or 2, or are they the same?)

2) Does option 2 act like a linked server? (In queries we will need to select from both DBs; see the sketch below.)

3) What is the best-practice approach?
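For context on question 2, a minimal sketch of how the two cases differ syntactically (HistorySrv is a hypothetical linked server name): a second database on the same instance is reached with three-part names and handled by the local query processor, while a true linked server needs four-part names and the distributed query processor.

    -- Option 2: second DB on the SAME instance: three-part names,
    -- executed entirely by the local query processor.
    SELECT h.*
    FROM DART.dbo.Table1 AS t
    JOIN DART_2005.dbo.Table1_History AS h ON h.Id = t.Id;

    -- Option 3: linked server: four-part names, routed through the
    -- distributed query processor and the network.
    SELECT h.*
    FROM DART.dbo.Table1 AS t
    JOIN HistorySrv.DART_2005.dbo.Table1_History AS h ON h.Id = t.Id;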

Upvotes: 2

Views: 207

Answers (2)

Filip De Vos

Reputation: 11908

All three approaches are viable, and their performance will be similar (option 3 also depends on your network speed), but each one will cause you a lot of headaches on a system with many concurrent users.

Since you will be inserting/updating multiple tables in one transaction, with very different access patterns (writes to the source table are random, writes to the history table are sequential), you will end up with blocking and/or deadlocks.

If the existing table schema cannot be changed

If you want a history system driven by your database, you should ideally queue your history updates to prevent blocking problems.

  • Fire a trigger on update of your table
  • The trigger submits a message containing the information from the inserted/deleted tables to a SQL Server Service Broker queue
  • An activation stored procedure pulls the information from the queue and writes it to the appropriate history table
  • On failure, a new message is sent to an "error queue" from which a retry mechanism can re-submit to the original queue (make sure to include a retry counter in the message)

This way your history updates will be non-blocking and cannot get lost. (A sketch of the trigger side of this pattern follows the note below.)

Note: when working with SQL Server Service Broker, make sure you completely understand the "poison message" concept.
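A minimal sketch of the trigger side of this queued pattern, assuming the Service Broker objects (message type, contract, queues, services) have already been created; every name here is made up for illustration:

    -- Assumed to exist already (hypothetical names):
    --   MESSAGE TYPE HistoryMsg, CONTRACT HistoryContract,
    --   QUEUE HistoryQueue / HistorySendQueue,
    --   SERVICE HistoryTargetService / HistorySendService.
    CREATE TRIGGER trg_Table1_QueueHistory
    ON dbo.Table1
    AFTER UPDATE, DELETE
    AS
    BEGIN
        SET NOCOUNT ON;
        DECLARE @msg XML, @h UNIQUEIDENTIFIER;

        -- Package the pre-change rows as one XML message.
        SET @msg = (SELECT * FROM deleted
                    FOR XML PATH('row'), ROOT('Table1'), TYPE);

        IF @msg IS NOT NULL
        BEGIN
            BEGIN DIALOG CONVERSATION @h
                FROM SERVICE HistorySendService
                TO SERVICE 'HistoryTargetService'
                ON CONTRACT HistoryContract
                WITH ENCRYPTION = OFF;

            SEND ON CONVERSATION @h MESSAGE TYPE HistoryMsg (@msg);
        END
    END;

The activation stored procedure on the target queue would then RECEIVE each message, shred the XML, and insert the rows into the appropriate history table, forwarding failures to the error queue as described above.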

If the existing table schema can be changed

When this is an option, I recommend working with a "record versioning" system where every update creates a new record and your application queries the most recent version of the data. To ensure proper performance, the table can be partitioned so that the most recent version of the data sits in one partition and the older versions sit in an archive partition. (I usually have an end_date or expiration_date field which is set to 9999/12/31 for the currently valid record.)
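A minimal sketch of that versioning pattern, with hypothetical table and column names (the 9999/12/31 sentinel marks the current version):

    -- Hypothetical versioned table: the current row has the sentinel end_date.
    CREATE TABLE dbo.Customer (
        CustomerId INT           NOT NULL,
        Name       NVARCHAR(100) NOT NULL,
        start_date DATETIME      NOT NULL,
        end_date   DATETIME      NOT NULL DEFAULT '99991231',
        PRIMARY KEY (CustomerId, start_date)
    );

    -- An "update" closes the current version and inserts a new one,
    -- ideally inside one transaction.
    BEGIN TRANSACTION;
        UPDATE dbo.Customer
        SET end_date = GETDATE()
        WHERE CustomerId = 42 AND end_date = '99991231';

        INSERT INTO dbo.Customer (CustomerId, Name, start_date)
        VALUES (42, N'New name', GETDATE());
    COMMIT;

    -- The application always filters on the sentinel to get current data.
    SELECT Name FROM dbo.Customer
    WHERE CustomerId = 42 AND end_date = '99991231';

Partitioning on end_date (sentinel value vs. historical values) then keeps current-version queries confined to the "live" partition, as the answer suggests.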

This approach of course requires considerable code changes in your data model and the existing application, which might not be very cost-effective.

Upvotes: 1

Neville Kuyt

Reputation: 29629

1 and 2 will have similar performance; option 3 might be faster if you are currently limited by some resource on the database server (e.g. disk I/O) and you have a very fast network available to you.

Option 1 will lead to longer backup times for your DART database; this may be a concern.

In general, I believe that if your application domain needs the concept of "history", you should build it in as a first-class feature. There are several approaches to this; check out the links in the question "How to create a point in time architecture in MySQL".

Again, in general, I dislike using triggers for this kind of requirement. Your trigger either has to be very simple, in which case it's not always easy to use the data it creates in your history table, or it has to be smart, in which case the trigger does a lot of work, which may make evolving your database schema harder in the future.

Upvotes: 1
