Optimized ways to update every record in a table after running some calculations on each row

Question

There is a large table that holds millions of records. phpMyAdmin reports 1.2G size for the table.

There is a calculation that needs to be done for every row. The calculation is not simple (cannot be put in set col= calc format), it uses a stored function to get the values, so currently we have for each row a single update.

This is extremely slow and we want to optimize it.

Stored function:
https://gist.github.com/a9c2f9275644409dd19d

And this is called by this method for every row:
https://gist.github.com/82adfd97b9e5797feea6

This is performed on a off live server, and usually it is updated once per week.

What options we have here.

Stephen Senkomago Musoke · Accepted Answer

Why not setup a separate table to hold the computed values to take the load off your current table. It can have two columns: primary key for each row in your main table and a column for the computed value.

Then your process can be:

a) Truncate computedValues table - This is faster than trying to identify new rows

b) Compute the values and insert into the computed values table

c) So when ever you need your computed values you join to the computedValues table using a primary key join which is fast, and in case you need more computations well you just add new columns.

d) You can also update the main table using the computed values if you have to

Optimized ways to update every record in a table after running some calculations on each row

Answers (2)

Related Questions