How can I aggregate and collapse rows in a database table using SQL?

Question

I have a table with example data as shown below.

word       | last_seen  | first_seen | count
-----------|------------|------------|------
definition | 2014-09-08 | 2012-01-02 | 15
definition | 2014-10-11 | 2013-05-12 | 35
attribute  | 2013-07-23 | 2010-06-29 | 22

I want to do an in-place aggregation of the data, ideally using only SQL, so that rows with a repeated word are collapsed into a single row holding MAX(last_seen), MIN(first_seen), and SUM(count).

word       | last_seen  | first_seen | count
-----------|------------|------------|------
definition | 2014-10-11 | 2012-01-02 | 50
attribute  | 2013-07-23 | 2010-06-29 | 22

I know I can see the results of the aggregation with the following:

SELECT 
  word, 
  MAX(last_seen) AS last_seen, 
  MIN(first_seen) AS first_seen, 
  SUM(count) AS count 
FROM 
  words 
GROUP BY word;

However, I don't just want to see the resulting aggregation... I want to actually update the words table, replacing the rows that have duplicate word column entries with the aggregated data.

JNevill · Accepted Answer

As far as I'm aware, there is no "edit in place" in PostgreSQL (or any other traditional RDBMS that I can think of). Instead:

Take the results of your query and dump them into a temp table: CREATE TEMP TABLE AS WITH DATA
Delete everything in your words table: TRUNCATE words; <-- This is the scary part, so make sure you are cool with your query before truncating.
Insert the records from your temp table into the now-empty words table: INSERT INTO words SELECT * FROM ;
Optionally: Drop your temp table DROP TABLE ; (being a temp table it will drop automagically when you end your session, but I'm a fan of being explicit)
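
Put together, the steps above might look like the following minimal sketch for PostgreSQL. The temp table name words_agg is hypothetical (the answer does not name one), and the column list assumes the words table has exactly the four columns shown in the question. Wrapping everything in a transaction is an addition not in the original answer: in PostgreSQL, TRUNCATE can be rolled back, so a failure partway through should leave the table untouched.

```sql
BEGIN;

-- Step 1: stash the aggregated rows in a temp table.
-- words_agg is a hypothetical name chosen for this sketch.
CREATE TEMP TABLE words_agg AS
SELECT
  word,
  MAX(last_seen)  AS last_seen,
  MIN(first_seen) AS first_seen,
  SUM(count)      AS count
FROM words
GROUP BY word;

-- Step 2: empty the original table (the scary part).
TRUNCATE words;

-- Step 3: reload the collapsed rows, naming the columns explicitly
-- so the insert does not depend on column order.
INSERT INTO words (word, last_seen, first_seen, count)
SELECT word, last_seen, first_seen, count
FROM words_agg;

-- Step 4 (optional): drop the temp table explicitly.
DROP TABLE words_agg;

COMMIT;
```

Before committing, it may be worth running SELECT * FROM words; inside the same transaction and issuing ROLLBACK instead of COMMIT if the result is not what you expected.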
