Periodic RDS Postgresql Replication Delays

Question

I have been observing that my PostgreSQL read replica shows periodic delay for replication lags. The lag seems to build to up to 30-40 minutes and then automatically goes down to 0. There is a correlation with CPU Utilization but it's nowhere close to CPU limit.

Read traffic comes from a reporting software called DOMO. DOMO periodically copies a large chunk of data & full tables into its warehouse.

Here's AWS Cloudwatch graph. The red line shows Replication Lag in seconds. The blue line shows the CPU load.

Lag vs CPU

Lag vs Network Out

Lag vs Read IOPS

Lag vs Write IOPS

Cloud: Amazon RDS

Instance Size: db.m3.2xlarge

PostgresSQL version: 9.3

Postgres Settings:

Shared Buffers (Set by RDS) = 7.3 GB (956978 * 8KB)

Updates

Tried setting Shared Buffers to 1GB (didn't help)

Updates June, 5 2017

I created a branch new replica for my database and pointed the reporting software (DOMO) at it. Things in the new instance look stable for now. The old replica which has no read traffic now is stable as well. Beginning to suspect some type of AWS config issue or something to do what remaining artifacts in the database (vacuum?).

Periodic RDS Postgresql Replication Delays

Answers (1)

Related Questions