Alerts
pgDash Alerts allows you to quickly set smart alerts for your database. Here is an overview the alerts that are available:
Server Level
At the server-level (or more correctly at the database cluster level), you can set alerts for:
Transaction ID range getting closer to 2 billion
Time since last checkpoint
Replication: number of inactive replication slots
Replication at primaries: write/flush/replay lag in bytes
Replication at standbys: replay lag in units of bytes
Replication at standbys: replay lag in units of time
WAL files: number of wal files in pg_wal (earlier pg_xlog) directory
WAL archiving: number of wal files ready for archiving
Number of backends waiting for locks
Number of backends idling in transaction
Number of backends with transactions open for more than a certain time
Here is the UI to set server-level alerts. You can add the new rule either as a warning or a critical severity alert
Database Level
At database-level, you can set these alerts:
Number of backends, as an absolute number
Number of backends, as a percentage of the maximum connections allowed to the database
Commit ratio, as a percentage
Transaction ID age (value of age(datfrozenxid)) as as a percentage of autovacuum_freeze_max_age setting (usually 2 billion)
Database size
Count of disabled triggers
Cache hit ratio, as a percentage
You can choose which databases to apply the rule to. You can either apply the to rule to all databases, or restrict to databases whose name contains/does not contain/starts with/ends with/is exactly a certain texts
Table Level
Table-level alerts include size and bloat. The full list is:
The time since the last auto/manual vacuum/analyze happened
The size of the table in units of bytes
The bloat, in units of bytes
The bloat, as a percentage of the table size
The tables the rule applies to can be chosen like with database-level rules. Additionally, you can also restrict the rule to all the tables within a certain database
Tablespace Level
Disk space rules are set at the tablespace level.
The size of the tablespace, in units of bytes.
The free disk space, as a percentage of the total disk space.
The free inode count, as a percentage of the total available inode count.
Alerting Status
Alerting rules are evaluated whenever you send in a pgmetrics report. These are displayed in the UI, and can also be sent out as email notifications. Here is the full UI, with a few triggered alerts:
Notifications
Alerts can be notified via email, to one or more email addresses. They can also be sent out to Slack channels. PagerDuty events can also be triggered. You can also create alerts in VictorOps. Here is a what an email looks like:
Last updated