The Notifications Server has slowed down and is unable to keep up with generating notifications for the alerts raised. Acknowledgements that notifications were sent through the channels have also slowed down. This may happen because of:
1. Alert storms - more alerts have been generated than the Notification server can handle in a reasonable time period.
2. The Operations DB is overloaded and has caused an overall system slowdown.
Notification acknowledgements usually slow down because the Notifications Resource Pool, and its member Management server that hosts the Notification subsystem, is overloaded. It is also possible that the Operations database is under heavy load or that its resources are being exhausted.
If these alerts are generated during or shortly after installation of the Data Warehouse or reporting components, the likely cause is that the flow of alerts and state transitions to the DW has been opened before the corresponding data structures exist, because the Data Warehouse has not yet deployed its components. Deployment happens within ~30 minutes of installation.
Until deployment is complete, the alert and state sync workflows retain the data so that it can be written to the DW later. As a result, these workflows do not put data back to the data sources, which in turn generates the alerts in question. Once deployment is complete, the data is written and the alerts clear automatically.
If the alerts do not auto-resolve within 120 minutes of Data Warehouse installation, another problem exists.
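
The timing guidance above can be expressed as a small triage check. The following is only an illustrative sketch in Python: the function name, its parameters, and the returned labels are hypothetical and are not part of the product. The 30-minute and 120-minute thresholds are the only values taken from the text.

```python
from datetime import datetime, timedelta

# Thresholds quoted in the text; the constant names are hypothetical.
DEPLOYMENT_WINDOW = timedelta(minutes=30)     # DW components deploy within ~30 minutes
AUTO_RESOLVE_WINDOW = timedelta(minutes=120)  # alerts should clear within 120 minutes


def triage_alert(dw_installed_at: datetime, now: datetime, alert_open: bool) -> str:
    """Return a triage hint for an alert seen after Data Warehouse installation."""
    if not alert_open:
        return "resolved"
    elapsed = now - dw_installed_at
    if elapsed < timedelta(0):
        raise ValueError("now must not be earlier than the installation time")
    if elapsed <= DEPLOYMENT_WINDOW:
        # Deployment is probably still running; the alert is expected.
        return "expected"
    if elapsed <= AUTO_RESOLVE_WINDOW:
        # Deployment should be done; give the alert time to clear on its own.
        return "wait"
    # Past the auto-resolve window, so another problem likely exists.
    return "investigate"
```

For example, an alert that is still open 150 minutes after installation would be labelled "investigate", which corresponds to the case where another problem exists.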