Skip to main content

Alerts

Overview: Infrastructure Monitoring Alerts

Alerts are like important signals that show possible problems in a computer system. They need the attention of experts like DevOps, System Administrators, and SRE teams. Alerts are meant to make people aware and watch out for potential issues in the computer system. They act like early warning signs, helping teams notice and fix problems before they become serious.

What are Alerts?

In our system, alerts are like instant notifications that pop up when something isn't quite right in your computer setup. They're also called alarms. These alerts are super important because they help us spot and fix issues in how the system works before users notice anything wrong.

Navigating the Alert Page

To access the Alerts page, follow these steps:

NAVIHATING.png
  1. Navigate to AIOps Application.

  2. Click on Alerts.

Interface Layout and Components

DASHBOARD1.png
  1. Filter the alerts by sub-status, severity, priority and Manager.

  2. The alert status displays sufficient information for you to recognize an event's current state quickly.

  3. In AIOps alerts, the "search" tool lets users type in specific words to quickly find and filter important information in the alert system. You can focus on certain details, like alert descriptions, and see a group of alerts that match what you typed. The results will show all the records that match your words, especially in the Description column.

  4. Edit Columns: The "Edit Column" feature in AIOps alerts allows users to customize displayed columns, including entity name, type, ID, first seen, last seen, rule ID, and repeat count. This customization empowers users to choose, include, or exclude columns for a personalized and efficient monitoring experience. For example, users can prioritize specific information like entity names or rule IDs based on their preferences, extracting tailored insights from AIOps alert data.

  5. Show alerts for: Users have the option to select the time frame for which they want to view the alerts. Options include viewing alerts for the last 1, 2, 6, 24 hours, or 7 days and 30 days, enabling users to focus on alerts within a specific timeframe.

  6. Download: Users can download the alert data in PDF format by clicking on the 'Download' button. This allows them to save the alert information locally.

  7. Help: Click to open the online help.

  8. View the list of alerts and its details.

  9. Scroll bar: Use the scroll bar to navigate to the alert list.

Note

You can view the alert data on the dashboard according to the selected timeframe.

Edit Columns

Edit columns collectively offer a comprehensive overview of the alerts, their properties, and associated entities, facilitating efficient monitoring, analysis, and resolution within the AIOps platform.

ALERT_OVERSIGHT.png
  • Action: Describes the action taken or recommended based on the alert or insight.

  • Alert Description: Provides a brief description or summary of the alert.

  • Created By: Indicates the user or system responsible for creating the alert.

  • Entity ID: A unique identifier for the entity (e.g., device, application, service) associated with the alert.

  • Entity Name: The name or label of the entity associated with the alert.

  • Entity Type: Specifies the type of entity (e.g., server, network device, application) generating the alert.

  • First Seen: Timestamp indicating when the alert was first detected or observed.

  • Insights: Additional contextual information or analysis related to the alert or entity. You can click.

  • Last Seen: Timestamp indicating when the alert was last observed or updated.

  • Priority: Indicates the priority level or severity of the alert, often categorized as high, medium, or low.

  • Related Alerts: Displays any other alerts or incidents linked or correlated with the current alert.

  • Repeat Count: Number of times the alert has occurred or been triggered within a specific timeframe.

  • Severity: Level of severity associated with the alert, typically categorized as critical, major, minor, or informational.

  • Source: Identifies the source or origin of the alert

  • Status: The current status of the alert

  • Sub-Status: Additional status details or sub-categories to further categorize the alert state.

  • Updated By: Indicates the user or system responsible for the most recent update or action taken on the alert.