Alert Dashboard

The Alert Dashboard gives a high-level view of all alerts occurring across your infrastructure in a single interface. It combines data from multiple sources into a summary of alerts, so that users can monitor trends, identify patterns, and make informed decisions to improve system performance and reliability.

The Dashboard tab provides real-time statistics on infrastructure health and alert details in a visually simple format. Within the Dashboard tab, the following are available:

  • Data Source Insights: Gain insights into the sources of alerts with a dedicated tab, allowing users to identify the origin of issues and take appropriate actions.

  • Infrastructure Summaries: Access detailed summaries of critical infrastructure elements, such as IO Hosts, Containerized Clusters, and Applications, providing essential context for monitoring and management.

  • Interactive Elements: Navigate through various sections of the dashboard with interactive features, allowing users to drill down into specific areas for more detailed information and analysis.

  • Intuitive Interface: Designed with an intuitive interface, the dashboard offers a user-friendly experience for efficiently monitoring and managing infrastructure health and alert status.

1.png
  1. Alert Dashboard: This dashboard provides an intuitive interface for tracking severity levels and data sources, as well as detailed summaries of infrastructure elements.

  2. Severity Selection Dropdown filter: Choose the severity level to filter the displayed data on the dashboard. This allows you to focus on specific issues based on their severity.

    1. Critical

    2. Major

    3. Warning

    4. Minor

    5. Info

    6. Unknown

  3. Source filter: Select the data source from the dropdown menu to view relevant information sourced from Infrastructure Monitoring, AM, or other platforms.

    • IO

    • CO

    • Others

  4. Show Alerts For: Specify the time period for which alerts are shown. Choose a predefined option (the current day or the last 7, 15, or 30 days), or set a custom date and time range.

    timeframe.png
  5. Download: Users can download the alert data in PDF format by clicking on the 'Download' button. This allows them to save the alert information locally.

  6. Infrastructure Summary Widget: The Infrastructure Summary tab offers an overview of essential infrastructure components, including:

    • IO Hosts: Displays data for critical and warning events on IO hosts. Click on each host to view detailed information and graphs.

    • Containerized Clusters: Provides an overview of all containerized clusters. Click on each cluster to explore detailed metrics and visualizations.

    • Applications: Shows data for all applications. Click on each application to delve into specific details and analytics.

  7. Infrastructure Health Overview: Here, you will find detailed information about the health and performance of your infrastructure.

    The dashboard aggregates data from multiple sources to offer comprehensive insights into your infrastructure status.

    The count of unique alerts keeps users informed about critical events and potential issues.

    Noise reduction filters out redundant alerts so that users can focus on relevant information.

    A high alert correlation rate helps identify patterns and relationships between different alerts, which supports proactive action.

    Mean time to resolve (MTTR) provides an indication of issue resolution efficiency, contributing to operational effectiveness.

  8. Open Alerts Trend

    The Open Alerts Trend chart provides a graphical representation of the distribution of critical, major, and warning alerts over time.

    Graphical View: The chart displays bars representing the number of critical, major, and warning alerts recorded over a specified time period. Users can quickly identify trends and fluctuations in alert volumes.

    Hover Details: When users hover over a specific bar on the chart, detailed information about the alerts recorded during that time interval is displayed. This includes the number of critical, major, and warning alerts, as well as the specific date and time of the data point.

    Use the chart to see how alerts are distributed across severity levels and to identify periods of heightened alert activity, so that critical issues can be addressed as they arise.

    Clicking the "Open Alert List" link redirects users to the Alerts page, where they can access detailed information about the alerts. Refer to Alerts for more details.
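
The severity, source, and time-period filters, along with the MTTR figure, can be illustrated with a small sketch. This is not the product's API: the `Alert` record, its field names, and the `filter_alerts` and `mean_time_to_resolve` helpers are hypothetical, and the sketch assumes MTTR is the plain average of resolution times over resolved alerts.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


# Hypothetical alert record; field names are illustrative, not the product's schema.
@dataclass
class Alert:
    severity: str                            # "Critical", "Major", "Warning", "Minor", "Info", "Unknown"
    source: str                              # "IO", "CO", or "Others"
    opened_at: datetime
    resolved_at: Optional[datetime] = None   # None while the alert is still open


def filter_alerts(alerts, severity=None, source=None, days=7, now=None):
    """Keep alerts that match the severity/source filters and were opened in the last `days` days."""
    now = now or datetime.now()
    cutoff = now - timedelta(days=days)
    return [
        a for a in alerts
        if (severity is None or a.severity == severity)
        and (source is None or a.source == source)
        and cutoff <= a.opened_at <= now
    ]


def mean_time_to_resolve(alerts):
    """Average resolution time over resolved alerts; None if no alert is resolved."""
    durations = [a.resolved_at - a.opened_at for a in alerts if a.resolved_at]
    if not durations:
        return None
    return sum(durations, timedelta()) / len(durations)
```

Calling `filter_alerts(alerts, severity="Critical", source="IO", days=30)` mirrors choosing Critical, IO, and the last 30 days in the three filters; passing the result to `mean_time_to_resolve` gives the average resolution time for that selection.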

Dashboard Alert Details Section

In the Alert Details View of the dashboard, users can access various insights and metrics related to alerts. Here are the key components available in this view:

1.png
  • Applications by Alert Severity: This section provides a breakdown of alerts based on their severity levels within different applications. It allows users to quickly identify which applications are most affected by critical, major, warning, minor, info, or unknown alerts. Each bar shows a severity level with its alert count, and you can click a bar for more detail.

  • Alerts by Source Type: Users can view alerts categorized by their source type. This section helps in understanding the distribution of alerts originating from different sources such as CO, IO, or others.

  • Alerts by Time to Resolve: This section displays the distribution of alerts based on the time taken to resolve them. Users can assess the efficiency of their resolution processes and identify any trends or outliers in the time-to-resolve metrics.

  • IO Instances with Alerts: Here, users can view a list of IO instances that have generated alerts. It shows which hosts are experiencing issues or generating alerts, so that users can focus their attention on addressing them.

    Users can click on "Show more" to view details of IO instances, including those with the most alerts. They can display the top 5, 10, or 20 hosts for today or for the last 7, 15, or 30 days. A chart distinguishes major, warning, and critical alerts in red, orange, and yellow, respectively.

    IO.png
  • Containerized Clusters: This section lists containerized clusters that have generated alerts. Users can monitor the health of containerized environments and take proactive measures to address any issues affecting these clusters.

    Users can click on "Show more" to view details of containerized clusters, including those with the most alerts. They can display the top 5, 10, or 20 clusters for today or for the last 7, 15, or 30 days. A chart distinguishes major, warning, and critical alerts in red, orange, and yellow, respectively.

    cluster.png
  • Third-Party Hosts with Alerts: Users can see a list of third-party hosts that have triggered alerts. This helps in identifying any issues originating from third-party monitoring solutions.

    Users can click on "Show more" to view details of third-party hosts, including those with the most alerts. They can display the top 5, 10, or 20 hosts for today or for the last 7, 15, or 30 days. A chart distinguishes major, warning, and critical alerts in red, orange, and yellow, respectively.

    ipm_host_with_alerts.png
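
The "top 5, 10, or 20" selection in the "Show more" views amounts to ranking resources by alert count. A minimal sketch of that ranking follows; the `top_hosts` helper and the `(host, severity)` input shape are hypothetical and are not taken from the product.

```python
from collections import Counter


def top_hosts(alerts, n=10):
    """Rank hosts by alert count.

    `alerts` is an iterable of (host, severity) pairs (a hypothetical input shape).
    Returns up to `n` tuples of (host, total_alerts, per_severity_counts),
    ordered from most to fewest alerts.
    """
    totals = Counter()
    by_severity = {}
    for host, severity in alerts:
        totals[host] += 1
        by_severity.setdefault(host, Counter())[severity] += 1
    return [
        (host, count, dict(by_severity[host]))
        for host, count in totals.most_common(n)
    ]
```

The per-severity counts returned for each host correspond to the colored segments in the chart; the same ranking applies to clusters and third-party hosts.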

Quick Filter:

You can enter the name of any cluster, IO instance, or third-party host into the search box to view detailed information. The filter focuses on critical, major, and warning alerts and presents them in a chart for easy analysis.

quick_filter.png
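
The Quick Filter behaviour can be sketched as a name search followed by a per-severity count. This is an illustration only: the `quick_filter` helper is hypothetical, and case-insensitive substring matching is an assumption, since the matching rule used by the search box is not stated here.

```python
# Severities the Quick Filter chart focuses on.
CHART_SEVERITIES = ("Critical", "Major", "Warning")


def quick_filter(alerts, query):
    """Count critical, major, and warning alerts for resources whose name matches `query`.

    `alerts` is an iterable of (resource_name, severity) pairs (a hypothetical input shape).
    Matching is a case-insensitive substring test, which is an assumption.
    """
    needle = query.strip().lower()
    counts = {severity: 0 for severity in CHART_SEVERITIES}
    for name, severity in alerts:
        if needle in name.lower() and severity in CHART_SEVERITIES:
            counts[severity] += 1
    return counts
```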