OpsRamp Expands AIOps and Cloud Native
detailed new topology maps, enhanced
artificial intelligence for IT operations (AIOps)
features and new monitoring capabilities for
cloud native workloads. The OpsRamp Winter
Release provides greater service-centricity
and context for hybrid infrastructure
monitoring and management allowing
enterprise IT teams to embrace more
intelligent incident management and deliver
exceptional customer experiences.
OpsRamp’s topology discovery now includes:
•Application Topology. OpsRamp discovers more than forty popular enterprise applications and establishes topological relationships between application components and infrastructure.
•Hypervisor Topology. OpsRamp discovers virtual machines, hypervisor servers and clusters in VMware vSphere and KVM environments and their relationships.
•Enhanced Service Maps. Representing
logical IT services, OpsRamp service maps
now have a new user interface that makes it
easy to identify underlying resources behind
an IT service outage so that operations
teams can hone in on the right course of
action to restore services.
New features include:
•Auto-Incident Creation and Routing. The biggest priority during a major outage is to assign and route incidents to the right on-call teams. OpsRamp’ OpsQ now enables automatic incident creation and routing using alert escalation policies to auto-assign incidents based on prior alert, incident, and notification data. Machine learning-driven alert escalation uses specific learned patterns (assignee groups, business impact, urgency, and priority) to route incident assignments for different types of alerts.
•Augmented Training for Inference Models. OpsRamp’s machine learning-based inference models correlate alerts linked by a common cause using historical alert data. Opsramp’s OpsQ now allows users to augment these models with additional user-provided training data. With such augmented training, IT operations teams can bootstrap OpsQ to recognize alert sequences that are uncommon in everyday operations, but important to identify when they occur.
•Frequency-Driven Alert Escalation.
OpsQ now supports policies to escalate
alerts based on how often an alert has
recently occurred. With frequency-based
alerting, operations teams can filter out
alerts that flap only occasionally and
escalate alerts that flap repeatedly.
•Cloud Native Monitoring. Enterprises are increasingly adopting cloud native technologies like Docker containers and Kubernetes container orchestration for faster time-to-market. OpsRamp now discovers and monitors Kubernetes environments across on-prem and cloud services like Azure Kubernetes Services, Google Kubernetes Engine, and Amazon Elastic Container Service for Kubernetes. DevOps teams can understand the total services (nodes and containers for each cluster, a breakdown of pods by namespace) and resource trends (CPU and memory utilization) for each Kubernetes cluster.
•Cloud Event Monitoring. Events
are a key medium of communication for
operational issues in the public cloud.
Given that events are a primary source of
signal in multi-cloud environments, OpsRamp
can now collect, aggregate, correlate and
escalate events from AWS services such as
AWS Health, ECS, Redshift, Data Migration
Services, and CloudWatch. With this
capability, OpsRamp serves as a single point
of monitoring, management, and remediation
for cloud events across multiple cloud