AIME Autonomous Infrastructure Management Engine
AI Orchestration and Management Functionality
The Autonomous Infrastructure Management Engine (AIME) is the artificial intelligence orchestration and management functionality that powers Scale Computing HyperCore. It drastically reduces the effort required to deploy, secure, manage, and maintain on-premises infrastructure.
AIME is a hand-built model of the environment the cluster is running in, built in such a way that thinks about the state that the system is in. Think of it like a digital twin where it's modeling the reality that the cluster is sitting in, including what's happening on the hardware and the cluster but the surrounding environment.
AIME builds a model of the state of the system that allows SC//HyperCore to handle day-to-day operational administrative tasks and maintenance automatically, monitors the system for security, hardware, and software errors, and remediates those errors where possible. It identifies the root cause and minimizes the impact of those issues when it cannot repair them automatically, notifying users with specific problem determination and action, versus just sending a stream of data that must be interpreted. This includes actions to secure the environment. It also maintains current firmware, driver, and OS versions for security and stability purposes.
Benefits of Automated Infrastructure Management
Minimize IT Support
AIME's comprehensive monitoring, proactive problem detection, and automated remediation swiftly address issues and maintain system health without constant manual intervention.
Eliminate Confusing Logs
Precise problem determination and actionable insights streamline troubleshooting, eliminating the need for deciphering confusing logs and providing clear guidance on addressing issues.
Automate Remediation
By leveraging its understanding of the system's state and conditions, AIME automates remediation tasks and swiftly addresses issues to minimize downtime.
AIOps is a Requirement for Modern Infrastructure
Platform//2024 On-Demand
In this on-demand session recording from Platform//2024, Mike Lyon, Sr. Solutions Architect, introduces AIME and explains how AIOps has become a requirement of modern infrastructure.
AIME Overview
State machines
Enhanced operational efficiency, proactive management, and overall infrastructure stability and performance
The core intelligence in the system is a hierarchical finite state machine. AIME’s state machine constantly monitors and models the reality that the cluster is currently operating in, and then using that model, AIME can trigger appropriate actions. AIME’s model of reality includes the hardware and software the cluster is running, the physical environment (temperature, power, cabling, etc.), and the logical environment (networking and external services).
Given its extensive understanding of the environment, AIME can take many actions to maximize performance, security, and uptime for all your workloads.
The state machine model itself encodes the knowledge of over a decade of experience running tens of thousands of clusters in the field and improving that model release after release. It can model an astronomical number of possible states and take appropriate actions from any of those states.
Conditions
Simplifying problem identification for the state machine
Below the sophisticated intelligence of the state machines, there is a simpler, but, essential, compartmentalized layer of intelligence—conditions.
Conditions are boolean flags that indicate problem areas.
Conditions point directly to problems, making it simple for the state machine to understand the problem and enact fixes or an administrator when the solution requires human intervention. A concise list of problems leads to understanding the root cause quickly, and conditions often indicate what actions are required to return the cluster to a healthy state.
Checked values
Ensuring the data can be trusted
Underpinning AIME is the compartmentalized data layer. As the old adage goes, “Garbage in, garbage out.” If we are going to trust AIME to take action to repair problems on the cluster, we have to know that it’s going to make good decisions. This can only happen if it can trust the data it uses to make decisions. To ensure the data can be trusted, we use carefully controlled variables we call checked values.
Conditions are boolean flags that indicate problem areas.
Checked values are self-monitoring and recognize when they contain bad or old data. Reading bad data is disallowed, and the conditions and state machines that rely on that data know the data can’t be used. This prevents taking actions on the cluster that could be harmful due to bad or stale data.
Artificial Intelligence Orchestration and Management Functionality
AIME is a sophisticated AI engine that empowers organizations with automated efficiency, proactive security measures, and streamlined maintenance protocols.
Learn more about Scale Computing HyperCore's Features
Hypervisor
SC//HyperCore's software layer includes a lightweight, type-1 hypervisor that directly integrates into the OS kernel, leveraging virtualization offload capabilities provided by modern CPUs.
SCRIBE
Scale Computing Reliable Independent Block Engine is an enterprise-class, clustered, block storage layer and critical component of Scale Computing HyperCore.
HEAT
The addition of flash storage in SC//HyperCore nodes allows more capabilities in the SCRIBE storage architecture with the HyperCore Enhanced Automated Tiering (HEAT) feature.
SC//HyperCore
Move beyond traditional IT silos with efficient, lightweight, all-in-one architecture that deploys fully integrated and highly available virtualization.
See What Others Have To Say About Scale Computing
We are proud to have over more than a thousand positive customer reviews on the most trusted third-party industry reviews websites. Read what our customers have to say on Gartner Peer Insights, G2, and TrustRadius.