The Solaris 10 Operating System (OS) introduces a new architecture for building and deploying systems and services capable of Predictive Self Healing. This technology enables Oracle Sun systems to accurately predict component failures and mitigate many serious problems — before they actually occur. Solaris Fault Manager and Solaris Service Manager are the two main components of Predictive Self Healing. Solaris Fault Manager receives data relating to hardware and software errors and automatically diagnoses the underlying problem. Once diagnosed, Solaris Fault Manager automatically responds by offlining faulty components. Solaris Service Manager makes services, rather than processes, into first-class citizens, permitting automatic self-healing. Base Solaris services have service descriptions which include full dependency information for start, stop, and restart; applications can easily be converted to run under Solaris Service Manager.
Predictive Self Healing is designed to maximize the availability of the system and application services by automatically diagnosing, isolating, and recovering from faults. This helps to not only reduce hardware failures but also to reduce the impact of application failures, leading to increased system and application availability.
With Solaris Fault Manager, the system automatically diagnoses faulty components, a function that in some cases can reduce analysis time from days to seconds. Once diagnosed, the system can quickly take corrective action and automatically restore application services. This powerful technology ensures that business critical applications and essential system services can continue uninterrupted in the event of software failures, major hardware component failures, and even software misconfiguration problems.
Customers can now deliver higher levels of availability and application services while minimizing downtime and associated administrative costs. Reduced downtime can potentially save companies $10,000 to $6 million per hour for mission-critical environments.
Solaris Service Manager reduces complexity by abstracting problem diagnosis and services in a manner that is transparent to users and applications. It simplifies common administrative tasks, speeds system boot, and significantly reduces human errors associated with system failures that can lead to service downtime and inefficient management of the system. Administrative tasks such as enabling and disabling services and changing properties are simplified and secure, with an undo capability to revert changes. In addition, service information is stored in a central repository, making the systems easier to manage and maintain.
Self-healing technology can also help improve the productivity of support staff. They can now spend much less time investigating and resolving issues, resulting in a higher ratio of supported systems per individual. Plus, a self-healing system can lead to reduced administration costs — systems that perform many complex tasks without user intervention require staff with less expertise, experience, and salary.
Solaris Fault Manager issues easy-to-understand diagnosis messages that link to knowledge articles at sun.com/msg. By providing system administrators with unique event IDs, they can access detailed information in knowledge articles, which describe what failure occurred and what the system did to fix it. These knowledge articles guide system administrators through any tasks that require human intervention, including repairs, and explain predicted or detected problems using clear language and links to repair procedures and documentation — all of which greatly reduces the complexity of repairing the system.
The scalable architecture of Solaris's Predictive Self Healing technology can be rapidly evolved to new problems and updated as new diagnosis and availability technologies are added to the system. Most future updates can be dynamically loaded and unloaded from the system while it is running and can be upgraded on the fly without requiring downtime or losing previous diagnosis data.
With businesses operating around the clock and demanding uninterrupted service, service availability is of paramount importance. Predictive Self Healing delivers the next generation of availability technology today, including features that keep systems and services running and simple for administrators. Over time, a rapidly evolving ecosystem of self-healing components can help provide consistent, easy-to-use, and always-available Sun systems.