disconnect all of the PCIe cards currently active in the source domain, using the
cfgadm command on each card. If the XSB that represents the CMU/IOU to be
moved has the attribute no-io=true, then it is not necessary to run the cfgadm
command. If the PCIe card is active and has not been disconnected from the
domain, the command to remove the source CMU/IOU will fail.
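
The rule above can be sketched in a few lines of Python. This is an illustrative model only, not a tool shipped with the platform: the Xsb class, its field names, and the attachment point IDs in the usage note are hypothetical, and the generated strings assume the standard "cfgadm -c disconnect <ap_id>" form of the command.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Xsb:
    """Hypothetical model of an XSB and the active PCIe cards behind it."""
    name: str
    no_io: bool  # mirrors the no-io=true attribute described in the text
    active_pcie_ap_ids: List[str] = field(default_factory=list)


def disconnect_commands(xsb: Xsb) -> List[str]:
    """Return the cfgadm commands to run before the CMU/IOU is removed.

    An XSB with no-io=true needs no cfgadm step, so the list is empty.
    Otherwise every active PCIe card gets one disconnect command.
    """
    if xsb.no_io:
        return []
    return [f"cfgadm -c disconnect {ap_id}" for ap_id in xsb.active_pcie_ap_ids]
```

For example, calling disconnect_commands(Xsb("01-0", False, ["iou#1-pci#1", "iou#1-pci#2"])) yields two disconnect commands, one per card, while the same XSB with no_io set to True yields none. The XSB name and attachment point IDs here are placeholders; on a real system the IDs would be taken from the cfgadm listing for the source domain.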
2.3.3 Fault Isolation
Domains are protected against software or hardware failures in other domains.
Failures in hardware shared between domains cause failures only in those domains
that share the hardware. When a domain encounters a fatal error, a domainstop
operation occurs that cleanly and quickly shuts down only the domain with the
error. Domainstop operates by shutting down the paths in and out of the system
address controller and the system data interface ASICs. The shutdown is intended
to prevent further corruption of data and to facilitate debugging by preventing the
failure from being masked by continued operation.
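
The containment rule described above (a failure in shared hardware stops only the domains that share it) can be illustrated with a small Python sketch. The component names, domain IDs, and the affected_domains helper are hypothetical and exist only to show the idea; they do not reflect how the system controller is implemented.

```python
from typing import Dict, Set


def affected_domains(failed_component: str, usage: Dict[str, Set[int]]) -> Set[int]:
    """Return the domains that a domainstop would shut down.

    `usage` maps each hardware component to the set of domain IDs that
    use it. Only domains sharing the failed component are affected; a
    component that no domain uses affects nothing.
    """
    return set(usage.get(failed_component, set()))


# Hypothetical layout: one board shared by domains 0 and 1, one used only by domain 2.
usage = {"board-A": {0, 1}, "board-B": {2}}
stopped = affected_domains("board-A", usage)
running = {0, 1, 2} - stopped
```

In this layout a fatal error on board-A stops domains 0 and 1, and domain 2 keeps running.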
When certain hardware errors occur in a Sun SPARC Enterprise M-Series
server, the system controller performs specific diagnosis and domain recovery
steps. The following automatic diagnosis engines identify and diagnose hardware
errors that affect the availability of the system and its domains:
- eXtended System Control Facility (XSCF) diagnosis engine: Diagnoses
  hardware errors associated with domain operations.
- Oracle Solaris operating system diagnosis engine: Identifies nonfatal
  domain hardware errors and reports them to the system controller.
- POST diagnosis engine: Identifies any hardware test failures that occur
  when the power-on self-test runs.
In most situations, hardware failures that cause a domain crash are detected
and eliminated from the domain configuration, either by the power-on self-test
(POST) or by OpenBoot PROM, during the subsequent automatic recovery boot
of the domain.
2.3.4 Dynamic Reconfiguration
Dynamic Reconfiguration (DR) allows resources to be dynamically reallocated or
balanced between domains. Utilizing this technology enables a physical or logical
restructuring of the hardware components of Sun SPARC Enterprise M-Series
servers even as the system continues running and the applications remain
available. This high degree of resource flexibility allows the domain or platform
 
 
 