In this chapter, we will walk through the different types of failure scenarios that can occur in Cisco UCS. UCS solution components have excellent redundancy for critical equipment such as chassis and Fabric Interconnects. However, in an unexpected situation such as a physical component's failure, we should be able to identify the failed component and possibly conduct some troubleshooting before contacting Cisco TAC. The most common equipment that fails for the UCS are chassis/Fabric Interconnect power supplies, fan units, IOMs and SFPs for both IOMs, and Fabric Interconnect ports. If proper failover is configured for the network adapters (vNICs) and proper connectivity is configured for the storage adapters (vHBAs), the majority of single component failures do not result in data or management traffic disruption.
UCS failures may also be related to configuration issues, firmware...