The user's UCS Manager reported a fan alarm on slot 6 of chassis 2, and the fan's LED was not lit. We swapped the slot 6 fan with a fan from a healthy slot: the suspect fan worked properly in the other slot, but the known-good fan, once inserted in slot 6, still raised the alarm and its LED stayed off. The initial judgment was that the problem lay with the blade chassis rather than the fan.
A case was opened with Cisco, and the Cisco TAC engineer connected remotely to the 6248. The command for viewing fan status confirmed the abnormal state, with the following output:
magic:0x486f7403 # OK
Valid:1
pid:1554
Interval:15 # seconds
write_ts:1451028436 # Fri Dec 25 07:27:16 2015
stale_ts:1451028458 # Fri Dec 25 07:27:38 2015 OK
now:1451028443 # Fri Dec 25 07:27:23 2015
Status:1 # ACTIVE
Policy_state:1 # COOL
Xreading:1 # Developer_mode:false
Hwconf_valid:1
Maxfans:8
FAN[1].FAULT/READ/REQ:0/30/30 # OK
FAN[2].FAULT/READ/REQ:0/30/30 # OK
FAN[3].FAULT/READ/REQ:0/30/30 # OK
FAN[4].FAULT/READ/REQ:0/30/30 # OK
FAN[5].FAULT/READ/REQ:0/30/30 # OK
FAN[6].FAULT/READ/REQ:1/0/30 # MISSING
FAN[7].FAULT/READ/REQ:0/30/30 # OK
FAN[8].FAULT/READ/REQ:0/30/30 # OK
The output above shows that the fan in slot 6 is reported as missing.
After reviewing the logs and other relevant information, the Cisco TAC engineer suspected a problem on the UCS 5108's internal bus and gave the following recommendations:
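When the same dump has to be checked repeatedly, the fan lines can be scanned with a small script. The following is only a sketch in Python: the function names are hypothetical, the sample lines are modeled on the output above, and the reading of the three numbers as fault flag / measured value / requested value is inferred from the FAULT/READ/REQ label, not taken from Cisco documentation. Matching is case-insensitive, so the capitalization of the label does not matter.

```python
import re

# Sample lines modeled on the dump above (not a live capture).
SAMPLE = """\
FAN[5].FAULT/READ/REQ:0/30/30 # OK
FAN[6].FAULT/READ/REQ:1/0/30 # MISSING
FAN[7].FAULT/READ/REQ:0/30/30 # OK
"""

# Matches e.g. "FAN[6].FAULT/READ/REQ:1/0/30 # MISSING".
FAN_LINE = re.compile(
    r"fan\[(\d+)\]\.fault/read/req:\s*(\d+)/(\d+)/(\d+)\s*(?:#\s*(.*))?$",
    re.IGNORECASE,
)


def parse_fans(text):
    """Return one dict per fan line found in the dump text."""
    fans = []
    for line in text.splitlines():
        m = FAN_LINE.match(line.strip())
        if m is None:
            continue  # not a fan line (header, timestamp, etc.)
        slot, fault, read, req = (int(g) for g in m.group(1, 2, 3, 4))
        note = (m.group(5) or "").strip()
        fans.append(
            {"slot": slot, "fault": fault, "read": read, "req": req, "note": note}
        )
    return fans


def faulty_slots(fans):
    """Slots whose fault flag is non-zero (assumed to mean a problem)."""
    return [f["slot"] for f in fans if f["fault"] != 0]


if __name__ == "__main__":
    for slot in faulty_slots(parse_fans(SAMPLE)):
        print(f"fan slot {slot} reports a fault")
```

Run against the first dump, a script like this would single out slot 6, which is the same conclusion reached by reading the output by eye.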
Step One:
Remove PSU1, let it sit for 2 minutes, replace it, wait ten seconds, confirm PSU1 has power, then move to PSU2.
Remove PSU2, let it sit for 2 minutes, replace it, wait ten seconds, confirm PSU2 has power, then move to PSU3.
Remove PSU3, let it sit for 2 minutes, replace it, wait ten seconds, confirm PSU3 has power, then move to PSU4.
Remove PSU4, let it sit for 2 minutes, replace it, wait ten seconds, confirm PSU4 has power, then move to Fan1.
Step Two:
Basically, the power supply reseat does not have a business impact, so you can do it now.
If the issue still exists, please take the actions below in a maintenance window:
Remove Fan1, let it sit for seconds, replace it, wait ten seconds, confirm Fan1 has power, then move to Fan2.
Remove Fan2, let it sit for seconds, replace it, wait ten seconds, confirm Fan2 has power, then move to Fan3.
Remove Fan3, let it sit for seconds, replace it, wait ten seconds, confirm Fan3 has power, then move to Fan4.
Remove Fan4, let it sit for seconds, replace it, wait ten seconds, confirm Fan4 has power, then move to Fan5.
Remove Fan5, let it sit for seconds, replace it, wait ten seconds, confirm Fan5 has power, then move to Fan6.
Remove Fan6, let it sit for seconds, replace it, wait ten seconds, confirm Fan6 has power, then move to Fan7.
Remove Fan7, let it sit for seconds, replace it, wait ten seconds, confirm Fan7 has power, then move to Fan8.
Remove Fan8, let it sit for seconds, replace it, wait ten seconds, confirm Fan8 has power.
Step Three:
Remove the right IOM, let it sit for 5 minutes, then replace it. Confirm that the IO module is up and running before you reseat the left IOM.
Once the right IOM is up and running, reseat the left IOM: let it sit for 5 minutes, then place it back into the chassis.
Final step:
If none of the above fixes the issue, you need to power-cycle the whole of chassis 2.
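The recommendations above form an escalation sequence, from the least disruptive action to the most disruptive. As a rough illustration, the sequence can be written down as an ordered checklist in Python. Everything in this sketch (the ReseatStep class, its field names, build_plan) is hypothetical and only restates the e-mail. The sit time for the fans is not given in the text, so it is left as None rather than guessed, and marking the IOM and power-cycle steps as maintenance-window work is my reading of the instructions, not an explicit statement from TAC.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ReseatStep:
    component: str
    sit_seconds: Optional[int]   # None where the source gives no number
    confirm: str                 # what to verify before moving on
    maintenance_window: bool     # True if the step should wait for a window


def build_plan() -> List[ReseatStep]:
    """Ordered checklist restating the TAC recommendations."""
    plan: List[ReseatStep] = []
    # Step One: PSUs, 2 minutes out each, described as having no business impact.
    for n in range(1, 5):
        plan.append(ReseatStep(f"PSU{n}", 120, f"PSU{n} has power", False))
    # Step Two: fans, in a maintenance window; sit time missing in the source.
    for n in range(1, 9):
        plan.append(ReseatStep(f"Fan{n}", None, f"Fan{n} has power", True))
    # Step Three: IOMs one at a time, 5 minutes out each (window assumed).
    plan.append(ReseatStep("right IOM", 300, "right IOM is up and running", True))
    plan.append(ReseatStep("left IOM", 300, "left IOM is up and running", True))
    # Final step: power-cycle the whole chassis (window assumed).
    plan.append(ReseatStep("chassis 2 power cycle", None, "chassis is back up", True))
    return plan


if __name__ == "__main__":
    for i, step in enumerate(build_plan(), start=1):
        sit = f"{step.sit_seconds}s" if step.sit_seconds is not None else "unspecified"
        window = "maintenance window" if step.maintenance_window else "any time"
        print(f"{i:2d}. {step.component}: sit {sit}, confirm {step.confirm} ({window})")
```

Keeping the steps in one ordered list makes it easy to stop at the first step after which the fan status returns to normal, which is the point of working from the least disruptive action upward.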
When I reached step three, the fan alarm on the chassis cleared, but the command for viewing fan status still showed an abnormal state. I then carried out the final step. After the chassis finished restarting, the command showed the fan working properly:
magic:0x486f7403 # OK
Valid:1
pid:1511
Interval:15 # seconds
write_ts:1451289000 # Mon Dec 28 07:50:00 2015
stale_ts:1451289022 # Mon Dec 28 07:50:22 2015 OK
now:1451289010 # Mon Dec 28 07:50:10 2015
Status:1 # ACTIVE
Policy_state:1 # COOL
Xreading:1 # Developer_mode:false
Hwconf_valid:1
Maxfans:8
FAN[1].FAULT/READ/REQ:0/30/30 # OK
FAN[2].FAULT/READ/REQ:0/30/30 # OK
FAN[3].FAULT/READ/REQ:0/30/30 # OK
FAN[4].FAULT/READ/REQ:0/30/30 # OK
FAN[5].FAULT/READ/REQ:0/30/30 # OK
FAN[6].FAULT/READ/REQ:0/30/30 # OK
FAN[7].FAULT/READ/REQ:0/30/30 # OK
FAN[8].FAULT/READ/REQ:0/30/30 # OK
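Before trusting a dump like the one above, it is worth checking that the data is current. The sketch below, in Python, converts the epoch values to readable UTC times and tests whether now falls between write_ts and stale_ts. The function names are hypothetical, and the interpretation that a reading is fresh while now is earlier than stale_ts is an assumption based on the field names and the "OK" annotation, not on Cisco documentation.

```python
from datetime import datetime, timezone


def as_utc(ts: int) -> str:
    """Format an epoch timestamp the way the dump annotates it (UTC)."""
    return datetime.fromtimestamp(ts, tz=timezone.utc).strftime(
        "%a %b %d %H:%M:%S %Y"
    )


def is_fresh(write_ts: int, stale_ts: int, now: int) -> bool:
    """Assumed rule: data is fresh if written in the past and not yet stale."""
    return write_ts <= now < stale_ts


if __name__ == "__main__":
    # Values taken from the second dump above.
    write_ts, stale_ts, now = 1451289000, 1451289022, 1451289010
    print("written:", as_utc(write_ts))
    print("stale at:", as_utc(stale_ts))
    print("fresh:", is_fresh(write_ts, stale_ts, now))
```

With the values from the second dump, the reading was written ten seconds before it was viewed and had twelve seconds left before its stale time, so under this assumption the "all fans OK" result reflects the current state rather than a cached one.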
That is the full troubleshooting procedure that resolved the fault.
*************************************************************************************
Postscript: Swapping parts can locate a fault quickly, but it is not always accurate. The problem is not necessarily in the hardware; it may be in the software.
Thanks Cisco TAC Lighting!
*************************************************************************************
This article is from the "Xunil" blog; please keep this source link: http://136464.blog.51cto.com/126464/1729934
Cisco UCS 5108 blade chassis fan failure handling