Cisco UCS 5108 tool box fan failure handling

Source: Internet
Author: User

The user's UCS Manager, found chassis 2 6th slot fan Alarm, led does not light, the slot fan and other normal slot fan interchange, the fan is working properly, but the original good fan inserted in the 6th slot, still alarm, LED does not light. The initial judgment should be the problem with the knife case.

A case was opened to Cisco and the Cisco TAC Engineer was connected remotely to 6248, and the command to view the fan did work abnormally, with the following information:

magic:0x486f7403 # OK
Valid:1
pid:1554
Interval:15 # seconds
write_ts:1451028436 # Fri Dec 25 07:27:16 2015
stale_ts:1451028458 # Fri Dec 07:27:38 OK
now:1451028443 # Fri Dec 25 07:27:23 2015
Status:1 # ACTIVE
Policy_state:1 # COOL
Xreading:1 # Developer_mode:false
Hwconf_valid:1
Maxfans:8
FAN[1].FAULT/READ/REQ:0/30/30 # OK
FAN[2].FAULT/READ/REQ:0/30/30 # OK
FAN[3].FAULT/READ/REQ:0/30/30 # OK
FAN[4].FAULT/READ/REQ:0/30/30 # OK
FAN[5].FAULT/READ/REQ:0/30/30 # OK
Fan[6].fault/read/req:1/0/30 # MISSING
FAN[7].FAULT/READ/REQ:0/30/30 # OK
FAN[8].FAULT/READ/REQ:0/30/30 # OK

The above information allows you to see a 6th slot fan missing.

After reading the relevant information such as logs, the Cisco TAC Engineer suspects that the problem appears on the UCS5108 's internal bus and gives the following recommendations:

The first step:

Remove PSU1 let sit for 2 minutes replace, wait ten secondsconfirm PSU1 have power, Move to PSU2

Remove PSU2 let sit for 2 minutes replace, wait ten seconds haspower, Move to PSU3

Remove PSU3 let sit for 2 minutes replace, wait ten seconds Psu3has power, Move to PSU4

Remove PSU4 let sit for 2 minutes replace, wait ten seconds Psu4has power, Move to Fan1

Step Two:

Basically, the power supply reseat did not has a business impact,you could does it now.

If the issue still exist, please take the action below in Amaintenance window:

Remove Fan1 let sit for seconds replace, wait ten secondsconfirm Fan1 have power, Move to Fan2

Remove Fan2 let sit for seconds replace, wait ten secondsconfirm Fan2 have power, Move to Fan3

Remove Fan3 let sit for seconds replace, wait ten secondsconfirm Fan3 have power, Move to Fan4

Remove Fan4 let sit for seconds replace, wait ten secondsconfirm Fan4 have power, Move to Fan5

Remove Fan5 let sit for seconds replace, wait ten secondsconfirm Fan5 have power, Move to Fan6

Remove Fan6 let sit for seconds replace, wait ten secondsconfirm Fan6 have power, Move to Fan7

Remove Fan7 let sit for seconds replace, wait ten seconds confirmFan7 have power, Move to FAN8

Remove Fan8 let sit for seconds replace, wait ten secondsconfirm Fan8 has power

Step Three:

Remove right IOM, let sit for 5 minutes replace, Confirmthat IO MOD are up and Running before you reseat left IOM

Once right IOM was up and Running finally Reseat left IOM let Sitfor 5 minutes, and place it back into the chassis.

One final step:

If all the above does not fix the issue and then you need to power-cycle Thewhole chassis 2.

When I went to the third step, the alarm box above the fan was gone, but the command to view the fan still did not work properly, then took the last step, after the box restart is complete, the fan is working properly by command.

magic:0x486f7403 # OK
Valid:1
pid:1511
Interval:15 # seconds
write_ts:1451289000 # Mon Dec 28 07:50:00 2015
stale_ts:1451289022 # Mon Dec 07:50:22 OK
now:1451289010 # Mon Dec 28 07:50:10 2015
Status:1 # ACTIVE
Policy_state:1 # COOL
Xreading:1 # Developer_mode:false
Hwconf_valid:1
Maxfans:8
FAN[1].FAULT/READ/REQ:0/30/30 # OK
FAN[2].FAULT/READ/REQ:0/30/30 # OK
FAN[3].FAULT/READ/REQ:0/30/30 # OK
FAN[4].FAULT/READ/REQ:0/30/30 # OK
FAN[5].FAULT/READ/REQ:0/30/30 # OK
Fan[6].fault/read/req:0/30/30 # OK
FAN[7].FAULT/READ/REQ:0/30/30 # OK
FAN[8].FAULT/READ/REQ:0/30/30 # OK


This is a troubleshooting solution.

*************************************************************************************

PostScript: Although the replacement method can quickly locate the fault, but sometimes not necessarily accurate, the problem is not necessarily on the hardware, there may be a problem with the software.

Thanks Cisco TAC Lighting!

*************************************************************************************

This article is from the "Xunil" blog, make sure to keep this source http://136464.blog.51cto.com/126464/1729934

Cisco UCS 5108 tool box fan failure handling

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.