Troubleshooting of shared storage failure caused by Oracle RAC node downtime in Linux

Source: Internet
Author: User

Environment:

Two HP ML570 Linux AS4.5 Oracle 10g

The two servers perform Oracle RAC and connect to HP MSA1000 through SAN Switch

Fault symptom:

Because the cabinet where one Oracle rac node is located is out of power, two rac nodes are down at the same time, in addition, all partitions in the four ocfs2 partitions mounted on Storage are lost (/dev/sda1 is changed to/dev/sda) and cannot be mounted. Therefore, Oracle services cannot be started.

Fault Analysis and troubleshooting:

Because the customer's DB data is not backed up, be careful when fixing it.

A. First, make sure that the Storage is correct in terms of hardware and connectivity.

B. Check that the OS is normal and the Storage can be accessed normally.

C. Restore the lost Partition Table

Because I used to set the partition, the number and size of the partitions are clear. Therefore, we will re-divide the partitions according to the last partition format to re-create the partition table, data should not be affected because the customer has not backed up the data. Therefore, this operation is highly risky, but this is the only option currently.

D. After fdisk ends, reboot server

A miracle occurred, the data was still there, and the service started normally.

Note: There is no absolute thing in the world, and there is no insurance. Although Oracle RAC is implemented, it can only ensure the redundancy of the two servers and cannot ensure the redundancy of Storage, therefore, we recommend that you implement a feasible backup policy in the future.

However, there is another problem that I have never figured out, that is, a node of RAC experiences a power failure. How can the Partition Table of Public partitions on Storage be lost?

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.