OCFS2 File System FAQs

Last Update:2018-12-05 Source: Internet

Author: User

Tags dmesg

Symptom 1:
mount -t ocfs2 -o datavolume,nointr /dev/sdb1 /webdata

mount.ocfs2: Transport endpoint is not connected while mounting /dev/sdb1 on /webdata. Check 'dmesg' for more information on this error.

Possible causes:

1: The firewall is enabled and is blocking the heartbeat port.

2: The values set with /etc/init.d/o2cb configure differ between nodes.

3: One node already has the volume mounted, while another node has only just been configured and had its ocfs2 service restarted. In this case, restarting the service on every node is enough to complete the mount.

4: SELinux is not disabled.

The following is a real case:

[root@test02 ~]# mount -t ocfs2 /dev/vg_ocfs/lv_u02 /u02
mount.ocfs2: Transport endpoint is not connected while mounting /dev/vg_ocfs/lv_u02 on /u02. Check 'dmesg' for more information on this error.

This error occurred because the nodes were running with different O2CB_HEARTBEAT_THRESHOLD values. I had set the same value on every node with /etc/init.d/o2cb configure, but forgot to restart o2cb on the first node, and only found this after a long investigation. The next step was to unmount the OCFS2 directory that was already mounted, which returned an error:

[root@test01 u02]# umount -f /u02
umount2: Device or resource busy
umount: /u02: device is busy
umount2: Device or resource busy
umount: /u02: device is busy

At this point, run /etc/init.d/ocfs2 stop and /etc/init.d/o2cb stop to stop OCFS2 and O2CB, and then unmount. Once OCFS2 and O2CB have been started again, the other nodes can mount the OCFS2 volume without problems.
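
Since the failure in this case came from mismatched O2CB_HEARTBEAT_THRESHOLD values, it can help to compare the setting across nodes before mounting. The following is a minimal sketch, assuming the value is stored as a KEY=value line in a file such as /etc/sysconfig/o2cb (the path is an assumption and varies by distribution) and that a copy of each node's file has been collected in one place. The function names are hypothetical. Note that the file only shows the configured value; as the case above shows, o2cb must still be restarted for a change to take effect.

```shell
#!/bin/sh
# Print the O2CB_HEARTBEAT_THRESHOLD value found in a KEY=value style file.
# The caller supplies the path; /etc/sysconfig/o2cb is only an assumed,
# commonly seen location.
o2cb_threshold() {
    sed -n 's/^O2CB_HEARTBEAT_THRESHOLD=//p' "$1" | tail -n 1
}

# Succeed only if every file passed as an argument carries the same value.
o2cb_thresholds_match() {
    first=$(o2cb_threshold "$1")
    for f in "$@"; do
        [ "$(o2cb_threshold "$f")" = "$first" ] || return 1
    done
    return 0
}
```

For example, o2cb_thresholds_match node1.o2cb node2.o2cb returns a non-zero status when the two collected files disagree.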

Symptom 2:
# /etc/init.d/o2cb online ocfs2

Starting cluster ocfs2: Failed

Cluster ocfs2 created

o2cb_ctl: Configuration error discovered while populating cluster ocfs2. None of its nodes were considered local. A node is considered local when its node name in the configuration matches this machine's host name.

Stopping cluster ocfs2: OK

This is a host name problem. Check the contents of /etc/ocfs2/cluster.conf and /etc/hosts (for example with more), and correct the host name where needed.

Note: For the OCFS2 file system to be mounted automatically at startup, add an entry for it to /etc/fstab. The host names and IP addresses of both nodes must also be listed in /etc/hosts so that they resolve, and each host name must be identical to the node name configured in /etc/ocfs2/cluster.conf.
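
The "None of its nodes were considered local" error means the machine's host name does not match any node name in the configuration, so a quick comparison can narrow the problem down. The sketch below assumes the usual cluster.conf stanza layout (node: and cluster: headers in the first column, followed by indented key = value lines). The function names are hypothetical.

```shell
#!/bin/sh
# List the "name" values found inside node: stanzas of a cluster.conf style
# file. The cluster: stanza also has a name entry, which is skipped.
cluster_node_names() {
    awk '
        /^[^ \t]/ { in_node = ($1 == "node:") }   # stanza header in column 1
        in_node && $1 == "name" && $2 == "=" { print $3 }
    ' "$1"
}

# Succeed if the given host name is exactly one of the configured node names.
is_local_node_configured() {
    cluster_node_names "$1" | grep -qxF -- "$2"
}
```

A typical call would be is_local_node_configured /etc/ocfs2/cluster.conf "$(hostname)"; a non-zero status suggests that either the host name or the node name in the file needs to be corrected.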

Symptom 3

Starting O2CB cluster ocfs2: Failed
The following error occurred when configuring o2cb after installing ocfs2:
[root@rac1 ocfs2]# /etc/init.d/o2cb configure
Processing the O2CB driver.

This will configure the on-boot properties of the O2CB driver.
The following questions will determine whether the driver is loaded on
boot. The current values will be shown in brackets ('[]'). Hitting
<ENTER> without typing an answer will keep that current value. Ctrl-C
will abort.

Load O2CB driver on boot (y/n) [y]:
Cluster to start on boot (Enter "none" to clear) [ocfs2]:
Specify heartbeat dead threshold (>=7) [7]:
Writing O2CB configuration: OK
Starting O2CB cluster ocfs2: Failed
Cluster ocfs2 created
o2cb_ctl: Configuration error discovered while populating cluster ocfs2. None of its nodes were considered local. A node is considered local when its node name in the configuration matches this machine's host name.
Stopping O2CB cluster ocfs2: OK

In this case, OCFS2 has not been configured yet. There is a graphical OCFS2 configuration tool; you must configure the cluster with it first, and it is best to use an IP address instead of a host name!

In other words, the OCFS2 node configuration file must be set up correctly before ocfs2 is started; otherwise an error is reported. When configuring through the graphical interface, /etc/ocfs2/cluster.conf should be an empty file, or an error will be reported!

Symptom 4
Mounting the OCFS2 file system fails with:
mount.ocfs2: Error when attempting to run /sbin/ocfs2_hb_ctl: "Operation not permitted"
mount -t ocfs2 -o datavolume /dev/sdb1 /u02/oradata/orcl
ocfs2_hb_ctl: Bad magic number in superblock while reading uuid
mount.ocfs2: Error when attempting to run /sbin/ocfs2_hb_ctl: "Operation not permitted"

This error occurs because the partition intended for the OCFS2 file system has not been formatted. The partition must be formatted as OCFS2 (for example with mkfs.ocfs2) before it can be mounted.

Symptom 5:
Configuration assistant "Oracle Cluster Verification Utility" failed
Question about a 10g RAC installation (Oracle 10.2.0.1 on Solaris 5.9, two nodes): the last step of the CRS installation fails, and I do not know how to solve it.

LOG information:
INFO: Configuration assistant "Oracle Cluster Verification Utility" failed
-----------------------------------------------------------------------------
*** Starting OUICA ***
Oracle Home set to /orabase/product/10.2
Configuration directory is set to /orabase/product/10.2/cfgtoollogs. All xml files under the directory will be processed
INFO: The "/orabase/product/10.2/cfgtoollogs/configToolFailedCommands" script contains all commands that failed, were skipped or were canceled. This file may be used to run these configuration assistants outside of OUI. Note that you may have to update this script with passwords (if any) before executing the same.
-----------------------------------------------------------------------------
SEVERE: OUI-25031: Some of the configuration assistants failed. It is strongly recommended that you retry the configuration assistants at this time. Not successfully running any "Recommended" assistants means your system will not be correctly configured.
1. Check the Details panel on the Configuration Assistant Screen to see the errors resulting in the failures.
2. Fix the errors causing these failures.
3. Select the failed assistants and click the 'retry' button to Retry them.
INFO: User Selected: Yes/OK

This happens because the VIP address has not been started. We recommend that, once orainstRoot.sh and root.sh have finished, you open a new window and run vipca. After all CRS services are up, retry the final verification step.

Run crs_stat -t from the bin directory of the CRS home to check whether all services have started. In this case, the VIP was unavailable.
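
To spot resources such as the VIP that have not started, the crs_stat -t listing can be filtered for anything that is not ONLINE. The sketch below assumes the tabular output has the columns Name, Type, Target, State and Host in that order. The function name and the sample resource names are hypothetical.

```shell
#!/bin/sh
# Read "crs_stat -t" style output on stdin and print the name of every
# resource whose State column (assumed to be the 4th field) is not ONLINE.
# The header row and the dashed separator line are skipped.
crs_not_online() {
    awk '$1 != "Name" && NF >= 4 && $4 != "ONLINE" { print $1 }'
}
```

Usage would be along the lines of crs_stat -t | crs_not_online. Empty output means every listed resource reports ONLINE.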

Symptom 6:
Failed to upgrade Oracle Cluster Registry configuration
After installing CRS, running ./root.sh on the second node shows the following messages, although it ran normally on the first node. Any advice would be appreciated. Thank you!
[root@RACtest2 crs]# ./root.sh
WARNING: directory '/app/oracle/product/10.2.0' is not owned by root
WARNING: directory '/app/oracle/product' is not owned by root
WARNING: directory '/app/oracle' is not owned by root
WARNING: directory '/app' is not owned by root
Checking to see if Oracle CRS stack is already configured

Setting the permissions on OCR backup directory
Setting up NS directories
PROT-1: Failed to initialize ocrconfig
Failed to upgrade Oracle Cluster Registry configuration

Cause:

The permissions on the devices used by CRS are wrong. For example, if the OCR and the voting disk are placed on raw devices, you must set the ownership and permissions on those devices and on the files linked to them. The following is my environment:

[root@rac2 javasrs]#
lrwxrwxrwx 1 root 13 Jan 27 :49 ocr.crs -> /dev/raw/raw1
lrwxrwxrwx 1 root 13 Jan 26 13:31 vote.crs -> /dev/raw/raw2

chown root:oinstall /dev/raw/raw1
chown root:oinstall /dev/raw/raw2
chmod 660 /dev/raw/raw1
chmod 660 /dev/raw/raw2

Here /dev/sdb1 holds the OCR and /dev/sdb2 holds the voting disk.

[root@rac2 upload RS]# service rawdevices reload
Assigning devices:
/dev/raw/raw1 --> /dev/sdb1
/dev/raw/raw1: bound to major 8, minor 17
/dev/raw/raw2 --> /dev/sdb2
/dev/raw/raw2: bound to major 8, minor 18
done

Then run root.sh again:

[root@rac2 license RS]# /oracle/app/oracle/product/crs/root.sh

WARNING: directory '/oracle/app/oracle/product' is not owned by root
WARNING: directory '/oracle/app/oracle' is not owned by root
Checking to see if Oracle CRS stack is already configured
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/oracle/app/oracle/product' is not owned by root
WARNING: directory '/oracle/app/oracle' is not owned by root
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
Assigning default hostname rac1 for node 1.
Assigning default hostname rac2 for node 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name>
node 1: rac1 priv1 rac1
node 2: rac2 priv2 rac2
clscfg: Arguments check out successfully.

Symptom 7
Startup will be queued to init within 90 seconds
Running root.sh on the installed node produces the following output:
[root@rac2 OraHome1]# ./root.sh
WARNING: directory '/oracle' is not owned by root
Checking to see if Oracle CRS stack is already configured
/etc/oracle does not exist. Creating it now.

Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/oracle' is not owned by root
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
Assigning default hostname rac1 for node 1.
Assigning default hostname rac2 for node 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name>
node 1: rac1 vip1 rac1
node 2: rac2 vip2 rac2
clscfg: Arguments check out successfully.

No keys were written. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 90 seconds.
The script hangs at this point. Check the log ocrconfig_7758.log:
::::::::::::::
Oracle Database 10g CRS Release 10.2.0.1.0 Production Copyright 1996, 2005 Oracle. All rights reserved.
22:47:09.537: [OCRCONF] [3086919360] ocrconfig starts...
22:47:09.541: [OCRCONF] [3086919360] Upgrading OCR data
22:47:09.649: [OCRRAW] [3086919360] propriogid: 1: INVALID FORMAT
22:47:09.660: [OCRRAW] [3086919360] ibctx: 1: ERROR: INVALID FORMAT
22:47:09.660: [OCRRAW] [3086919360] proprinit: problem reading the bootblock or superbloc 22

22:47:09.661: [default] [3086919360] a_init: 7!: Backend init unsuccessful: [22]
22:47:09.662: [OCRCONF] [3086919360] Exporting OCR data to [OCRUPGRADEFILE]
22:47:09.663: [OCRAPI] [3086919360] a_init: 7!: Backend init unsuccessful: [33]
22:47:09.663: [OCRCONF] [3086919360] There was no previous version of OCR. error: [PROC-33: Oracle Cluster Registry is not configured]
22:47:09.666: [OCRRAW] [3086919360] propriogid: 1: INVALID FORMAT
22:47:09.668: [OCRRAW] [3086919360] ibctx: 1: ERROR: INVALID FORMAT
22:47:09.668: [OCRRAW] [3086919360] proprinit: problem reading the bootblock or superbloc 22

22:47:09.668: [default] [3086919360] a_init: 7!: Backend init unsuccessful: [22]
22:47:09.672: [OCRRAW] [3086919360] propriogid: 1: INVALID FORMAT
22:47:09.673: [OCRRAW] [3086919360] ibctx: 1: ERROR: INVALID FORMAT
22:47:09.673: [OCRRAW] [3086919360] proprinit: problem reading the bootblock or superbloc 22

First, check whether the firewall is disabled:

Check for and disable UDP ICMP rejection

During the Linux installation, I chose not to configure the firewall. By default, the installer selects the option to configure a firewall. This has caught me out several times, so I double-check that the firewall is not configured and that UDP ICMP filtering is turned off.

If UDP ICMP is blocked or rejected by the firewall, the Oracle cluster software will crash after running for several minutes. When the Oracle cluster process fails, entries like the following appear in the <machine_name>_evmocr.log file:

08/29/2005 22:17:19
oac_init: 2: Could not connect to server, clsc retcode = 9
08/29/2005 22:17:19
a_init: 12!: Client init unsuccessful: [32]
ibctx: 1: ERROR: INVALID FORMAT
proprinit: problem reading the bootblock or superbloc 22

If you see this error, the solution is to remove the UDP ICMP (iptables) rejection rule, or simply turn the firewall off. The Oracle cluster software will then start and run normally without crashing. Run the following commands as the root user:

1. Check whether the firewall is turned off. If it is (as in the example below), you do not need to continue with the remaining steps.
# /etc/rc.d/init.d/iptables status
Firewall is stopped.

2. If the firewall is enabled, stop it manually to disable UDP ICMP rejection:
# /etc/rc.d/init.d/iptables stop
Flushing firewall rules: [OK]
Setting chains to policy ACCEPT: filter [OK]
Unloading iptables modules: [OK]

3. Keep UDP ICMP rejection disabled across server restarts (it should always be disabled):
# chkconfig iptables off

If none of the above is the problem, we recommend first wiping the OCR and voting disk contents with dd, then setting the permissions again, and then re-running root.sh.
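
Clearing the OCR and voting disk with dd could look roughly like the sketch below. It is deliberately written as a function that takes the target and the amount to overwrite as arguments, because the source does not say how much needs to be wiped; the function name and any size you pass are assumptions. The operation is destructive, so double-check the target before running it.

```shell
#!/bin/sh
# Overwrite the first N MiB of a target (device or file) with zeros.
# DESTRUCTIVE: everything in that range is lost.
# Usage: zero_header <target> <mib>
zero_header() {
    # conv=notrunc keeps a regular file at its original length;
    # it makes no difference for a device node.
    dd if=/dev/zero of="$1" bs=1048576 count="$2" conv=notrunc 2>/dev/null
}
```

For instance, zero_header /dev/raw/raw1 10 would zero the first 10 MiB of the raw device bound to the OCR partition in the earlier example; the figure 10 is purely illustrative. Afterwards, reapply the chown and chmod settings before running root.sh again.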

For comparison, this is the output of root.sh on my first node:
[root@node1 crs10.2.0]# ./root.sh
WARNING: directory '/ora10g/product' is not owned by root
WARNING: directory '/ora10g' is not owned by root
Checking to see if Oracle CRS stack is already configured
/etc/oracle does not exist. Creating it now.

Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/ora10g/product' is not owned by root
WARNING: directory '/ora10g' is not owned by root
Assigning default hostname node1 for node 1.
Assigning default hostname node2 for node 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name>
node 1: node1 privnode1 node1
node 2: node2 privnode2 node2
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /ocfs/votedisk.dat
Format of 1 voting devices complete.
Startup will be queued to init within 90 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
CSS is active on these nodes.
node1
CSS is inactive on these nodes.
node2
Local node checking complete.
Run root.sh on remaining nodes to start CRS daemons.
[root@node1 crs10.2.0]#

Symptom 8:
CRS-0215: Could not start resource 'ora.orcl.orcl1.inst'.
$ srvctl start instance -d orcl -i orcl1
PRKP-1001: An error occurred while starting instance orcl1 on node znawdb1
CRS-0215: Could not start resource 'ora.orcl.orcl1.inst'.

This problem occurs because the storage that holds the database data files, either the OCFS2 file system or the ASM disk group, is not mounted. For example, my environment is raw + ASM. The ASM instance was not running, so I started it with srvctl start asm -n rac1, but that start did not mount the disk group. As a result, running srvctl start instance -d orcl -i orcl1 again still failed with CRS-0215: Could not start resource 'ora.orcl.orcl2.inst'.

Execute alter diskgroup dgroup1 MOUNT in the ASM instance, then run srvctl start instance -d orcl -i orcl1 again.

Symptom 9
CRS-0223: Resource 'ora.rac1.LISTENER_RAC1.lsnr' has placement error.
The error message is as follows:

[oracle@rac1 admin]$ srvctl start nodeapps -n rac1