網卡mtu 值不同導致rac 2節點ASM不能同時啟動 ORA-27550: Target ID protocol check failed.,mtuora-27550
在安裝完10205 rac 後準備配置 asm dg ,發現節點二上的ASM 沒法啟動:
en12 為private 網卡
en13 為 public 網卡
---->> 查看 oracle asm alert log 報錯顯示:
LMD0 started with pid=6, OS id=565728
LMS0 started with pid=7, OS id=491812
MMAN started with pid=8, OS id=594200
DBW0 started with pid=9, OS id=651626
LGWR started with pid=10, OS id=692672
CKPT started with pid=11, OS id=668066
SMON started with pid=12, OS id=696756
RBAL started with pid=13, OS id=700854
GMON started with pid=14, OS id=688448
Fri Feb 13 16:07:38 BEIST 2015
Maximum Tranmission Unit (mtu) of the ether adapter is different --------MTU 節點間不一致
on the node running instance 1, and this node.
Ether adapters connecting the cluster nodes must be configured
with identical mtu on all the nodes, for Oracle.
Please ensure the mtu attribute of the ether adapter on all
nodes are identical, before running Oracle.
Fri Feb 13 16:07:38 BEIST 2015
Errors in file /oracle/app/oracle/admin/+ASM/bdump/+asm2_lmon_582048.trc:
ORA-27550: Target ID protocol check failed. tid vers=%d, type=%d, remote instance number=%d, local instance number=%d
LMON: terminating instance due to error 27550
Fri Feb 13 16:07:39 BEIST 2015
System state dump is made for local instance
Fri Feb 13 16:07:39 BEIST 2015
Errors in file /oracle/app/oracle/admin/+ASM/bdump/+asm2_diag_614754.trc:
ORA-27550: Target ID protocol check failed. tid vers=%d, type=%d, remote instance number=%d, local instance number=%d
Fri Feb 13 16:07:39 BEIST 2015
Trace dumping is performing id=[cdmp_20150213160739]
Fri Feb 13 16:07:40 BEIST 2015
----查看節點一網路資訊:
[oracle@dg1:/oracle/app/oracle/admin/+ASM/bdump/]$netstat -in
Name Mtu Network Address Ipkts Ierrs Opkts Oerrs Coll
en12 9000 link#2 0.1a.64.bb.82.8 286059 0 57330 3 0
en12 9000 10 10.0.0.1 286059 0 57330 3 0
en13 9000 link#3 0.1a.64.bb.82.9 17267057 0 15456852 24 0
en13 9000 172.31.2 172.31.2.130 17267057 0 15456852 24 0
en13 9000 172.31.2 172.31.2.132 17267057 0 15456852 24 0
lo0 16896 link#1 380344 0 380439 0 0
lo0 16896 127 127.0.0.1 380344 0 380439 0 0
lo0 16896 ::1 380344 0 380439 0 0
----查看節點二網路資訊:
[oracle@dg2:/oracle/app/oracle/product/10/db/dbs/]$netstat -in
Name Mtu Network Address Ipkts Ierrs Opkts Oerrs Coll
en12 1500 link#2 0.1a.64.bb.82.64 104657 0 55568 3 0
en12 1500 10 10.0.0.2 104657 0 55568 3 0
en13 9000 link#3 0.1a.64.bb.82.65 1538972 0 761507 21 0
en13 9000 172.31.2 172.31.2.131 1538972 0 761507 21 0
en13 9000 172.31.2 172.31.2.133 1538972 0 761507 21 0
lo0 16896 link#1 112862 0 112928 0 0
lo0 16896 127 127.0.0.1 112862 0 112928 0 0
lo0 16896 ::1 112862 0 112928 0 0
[oracle@dg2:/oracle/app/oracle/product/10/db/dbs/]$ifconfig -a
en12: flags=5e080863,c0<UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,CHECKSUM_OFFLOAD(ACTIVE),PSEG,LARGESEND,CHAIN>
inet 10.0.0.2 netmask 0xffffff00 broadcast 10.0.0.255
tcp_sendspace 131072 tcp_recvspace 65536 rfc1323 0
en13: flags=5e080863,c0<UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,CHECKSUM_OFFLOAD(ACTIVE),PSEG,LARGESEND,CHAIN>
inet 172.31.2.131 netmask 0xffffff00 broadcast 172.31.2.255
inet 172.31.2.133 netmask 0xffffff00 broadcast 172.31.2.255
tcp_sendspace 262144 tcp_recvspace 131072 rfc1323 1
lo0: flags=e08084b<UP,BROADCAST,LOOPBACK,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT>
inet 127.0.0.1 netmask 0xff000000 broadcast 127.255.255.255
inet6 ::1/0
tcp_sendspace 131072 tcp_recvspace 131072 rfc1323 1
解決方案:
調整網卡mtu 值。注意需要關閉叢集。以及是否啟用巨幀。
未完待續。。。。。
~
~
BTW:
MTU
用途: 限制網路上傳輸的包的大小.
值: 預設值:configuration-dependent
顯示: lsattr -E -l interface_name
更改: chdev -l interface_name -a mtu=NewValue
使用命令 chdev,介面在使用時不能被更改。重新引導後更改才生效。
一種備用的方法如下:
ifconfig interface_name mtu NewValue
這方法在一個啟動並執行系統中更改 MTU 大小,但在系統重啟後不保留這個值。
診斷: 包片段統計.
調整: 增加網路介面的 MTU 的大小。對於 Gigabit Ethernet 適配器使用裝置屬性 jumbo_frames=yes 來啟用巨大的幀(僅設介面上的 MTU 值為 9000 是不夠的)。
如果是使用了網卡彙總,可能需要拆掉,在調整各網卡的mtu後在重新彙總,否則可能會碰到以下問題:
# chdev -l en12 -a mtu=9000
Method error (/usr/lib/methods/chgif):
0514-068 Cause not known.
0821-532 ioctl (SIOCSIFMTU): A return value of a math subroutine is not within machine precision.
0821-103 : The command /usr/sbin/ifconfig en12 inet 10.0.0.2 arp netmask 255.255.255.0 mtu 9000 up failed.
0821-007 cfgif: ifconfig command failed.
The status of"en12" Interface in the current running system is uncertain.
0821-103 : The command /usr/lib/methods/cfgif -len12 failed.
0821-532 ioctl (SIOCSIFMTU): A return value of a math subroutine is not within machine precision.
0821-103 : The command /usr/sbin/ifconfig en12 inet 10.0.0.2 arp netmask 255.255.255.0 mtu 9000 up failed.
0821-229 chgif: ifconfig command failed.
The status of"en12" Interface in the current running system is uncertain