Sunday, 8 June 2014

Node-2 root.sh fails with CRS-2674

During RAC 11gr2 installtion while running root.sh on node-2 you may face below network due to sub-netmask incorrect between nodes.

ERROR:
======
root.sh fails on node-2

2011-08-04 18:26:05: output for start nodeapps is  PRCR-1013 : Failed to start resource ora.net1.network PRCR-1064 : Failed to start resource ora.net1.network on node node-lab-02 CRS-2674: Start of 'ora.net1.network' on 'node-lab-02' failed PRCR-1013 : Failed to start resource ora.ons PRCR-1064 : Failed to start resource ora.ons on node node-lab-02 CRS-2674: Start of 'ora.net1.network' on 'node-lab-02' failed
2011-08-04 18:26:05: output of startnodeapp after removing already started mesgs is PRCR-1013 : Failed to start resource ora.net1.network PRCR-1064 : Failed to start resource ora.net1.network on node node-lab-02 CRS-2674: Start of 'ora.net1.network' on 'node-lab-02' failed PRCR-1013 : Failed to start resource ora.ons PRCR-1064 : Failed to start resource ora.ons on node node-lab-02 CRS-2674: Start of 'ora.net1.network' on 'node-lab-02' failed
2011-08-04 18:26:05: /u01/app/11.2.0/grid/bin/srvctl start nodeapps -n node-lab-02 ... failed
2011-08-04 18:26:05: Running as user grid: /u01/app/11.2.0/grid/bin/cluutil -ckpt -oraclebase /u01/app/oracle -writeckpt -name ROOTCRS_NODECONFIG -state FAIL
2011-08-04 18:26:05: s_run_as_user2: Running /bin/su grid -c ' /u01/app/11.2.0/grid/bin/cluutil -ckpt -oraclebase /u01/app/oracle -writeckpt -name ROOTCRS_NODECONFIG -state FAIL '
2011-08-04 18:26:05: Removing file /tmp/file7zp6I4
2011-08-04 18:26:05: Successfully removed file: /tmp/file7zp6I4
2011-08-04 18:26:05: /bin/su successfully executed

 PRCR-1013 : Failed to start resource ora.net1.network PRCR-1064 :
 Failed to start resource ora.net1.network on node node-lab-02
 CRS-2674: Start of 'ora.net1.network' on 'node-lab-02' failed
 PRCR-1013 : Failed to start resource ora.ons
 PRCR-1064 : Failed to start resource ora.ons on node node-lab-02
 CRS-2674: Start of 'ora.net1.network' on 'node-lab-02' failed

GSD exists

[
Diag:
====
root@node-lab-02 ~]# cd /u01/app/11.2.0/grid/bin
[root@node-lab-02 bin]# ./srvctl config nodeapps
Network exists: 1/10.10.13.0/255.255.255.0/eth6, type static
VIP exists: /node-lab-01-vip/10.100.130.61/10.10.13.0/255.255.255.0/eth6, hosting node node-lab-01
VIP exists: /node-lab-02-vip/10.100.130.62/10.10.13.0/255.255.255.0/eth6, hosting node node-lab-02
GSD exists
ONS exists: Local port 6100, remote port 6200, EM port 2016
[root@node-lab-02 bin]#
[root@node-lab-02 bin]# ./oifcfg getif
eth5  10.10.24.0  global  cluster_interconnect
eth6  10.10.13.0  global  public
Node-2
[root@node-lab-02 bin]# ./oifcfg iflist -p -n
eth0  10.10.20.0  PRIVATE  255.255.255.0
eth5  10.10.24.0  PRIVATE  255.255.255.0
eth5  169.210.10.11  UNKNOWN  255.255.0.0
eth6  10.10.13.0  PRIVATE  255.255.255.192 --> this is the issue
[root@node-lab-02 bin]#

Node-1
[root@node-lab-01 bin]# ./oifcfg iflist -p -n
eth0  10.10.20.0  PRIVATE  255.255.255.0
eth5  10.10.24.0  PRIVATE  255.255.255.0
eth5  169.210.10.11  UNKNOWN  255.255.0.0
eth6  10.10.13.0  PRIVATE  255.255.255.0 --> this is the issue.
[root@node-lab-01 bin]#

Solution :
========

Correct the network configuration in CRS

cd <GRID_HOME>/bin

./srvctl modify nodeapps -n node-lab-02 -A node-lab-01-vip/255.255.255.192/eth6

Then run the root.sh in node-2

No comments: