为测试一个迁移方案,装了一套10g rac环境,可能是很久没有装过10g的RAC了,整个过程情况不断。
1.在把集群软件和数据库软件都装好之后,用crs_stat检测状态的时候,发现vip的状态不对,ping vipIP不通
[root@rac10g01 ~]# cd $CRS_HOME
[root@rac10g01 crs_1]# crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora....g01.gsd application ONLINE ONLINE rac10g01
ora....g01.ons application ONLINE ONLINE rac10g01
ora....g01.vip application ONLINE OFFLINE
ora....g02.gsd application ONLINE ONLINE rac10g02
ora....g02.ons application ONLINE ONLINE rac10g02
ora....g02.vip application ONLINE OFFLINE
2.随后用命令,crs_start去启动vip,也报错:
[root@rac10g01 crs_1]# crs_start ora.rac10g01.vip
Attempting to start `ora.rac10g01.vip` on member `rac10g01`
Start of `ora.rac10g01.vip` on member `rac10g01` failed.
Attempting to start `ora.rac10g01.vip` on member `rac10g02`
Start of `ora.rac10g01.vip` on member `rac10g02` failed.
CRS-1006: No more members to consider
CRS-0215: Could not start resource 'ora.rac10g01.vip'.
3.查看了下vip的日志:$CRS_HOME/log/$hostname/racg/ora.rac10g01.vip.log,里面报错:
2015-11-18 10:53:18.576: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: Wed Nov 18 10:53:18 CST 2015 [ 22932 ] /sbin/mii-tool eth0 error
Wed Nov 18 10:53:18 CST 2015 [ 22932 ] defaultgw: started
Wed Nov 18 10:53:18 CST 2015 [ 22932 ] defaultgw: completed with
checkIf: Default gateway is not defined (host=rac10g01)
2015-11-18 10:53:18.576: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: Interface eth0 checked failed (host=rac10g01)
Wed Nov 18 10:53:18 CST 2015 [ 22932 ] checkIf: end for if=eth0
Invalid parameters, or failed to bring up VIP (host=rac10g01)
2015-11-18 10:53:18.576: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: clsrcexecut: env ORACLE_CONFIG_HOME=/u01/app/oracle/crs_1
2015-11-18 10:53:18.576: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: clsrcexecut: cmd = /u01/app/oracle/crs_1/bin/racgeut -e _USR_ORA_DEBUG=5 54 /u01/app/
oracle/crs_1/bin/racgvip start rac10g01
2015-11-18 10:53:18.577: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: clsrcexecut: rc = 1, time = 3.270s
2015-11-18 10:53:21.731: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: Wed Nov 18 10:53:18 CST 2015 [ 23007 ] Broadcast = 192.168.146.255
Wed Nov 18 10:53:18 CST 2015 [ 23007 ] Checking interface existance
Wed Nov 18 10:53:18 CST 2015 [ 23007 ] Calling getifbyip
2015-11-18 10:53:21.731: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: Wed Nov 18 10:53:18 CST 2015 [ 23007 ] getifbyip: started for 192.168.146.13
Wed Nov 18 10:53:18 CST 2015 [ 23007 ] getifbyip: returning IP
Wed Nov 18 10:53:18 CST 2015 [ 23007 ] Completed getifbyip
2015-11-18 10:53:21.732: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: Wed Nov 18 10:53:18 CST 2015 [ 23007 ] Calling getifbyip -a
Wed Nov 18 10:53:18 CST 2015 [ 23007 ] getifbyip: started for 192.168.146.13
Wed Nov 18 10:53:18 CST 2015 [ 23007 ] getifbyip: returning IP
2015-11-18 10:53:21.732: [ RACG][1470001856] [22927][1470001856][ora.rac10g01.vip]: Wed Nov 18 10:53:18 CST 2015 [ 23007 ] Completed getifbyip
Wed Nov 18 10:53:18 CST 2015 [ 23007 ] ping_vip 192.168.146.13 started
Wed Nov 18 10:53:21 CST 2015 [ 23007 ] ping_vip: 192.168.146.13 is not pingable, _count = 1
4.在log里面看到vip也显示的ping不通。debug一下启动的过程:
[root@rac10g01 bin]# crsctl debug log res "ora.rac10g01.vip:5"
Set Resource Debug Module: ora.rac10g01.vip Level: 5
[root@rac10g01 bin]# crs_start ora.rac10g01.vip
Attempting to start `ora.rac10g01.vip` on member `rac10g01`
Start of `ora.rac10g01.vip` on member `rac10g01` failed.
Attempting to start `ora.rac10g01.vip` on member `rac10g02`
Start of `ora.rac10g01.vip` on member `rac10g02` failed.
CRS-1006: No more members to consider
CRS-0215: Could not start resource 'ora.rac10g01.vip'.
在log里面显示:
2015-11-18 11:01:34.258: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: clsrcgetprsrctx: all 0.020s
2015-11-18 11:01:34.279: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: clsrcnodeapp: prsr num_env = 0
2015-11-18 11:01:34.279: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: clsrcnodeapp: setting ORACLE_CONFIG_HOME=/u01/app/oracle/crs_1
2015-11-18 11:01:40.495: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: Wed Nov 18 11:01:34 CST 2015 [ 26213 ] Broadcast = 192.168.146.255
Wed Nov 18 11:01:34 CST 2015 [ 26213 ] Checking interface existance
Wed Nov 18 11:01:34 CST 2015 [ 26213 ] Calling getifbyip
2015-11-18 11:01:40.496: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: Wed Nov 18 11:01:34 CST 2015 [ 26213 ] getifbyip: started for 192.168.146.13
Wed Nov 18 11:01:34 CST 2015 [ 26213 ] getifbyip: returning IP eth0:1
Wed Nov 18 11:01:34 CST 2015 [ 26213 ] Completed getifbyip eth0:1
2015-11-18 11:01:40.496: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: Wed Nov 18 11:01:34 CST 2015 [ 26213 ] Calling getifbyip -a
Wed Nov 18 11:01:34 CST 2015 [ 26213 ] getifbyip: started for 192.168.146.13
Wed Nov 18 11:01:34 CST 2015 [ 26213 ] getifbyip: returning IP eth0:1
2015-11-18 11:01:40.496: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: Wed Nov 18 11:01:34 CST 2015 [ 26213 ] Completed getifbyip eth0:1
Wed Nov 18 11:01:34 CST 2015 [ 26213 ] Completed with initial interface test
Wed Nov 18 11:01:34 CST 2015 [ 26213 ] checkIf: start for if=eth0
2015-11-18 11:01:40.496: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: Wed Nov 18 11:01:34 CST 2015 [ 26213 ] /sbin/mii-tool eth0 error
ping to 192.168.146.1 via eth0 failed, rc = 1 (host=rac10g01)
ping to 192.168.146.1 via eth0 failed, rc = 1 (host=rac10g01)
2015-11-18 11:01:40.496: [ RACG][662033088] [26208][662033088][ora.rac10g01.vip]: Wed Nov 18 11:01:40 CST 2015 [ 26213 ] checkIf: RX packets checked if=eth0 OK
Wed Nov 18 11:01:40 CST 2015 [ 26213 ] checkIf: end for if=eth0
5.看到ping网关的时候都ping不通,检查一下racgvip里面的DEFAULTGW设置
发现DEFAULTGW=
没有做设置。手工设置一下DEFAULTGW=192.168.146.1,在第二个节点也修改一下。两个节点都修改好之后,重启vip的资源,success了。
[root@rac10g01 bin]# crs_start ora.rac10g01.vip
Attempting to start `ora.rac10g01.vip` on member `rac10g01`
Start of `ora.rac10g01.vip` on member `rac10g01` succeeded.
[root@rac10g01 bin]#
[root@rac10g01 bin]# crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora....g01.gsd application ONLINE ONLINE rac10g01
ora....g01.ons application ONLINE ONLINE rac10g01
ora....g01.vip application ONLINE ONLINE rac10g01
ora....g02.gsd application ONLINE ONLINE rac10g02
ora....g02.ons application ONLINE ONLINE rac10g02
ora....g02.vip application ONLINE OFFLINE
[root@rac10g01 bin]# crs_start ora.rac10g02.vip
Attempting to start `ora.rac10g02.vip` on member `rac10g02`
Start of `ora.rac10g02.vip` on member `rac10g02` succeeded.
[root@rac10g01 bin]# crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora....g01.gsd application ONLINE ONLINE rac10g01
ora....g01.ons application ONLINE ONLINE rac10g01
ora....g01.vip application ONLINE ONLINE rac10g01
ora....g02.gsd application ONLINE ONLINE rac10g02
ora....g02.ons application ONLINE ONLINE rac10g02
ora....g02.vip application ONLINE ONLINE rac10g02
[root@rac10g01 bin]#