问题描述:在vmware workstation 9上搭建oracle 11g rac环境的时候,grid安装完毕,但是在执行crs_stat -t命令验证安装的时候,只能看见自己节点的信息:
我在执行数据库软件安装的时候,发现只能看见rac1的节点,rac2节点看不见
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 |
grid@rac2:~> crs_stat -t Name Type Target State Host ------------------------------------------------------------ ora.DATA.dg ora....up.type ONLINE ONLINE rac2 ora....N1.lsnr ora....er.type ONLINE ONLINE rac2 ora.asm ora.asm.type ONLINE ONLINE rac2 ora.cvu ora.cvu.type ONLINE ONLINE rac2 ora.gsd ora.gsd.type OFFLINE OFFLINE ora....network ora....rk.type ONLINE ONLINE rac2 ora.oc4j ora.oc4j.type ONLINE ONLINE rac2 ora.ons ora.ons.type ONLINE ONLINE rac2 ora....SM1.asm application ONLINE ONLINE rac2 ora.rac2.gsd application OFFLINE OFFLINE ora.rac2.ons application ONLINE ONLINE rac2 ora.rac2.vip ora....t1.type ONLINE ONLINE rac2 ora.scan1.vip ora....ip.type ONLINE ONLINE rac2 |
笔者这里安装了2个节点的rac,2个节点都只能看见自身的crs信息,之前的安装步骤都是执行成功的,于是重启2个节点是系统,尝试是否会正常,但想法太单纯了,当我自动重启2个节点的时候,发现最先启动的那个节点能看见crs信息,而另一个后启动的节点则出现错误:
RAC 2 系统先被拉起:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 |
grid@rac2:~> crs_stat -t Name Type Target State Host ------------------------------------------------------------ ora.DATA.dg ora....up.type ONLINE ONLINE rac2 ora....N1.lsnr ora....er.type ONLINE ONLINE rac2 ora.asm ora.asm.type ONLINE ONLINE rac2 ora.cvu ora.cvu.type ONLINE ONLINE rac2 ora.gsd ora.gsd.type OFFLINE OFFLINE ora....network ora....rk.type ONLINE ONLINE rac2 ora.oc4j ora.oc4j.type ONLINE ONLINE rac2 ora.ons ora.ons.type ONLINE ONLINE rac2 ora....SM1.asm application ONLINE ONLINE rac2 ora.rac2.gsd application OFFLINE OFFLINE ora.rac2.ons application ONLINE ONLINE rac2 ora.rac2.vip ora....t1.type ONLINE ONLINE rac2 ora.scan1.vip ora....ip.type ONLINE ONLINE rac2 |
RAC 1在RAC 2节点后启动:
1 2 3 4 5 6 7 |
grid@rac1:~> crsctl check crs CRS-4638: Oracle High Availability Services is online CRS-4535: Cannot communicate with Cluster Ready Services CRS-4530: Communications failure contacting Cluster Synchronization Services daemon CRS-4534: Cannot communicate with Event Manager grid@rac1:~> crs_stat -t CRS-0184: Cannot communicate with the CRS daemon. |
发现重启后,只要是后启动的那个节点,crs就启动不成功,然后查看一下节点的实例状态
RAC 1:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 |
grid@rac1:/opt/app/grid/11.2/log/rac1/crsd> ps -ef | grep asm grid 4786 1 0 11:00 ? 00:00:00 asm_pmon_+ASM2 grid 4791 1 0 11:00 ? 00:00:00 asm_psp0_+ASM2 grid 4797 1 2 11:00 ? 00:00:11 asm_vktm_+ASM2 grid 4804 1 0 11:00 ? 00:00:00 asm_gen0_+ASM2 grid 4809 1 0 11:00 ? 00:00:00 asm_diag_+ASM2 grid 4814 1 0 11:00 ? 00:00:00 asm_ping_+ASM2 grid 4819 1 0 11:00 ? 00:00:01 asm_dia0_+ASM2 grid 4824 1 0 11:00 ? 00:00:01 asm_lmon_+ASM2 grid 4829 1 0 11:00 ? 00:00:00 asm_lmd0_+ASM2 grid 4834 1 0 11:00 ? 00:00:01 asm_lms0_+ASM2 grid 4841 1 0 11:00 ? 00:00:00 asm_lmhb_+ASM2 grid 4846 1 0 11:00 ? 00:00:00 asm_mman_+ASM2 grid 4851 1 0 11:00 ? 00:00:00 asm_dbw0_+ASM2 grid 4856 1 0 11:00 ? 00:00:00 asm_lgwr_+ASM2 grid 4861 1 0 11:00 ? 00:00:00 asm_ckpt_+ASM2 grid 4866 1 0 11:00 ? 00:00:00 asm_smon_+ASM2 grid 4871 1 0 11:00 ? 00:00:00 asm_rbal_+ASM2 grid 4876 1 0 11:00 ? 00:00:00 asm_gmon_+ASM2 grid 4881 1 0 11:00 ? 00:00:00 asm_mmon_+ASM2 grid 4886 1 0 11:00 ? 00:00:00 asm_mmnl_+ASM2 grid 4891 1 0 11:00 ? 00:00:00 asm_lck0_+ASM2 |
RAC 2:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 |
grid@rac2:/opt/app/grid/11.2/log/rac2/crsd> ps -ef | grep asm grid 4301 1 0 10:54 ? 00:00:00 asm_pmon_+ASM1 grid 4306 1 0 10:54 ? 00:00:00 asm_psp0_+ASM1 grid 4311 1 0 10:54 ? 00:00:00 asm_vktm_+ASM1 grid 4318 1 0 10:54 ? 00:00:00 asm_gen0_+ASM1 grid 4323 1 0 10:54 ? 00:00:00 asm_diag_+ASM1 grid 4328 1 0 10:54 ? 00:00:00 asm_ping_+ASM1 grid 4333 1 0 10:54 ? 00:00:02 asm_dia0_+ASM1 grid 4338 1 0 10:54 ? 00:00:01 asm_lmon_+ASM1 grid 4343 1 0 10:54 ? 00:00:01 asm_lmd0_+ASM1 grid 4348 1 0 10:54 ? 00:00:02 asm_lms0_+ASM1 grid 4355 1 0 10:54 ? 00:00:00 asm_lmhb_+ASM1 grid 4360 1 0 10:54 ? 00:00:00 asm_mman_+ASM1 grid 4365 1 0 10:54 ? 00:00:00 asm_dbw0_+ASM1 grid 4370 1 0 10:54 ? 00:00:00 asm_lgwr_+ASM1 grid 4375 1 0 10:54 ? 00:00:00 asm_ckpt_+ASM1 grid 4380 1 0 10:54 ? 00:00:00 asm_smon_+ASM1 grid 4385 1 0 10:54 ? 00:00:00 asm_rbal_+ASM1 grid 4390 1 0 10:54 ? 00:00:00 asm_gmon_+ASM1 grid 4395 1 0 10:54 ? 00:00:00 asm_mmon_+ASM1 grid 4400 1 0 10:54 ? 00:00:00 asm_mmnl_+ASM1 grid 4405 1 0 10:54 ? 00:00:00 asm_lck0_+ASM1 grid 4453 1 0 10:54 ? 00:00:00 asm_asmb_+ASM1 grid 4458 1 0 10:54 ? 00:00:00 oracle+ASM1_asmb_+asm1 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq))) |
发现实例rac2实例怎么是asm1,2 个节点的实例交换了,然后关闭rac 2节点,重启rac 1节点
RAC 1:
1 2 3 4 5 6 7 8 9 10 |
grid@rac1:~> crs_stat -t CRS-0184: Cannot communicate with the CRS daemon. grid@rac1:~> crsctl check crs CRS-4638: Oracle High Availability Services is online CRS-4535: Cannot communicate with Cluster Ready Services CRS-4530: Communications failure contacting Cluster Synchronization Services daemon CRS-4534: Cannot communicate with Event Manager grid@rac1:~> ps -ef | grep asm grid 4596 4508 0 11:20 pts/0 00:00:00 grep asm |
发现rac 1节点的crs还是不能启动,此时安装后没有修改过任何东西,为撒rac 1就启动不来了呢?查看rac 1的实例
1 2 |
grid@rac1:~> echo $ORACLE_SID +ASM1 |
实例确实是+ASM1,但刚才看见RAC 2上注册了这个实例,怀疑2个节点实例混乱了,临时修改一下rac 1的实例为+ASM2
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 |
grid@rac1:~> export ORACLE_SID=+ASM2 grid@rac1:~> ps -ef | grep asm grid 4732 1 0 11:21 ? 00:00:00 asm_pmon_+ASM2 grid 4737 1 0 11:21 ? 00:00:00 asm_psp0_+ASM2 grid 4742 1 2 11:21 ? 00:00:02 asm_vktm_+ASM2 grid 4749 1 0 11:21 ? 00:00:00 asm_gen0_+ASM2 grid 4754 1 0 11:21 ? 00:00:00 asm_diag_+ASM2 grid 4759 1 0 11:21 ? 00:00:00 asm_ping_+ASM2 grid 4764 1 0 11:21 ? 00:00:00 asm_dia0_+ASM2 grid 4769 1 0 11:21 ? 00:00:00 asm_lmon_+ASM2 grid 4774 1 0 11:21 ? 00:00:00 asm_lmd0_+ASM2 grid 4779 1 0 11:21 ? 00:00:00 asm_lms0_+ASM2 grid 4786 1 0 11:21 ? 00:00:00 asm_lmhb_+ASM2 grid 4791 1 0 11:21 ? 00:00:00 asm_mman_+ASM2 grid 4796 1 0 11:21 ? 00:00:00 asm_dbw0_+ASM2 grid 4801 1 0 11:21 ? 00:00:00 asm_lgwr_+ASM2 grid 4806 1 0 11:21 ? 00:00:00 asm_ckpt_+ASM2 grid 4811 1 0 11:21 ? 00:00:00 asm_smon_+ASM2 grid 4816 1 0 11:21 ? 00:00:00 asm_rbal_+ASM2 grid 4821 1 0 11:21 ? 00:00:00 asm_gmon_+ASM2 grid 4826 1 0 11:21 ? 00:00:00 asm_mmon_+ASM2 grid 4831 1 0 11:21 ? 00:00:00 asm_mmnl_+ASM2 grid 4836 1 0 11:21 ? 00:00:00 asm_lck0_+ASM2 grid 4878 1 0 11:22 ? 00:00:00 asm_asmb_+ASM2 grid 4883 1 0 11:22 ? 00:00:00 oracle+ASM2_asmb_+asm2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq))) grid 4888 1 0 11:22 ? 00:00:00 asm_o000_+ASM2 grid 4893 1 0 11:22 ? 00:00:00 oracle+ASM2_o000_+asm2 (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq))) grid 5360 4508 0 11:23 pts/0 00:00:00 grep asm grid@rac1:~> crsctl check crs CRS-4638: Oracle High Availability Services is online CRS-4537: Cluster Ready Services is online CRS-4529: Cluster Synchronization Services is online CRS-4533: Event Manager is online grid@rac1:~> crs_stat -t Name Type Target State Host ------------------------------------------------------------ ora.DATA.dg ora....up.type ONLINE ONLINE rac1 ora....N1.lsnr ora....er.type ONLINE ONLINE rac1 ora.asm ora.asm.type ONLINE ONLINE rac1 ora.cvu ora.cvu.type ONLINE ONLINE rac1 ora.gsd ora.gsd.type OFFLINE OFFLINE ora....network ora....rk.type ONLINE ONLINE rac1 ora.oc4j ora.oc4j.type ONLINE ONLINE rac1 ora.ons ora.ons.type ONLINE ONLINE rac1 ora....SM2.asm application ONLINE ONLINE rac1 ora.rac1.gsd application OFFLINE OFFLINE ora.rac1.ons application ONLINE ONLINE rac1 ora.rac2.vip ora....t1.type ONLINE ONLINE rac1 ora.scan1.vip ora....ip.type ONLINE ONLINE rac1 |
然后发现rac 1的crs自动拉起来了,执行数据库软件安装,可以看见所有的节点了。
- 本文固定链接: http://www.savedba.com/?p=180
- 转载请注明: 版权所有,文章允许转载,但必须以链接方式注明源地址,否则追究法律责任!