首页 科技问答 暴力删除后ORACLE GRID排障安装

暴力删除后ORACLE GRID排障安装

科技问答 264
1691604734,

组网及说明

实验目标:

环境:

虚机: VIRTUAL BOX 6.0.7  LCPU 4; MEM 4G  DISK :100G

OS: RHEL 7.3

DB: ORACLE EE Database 12.2.0.1  ORACLE Grid Infrastructure 12.1.0.1

实验目标:因为 DB 不能向下兼容 ORACLE GI , 所以需要把 GI升级为 12.2.0.1

介质: p33583921_122010_Linux-x86-64

告警信息

方法 1

升级的方式,使用 upgrade方式,结果预检显示需要空间远超预期,放弃。

方法 2

暴力删除 GI 12.1.0.1;直接安装 GI 12.2.0.1。一路坎坷。

问题描述

各种报错与版本不符

过程分析

过程见解决方法

解决方法

1. 展开 p33583921_122010_Linux-x86-64

2. 运行 gridSetup.sh

自检报了几个无关痛痒的 warning,忽略。

问题 1

运行安装,结果既不让安装也不让升级。提示系统已经存在一个(暴力删除的结果)。

解决:直接删除了 /etc/oracle/*.loc *.orig

 

然后可以选择安装模式 ,进入 setup程序。很顺利没有编译错误,但是在最后运行 root.sh时出现悲剧问题。

问题 2:运行 root.sh

现象:超慢,然后挂起,报错。

CRS-4123: Oracle High Availability Services has been started.

CLSRSC-400: A system reboot is required to continue installinsag

翻看 gridsetup的安装日志,在 2千多行中发现下面这个

modprobe: FATAL: Module oracleoks not found.

解决:

尝试使用 /sbin/modprobe -v oracleoks

测试失败

提示寻找如下 lib失败

oracleacfs.ko  oracleadvm.ko  oracleoks.ko

找了一下有,但是存于

/lib/modules/3.10.0-948.el7.x86_64/extra 目录下 ,既然有为什么会这样呢?

冥思苦想 3.10.0-948.el7.x86_64这个目录应该是个 release版本号

但是当前 release

[root@localhost extra]# uname -r

3.10.0-957.el7.x86_64

于是复制到 957的路径下去重 run,脚本可以继续向前走。果然是开源,毫无道理可言。

问题 3/root.sh虽然往前走了,但是继续报错
Installing Trace File Analyzer
CRS-4046: Invalid Oracle Clusterware configuration.
CRS-4000: Command Create failed, or completed with errors.

Failure initializing entries in /etc/oracle/scls_scr/localhost
/u01/12.2.0/grid/perl/bin/perl –I /u01/12.2.0/grid/perl/lib  –I /u01/12.2.0/grid/crs/install  /oracle/app/12.2.0/grid/crs/install/rootcrs.pl execution failed

解决:

删掉已有的 HAS配置,再做重配,报错。

# /crs/install/roothas.pl -deconfig -force –verbose

报出一堆错误,源码错。

换成

/crs/install/roothas.sh –deconfig –force –verbos

通过,不愧是开源。

 

问题4再往后显示成功,但是HAS 服务依然起不来。Setup完成不了,ASM实例无效。

[grid@localhost ~]$ crsctl stat  res -t

--------------------------------------------------------------------------------

Name           Target  State        Server                   State details

 

ora.ons         OFFLINE  ONLINE       localhost                STABLE

ora.cssd        OFFLINE  OFFLINE       localhost                STABLE

ora.diskmon     OFFLINE OFFLINE                               STABLE

 

检查配置

[grid@localhost ~]$ crsctl status  resource ora.cssd -p

NAME=ora.cssd

TYPE=ora.cssd.type

ACL=owner:grid:rwx,pgrp:oinstall:rwx,other::r--

AGENT_FILENAME=%CRS_HOME%/bin/cssdagent%CRS_EXE_SUFFIX%

AGENT_HB_INTERVAL=0

AGENT_HB_MISCOUNT=10

AUTO_START=never

CSS_CRITICAL=no

。。。。。。。。。。。。。。

 

运行命令:

crsctl modify resource ora.cssd -attr "AUTO_START=always"

CRS-4995:  The command 'Modify  resource' is invalid in crsctl. Use srvctl for this command.

 

但是在 STAND ALONG模式下,从不用 srvctl,于是直接强制改。

crsctl modify resource ora.cssd -attr "AUTO_START=always" –unsupported

[grid@localhost ~]$ crsctl status  resource ora.cssd -p

NAME=ora.cssd

TYPE=ora.cssd.type

ACL=owner:grid:rwx,pgrp:oinstall:rwx,other::r--

AGENT_FILENAME=%CRS_HOME%/bin/cssdagent%CRS_EXE_SUFFIX%

AUTO_START=always

CARDINALITY=1

 

 [grid@localhost ~]$ crsctl stop has

CRS-4133: Oracle High Availability Services has been stopped.

[grid@localhost ~]$ crsctl start has

CRS-4123: Oracle High Availability Services has been started.

[grid@localhost ~]$ crsctl stat  res -t

--------------------------------------------------------------------------------

Name           Target  State        Server                   State details

ora.ons         ONLINE  ONLINE       localhost                STABLE

ora.cssd        ONLINE  ONLINE       localhost                STABLE

ora.evmd       online  INTERMEDIATE localhost                STABLE

           服务启动

问题5配置ASMCA 失败

[main] [ 2023-04-10 17:15:36.919 EDT ] [OracleHome.getVersion:1086]  Current Version From Inventory: null

[main] [ 2023-04-10 17:15:36.919 EDT ] [OracleHome.getVersion:1018]  OracleHome.getVersion called.  Current Version: null

[main] [2023-04-10 17:15:36.921 EDT] [InventoryUtil.getOUIInvSession:349]  setting OUI READ level to ……

[main] [ 2023-04-10 17:15:36.921 EDT ] [OracleHome.getVersion:1037]  Homeinfo /u01/app/grid/product/12.1.0/grid,1

[main] [ 2023-04-10 17:15:36.921 EDT ] [OracleHome.getVersion:1086]  Current Version From Inventory: null

[main] [ 2023-04-10 17:15:36.921 EDT ] [SQLPlusEngine.getCmmdParams:226]  m_home null

[main] [ 2023-04-10 17:15:36.921 EDT ] [SQLPlusEngine.getCmmdParams:227]  version > 112 false

[main] [ 2023-04-10 17:15:36.921 EDT ] [SQLEngine.getEnvParams:607]  Default NLS_LANG: AMERICAN_AMERICA.AL32UTF8

[main] [ 2023-04-10 17:15:36.921 EDT ] [SQLEngine.getEnvParams:617]  NLS_LANG: AMERICAN_AMERICA.AL32UTF8

[main] [ 2023-04-10 17:15:36.922 EDT ] [SQLEngine.initialize:363]  Execing SQLPLUS/SVRMGR process...

[Finalizer] [ 2023-04-10 17:15:36.922 EDT ] [Util.finalize:134]  Util: finalized called for oracle.ops.mgmt.has.Util@6d9b5a4d

[main] [2023-04-10 17:15:36.932 EDT ] [UsmcaLogger.logException:186]  SEVERE:method oracle.sysman.assistants.usmca.backend.USMInstance:findOSAuthentication

[main] [ 2023-04-10 17:15:36.933 EDT ] [UsmcaLogger.logException:187]  There is an error in creating the following process:

/u01/app/grid/product/12.1.0/grid/bin/sqlplus -S /NOLOG

The error is:

Cannot run program "/u01/app/grid/product/12.1.0/grid/bin/sqlplus": error=2, No such file or directory

Cannot run program "/u01/app/grid/product/12.1.0/grid/bin/sqlplus": error=2,

查找相关文件/u01/app/oraInventory/ContentsXML

Cat inventory.XML

<HOME_LIST>

<HOME NAME="OraGI12Home1" LOC="/u01/app/grid/product/12.1.0/grid" TYPE="O" IDX="1" CRS="true"/>

<HOME NAME="OraDB12Home1" LOC="/u01/app/oracle/product/12.2.0/dbhome_1" TYPE="O" IDX="2"/>

<HOME NAME="OraDB12Home2" LOC="/u01/app/oracle/product/12.2.0/dbhome" TYPE="O" IDX="3"/>

<HOME NAME="OraGI12Home2" LOC="/trans" TYPE="O" IDX="4" CRS="true"/>

/etc/oratab

orcl:/u01/app/oracle/product/12.2.0/dbhome:N

# +ASM:/u01/app/grid/product/12.1.0/grid:N              # line added by Agent  à 注销

+ASM:/trans:N  -> 添加

 

问题 6 : 建库成功后,虽然不妨碍使用但是没有磁盘组信息是不完整的。

[grid@localhost ~]$ srvctl config database -db test

Database unique name: test

Database name: test

Oracle home: /u01/app/oracle/product/12.2.0/dbhome

Oracle user: oracle

Spfile: +DATA/TEST/PARAMETERFILE/spfile.274.1133872707

Password file:

Domain:

Start options: open

Stop options: immediate

Database role: PRIMARY

Management policy: AUTOMATIC

Disk Groups:

Services:

OSDBA group: oinstall

OSOPER group:

Database instance: test

 

[grid@localhost ~]$ srvctl add asm

[grid@localhost ~]$ srvctl config asm

ASM home: <CRS home>

Password file:

Backup of Password file:

ASM listener:

Spfile:

ASM diskgroup discovery string: ++no-value-at-resource-creation--never-updated-through-ASM++

[grid@localhost ~]$ srvctl config asm

ASM home: <CRS home>

Password file:

Backup of Password file:

ASM listener:

Spfile:

ASM diskgroup discovery string: ++no-value-at-resource-creation--never-updated-through-ASM++

[grid@localhost ~]$ srvctl config database -db test

Database unique name: test

Database name: test

Oracle home: /u01/app/oracle/product/12.2.0/dbhome

Oracle user: oracle

Spfile: +DATA/TEST/PARAMETERFILE/spfile.274.1133872707

Password file:

Domain:

Start options: open

Stop options: immediate

Database role: PRIMARY

Management policy: AUTOMATIC

Disk Groups:

Services:

OSDBA group: oinstall

OSOPER group:

Database instance: test

[grid@localhost ~]$ crsctl status  res -t

--------------------------------------------------------------------------------

Name           Target  State        Server                   State details

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.LISTENER1.lsnr

               ONLINE  ONLINE       localhost                STABLE

ora.asm

               OFFLINE OFFLINE      localhost                STABLE

ora.ons

               ONLINE  ONLINE       localhost                STABLE

--------------------------------------------------------------------------------

Cluster Resources

--------------------------------------------------------------------------------

ora.cssd

      1        ONLINE  ONLINE       localhost                STABLE

ora.diskmon

      1        OFFLINE OFFLINE                               STABLE

ora.evmd

      1        ONLINE  ONLINE       localhost                STABLE

ora.test.db

      1        ONLINE  ONLINE       localhost                Open,HOME=/u01/app/o

                                                             racle/product/12.2.0

                                                             /dbhome,STABLE

--------------------------------------------------------------------------------

 [grid@localhost ~]$ crsctl start  res ora.asm –init

crsctl modify resource "ora.asm" -init -attr "AUTO_START=1"  -unsupported

CRS-2672: Attempting to start 'ora.asm' on 'localhost'

CRS-2676: Start of 'ora.asm' on 'localhost' succeeded

 [grid@localhost ~]$ srvctl config database -db test

Database unique name: test

Database name: test

Oracle home: /u01/app/oracle/product/12.2.0/dbhome

Oracle user: oracle

Spfile: +DATA/TEST/PARAMETERFILE/spfile.274.1133872707

Password file:

Domain:

Start options: open

Stop options: immediate

Database role: PRIMARY

Management policy: AUTOMATIC

Disk Groups: DATA

Services:

OSDBA group: oinstall

OSOPER group:

Database instance: test

[grid@localhost ~]$ crsctl stat res -t

--------------------------------------------------------------------------------

Name           Target  State        Server                   State details

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.DATA.dg

               ONLINE  ONLINE       localhost                STABLE

ora.LISTENER1.lsnr

               ONLINE  ONLINE       localhost                STABLE

ora.asm

               ONLINE  ONLINE       localhost                Started,STABLE

ora.ons

               ONLINE  ONLINE       localhost                STABLE

ora.cssd

      1        ONLINE  ONLINE       localhost                STABLE

ora.diskmon

      1        OFFLINE OFFLINE                               STABLE

ora.evmd

      1        ONLINE  ONLINE       localhost                STABLE

ora.test.db

      1        ONLINE  ONLINE       localhost                Open,HOME=/u01/app/o

                                                             racle/product/12.2.0

                                                             /dbhome,STABLE

 

问题 7 SRVCTL CONFIG ASM 无信息

[grid@localhost ~]$ srvctl config asm

ASM home: <CRS home>

Password file:

Backup of Password file:

ASM listener:

Spfile:

ASM diskgroup discovery string : ++no-value-at-resource-creation--never-updated-through-ASM++

内容来源:知了社区,基于知识共享署名-相同方式共享3.0中国大陆许可协议
CRM论坛(CRMbbs.com)——一个让用户更懂CRM的垂直性行业内容平台,CRM论坛致力于互联网、客户管理、销售管理、SCRM私域流量内容输出5年。 如果您有好的内容,欢迎向我们投稿,共建CRM多元化生态体系,创建CRM客户管理一体化生态解决方案。,暴力删除后ORACLE GRID排障安装