An example of an abnormal Oracle database restart caused by VCS

Source: Internet
Author: User

1. Environment Description

Operating system version: SUSE Linux Enterprise Server 10 SP2 (x86_64)

Database version: Oracle 11.1.0.7.16

VCS version: 5.1

Blog: http://blog.csdn.net/hw_libo/article/details/41171561


2. Symptoms and Analysis

In the early hours of the morning, an SMS alert reported an exception on the database.

(1) View the database alert log

Sun Nov 16 03:46:51 2014
Stopping background process SMCO
Stopping background process FBDA
Shutting down instance: further logons disabled
Sun Nov 16 03:46:54 2014
kkjcre1p: unable to spawn jobq slave process, slot 0, error 1089
Stopping background process CJQ0
Stopping background process QMNC
Stopping background process MMNL
Stopping background process MMON
Sun Nov 16 03:47:03 2014
Shutting down instance (immediate)
License high water mark = 366
Sun Nov 16 03:51:52 2014
License high water mark = 366
USER (ospid: 8654): terminating the instance
Sun Nov 16 03:51:52 2014
ORA-1092 : opiodr aborting process unknown ospid (28656_47367034883680)
Sun Nov 16 03:51:52 2014
LGWR waiting for instance termination

The alert log shows the instance being shut down, but no obvious cause for it.

I also checked the database reports for that time period: the database was under little load and there was no abnormal SQL.


(2) Check OS resource usage


System resource usage was normal, and I/O was not high.


(3) View the messages log

Nov 16 03:42:46 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13027 Thread(4149214112) Resource(ora_vg) - monitor procedure did not complete within the expected time.
Nov 16 03:42:46 boscodb Had[10258]: VCS ERROR V-16-2-13027 (boscodb) Resource(ora_vg) - monitor procedure did not complete within the expected time.
Nov 16 03:43:01 boscodb su: (to root) root on none
……
Nov 16 03:45:39 boscodb su: (to oracle) root on none
Nov 16 03:45:47 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13210 Thread(4154813344) Agent is calling clean for resource(ora_vg) because 4 successive invocations of the monitor procedure did not complete within the expected time.
Nov 16 03:45:47 boscodb Had[10258]: VCS ERROR V-16-2-13210 (boscodb) Agent is calling clean for resource(ora_vg) because 4 successive invocations of the monitor procedure did not complete within the expected time.
……
Nov 16 03:46:48 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13006 Thread(4153760672) Resource(ora_vg): clean procedure did not complete within the expected time.
Nov 16 03:46:48 boscodb Had[10258]: VCS ERROR V-16-2-13006 (boscodb) Resource(ora_vg): clean procedure did not complete within the expected time.
Nov 16 03:46:49 boscodb AgentFramework[10899]: VCS ERROR V-16-2-13067 Thread(4155194272) Agent is calling clean for resource(ora_archive) because the resource became OFFLINE unexpectedly, on its own.
Nov 16 03:46:50 boscodb Had[10258]: VCS ERROR V-16-2-13067 (boscodb) Agent is calling clean for resource(ora_archive) because the resource became OFFLINE unexpectedly, on its own.
Nov 16 03:46:51 boscodb AgentFramework[10899]: VCS ERROR V-16-2-13068 Thread(4155194272) Resource(ora_archive) - clean completed successfully.
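The messages log shows that VCS hit an error: the monitor procedure for the vg resource (ora_vg) timed out. After four successive monitor timeouts the agent called clean for ora_vg, the clean procedure also timed out, and at 03:46:49 the ora_archive resource went offline, just before the alert log shows the instance shutting down at 03:46:51. The same error messages are also recorded in the VCS logs.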
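
When the messages log is long, it can help to pull out only the VCS errors and group them by resource. The following is a minimal sketch, not part of the original troubleshooting: the function name summarize_vcs_errors and the regular expression are illustrative assumptions based only on the line format shown in the excerpt above, and the sample list is a subset of those lines.

```python
import re
from collections import Counter

# Pattern written against the excerpt above; other VCS versions or syslog
# formats may need a different expression (assumption).
VCS_LINE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+\S+\s+"
    r"(?P<proc>\w+)\[\d+\]:\s+VCS ERROR\s+(?P<msgid>V-\d+-\d+-\d+)\s+"
    r".*?[Rr]esource\((?P<res>\w+)\)"
)


def summarize_vcs_errors(lines):
    """Count VCS ERROR messages per (resource, message ID).

    Only AgentFramework lines are counted, because in the excerpt Had
    logs a copy of each agent message.
    """
    counts = Counter()
    for line in lines:
        m = VCS_LINE.match(line)
        if m and m.group("proc") == "AgentFramework":
            counts[(m.group("res"), m.group("msgid"))] += 1
    return counts


# A subset of the lines from the messages log above.
sample = [
    "Nov 16 03:42:46 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13027 Thread(4149214112) Resource(ora_vg) - monitor procedure did not complete within the expected time.",
    "Nov 16 03:42:46 boscodb Had[10258]: VCS ERROR V-16-2-13027 (boscodb) Resource(ora_vg) - monitor procedure did not complete within the expected time.",
    "Nov 16 03:43:01 boscodb su: (to root) root on none",
    "Nov 16 03:45:47 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13210 Thread(4154813344) Agent is calling clean for resource(ora_vg) because 4 successive invocations of the monitor procedure did not complete within the expected time.",
    "Nov 16 03:46:48 boscodb AgentFramework[10894]: VCS ERROR V-16-2-13006 Thread(4153760672) Resource(ora_vg): clean procedure did not complete within the expected time.",
    "Nov 16 03:46:49 boscodb AgentFramework[10899]: VCS ERROR V-16-2-13067 Thread(4155194272) Agent is calling clean for resource(ora_archive) because the resource became OFFLINE unexpectedly, on its own.",
    "Nov 16 03:46:51 boscodb AgentFramework[10899]: VCS ERROR V-16-2-13068 Thread(4155194272) Resource(ora_archive) - clean completed successfully.",
]

for (res, msgid), n in sorted(summarize_vcs_errors(sample).items()):
    print(res, msgid, n)
```

On a real system the same function could be fed the lines of the messages file instead of the sample list.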


3. Solution

I checked the monitor script for the vg resource and found nothing wrong with it. The same script also runs without problems on the other databases, including under load.

The current configuration is:

        Application ora_vg (
                StartProgram = "/opt/BOSCO_ora/bin/vg_start.sh"
                StopProgram = "/opt/BOSCO_ora/bin/vg_stop.sh"
                CleanProgram = "/opt/BOSCO_ora/bin/vg_clean.sh"
                MonitorProgram = "/opt/BOSCO_ora/bin/vg_monitor.sh"
                MonitorInterval = 30
                MonitorTimeout = 30
                )
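
To relate these settings to the incident, the gaps between the logged events can be worked out from their timestamps. The sketch below only does that arithmetic; the timestamps are copied from the messages log and alert log, the helper name seconds_between is illustrative, and the reading in the note that follows is an interpretation, not something the logs state.

```python
from datetime import datetime


def seconds_between(start, end, fmt="%H:%M:%S"):
    """Return the whole seconds from start to end (same day assumed)."""
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return int(delta.total_seconds())


# Timestamps taken from the log excerpts in section 2.
events = [
    ("03:42:46", "monitor timeout logged for ora_vg (V-16-2-13027)"),
    ("03:45:47", "agent calls clean after 4 successive monitor timeouts (V-16-2-13210)"),
    ("03:46:48", "clean procedure times out (V-16-2-13006)"),
    ("03:46:51", "alert log: instance shutdown begins"),
]

first = events[0][0]
for ts, what in events:
    print(f"+{seconds_between(first, ts):>4d}s  {ts}  {what}")
```

About three minutes pass between the logged monitor timeout and the call to clean, and about one more minute before clean itself times out. With MonitorTimeout set to 30 s, a monitor script that is merely slow for a few minutes is enough to trigger this chain, which is why a longer timeout is a plausible fix.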

Either of the following methods can be used to increase the monitor timeout (MonitorTimeout) to 90 s:

(1) Edit main.cf directly. This requires restarting VCS, and no maintenance window was available.

(2) Modify the attribute dynamically with commands

haconf -makerw

hares -modify ora_vg MonitorTimeout 90 # adjust the monitor timeout to 90 s. The default value is 60 s.

haconf -dump
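
The dynamic change above can also be wrapped in a small script so that it is repeatable. This is a sketch under stated assumptions: the function names are hypothetical, the `hares -value` check and the `-makero` flag (which returns the configuration to read-only after the dump) are additions that the original steps do not include, and the script defaults to a dry run that only prints the commands.

```python
import subprocess


def timeout_change_commands(resource, seconds):
    """Return the VCS command sequence for changing MonitorTimeout."""
    return [
        ["haconf", "-makerw"],  # open the configuration for writing
        ["hares", "-modify", resource, "MonitorTimeout", str(seconds)],
        # Addition to the original steps: read the value back to confirm it.
        ["hares", "-value", resource, "MonitorTimeout"],
        # Addition to the original steps: -makero sets the configuration
        # back to read-only after it is written to disk.
        ["haconf", "-dump", "-makero"],
    ]


def apply(commands, dry_run=True):
    """Print the commands, or run them when dry_run is False."""
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)


apply(timeout_change_commands("ora_vg", 90))
```

Calling apply with dry_run=False would execute the commands on a cluster node; that should be done only after reviewing the printed sequence.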


-- Bosco QQ: 375612082

---- END ----
All rights reserved. Reprinting is allowed, but the source address must be indicated by a link; otherwise legal liability will be pursued.
