昨晚一套核心庫的一個節點宕掉,然後reboot了,
在alert裡面發現如下資訊:
- Thu Jul 5 03:03:50 2012
- Errors in file /u01/Oracle/app/oracle/admin/crm/bdump/crm1_dbw9_14426.trc:
- ORA-07445: exception encountered: core dump [kslgetl()+32] [SIGSEGV] [Address not mapped to object] [0xF27D6E41F020E369] [] []
- Thu Jul 5 03:03:54 2012
- Errors in file /u01/oracle/app/oracle/admin/crm/bdump/crm1_pmon_14238.trc:
- ORA-00471: DBWR process terminated with error
- Thu Jul 5 03:03:54 2012
- Errors in file /u01/oracle/app/oracle/admin/crm/bdump/crm1_ckpt_14549.trc:
- ORA-00471: DBWR process terminated with error
- Thu Jul 5 03:03:54 2012
- Errors in file /u01/oracle/app/oracle/admin/crm/bdump/crm1_lgwr_14540.trc:
- ORA-00471: DBWR process terminated with error
- Thu Jul 5 03:03:54 2012
- Errors in file /u01/oracle/app/oracle/admin/crm/bdump/crm1_lms0_14351.trc:
- ORA-00471: DBWR process terminated with error
- Thu Jul 5 03:03:54 2012
- PMON: terminating instance due to error 471
- Thu Jul 5 03:03:58 2012
- Shutting down instance (abort)
- License high water mark = 3882
- Thu Jul 5 03:04:00 2012
- Instance terminated by PMON, pid = 14238
- Thu Jul 5 03:04:03 2012
- Instance terminated by USER, pid = 14287
- Thu Jul 5 03:04:30 2012
- Starting ORACLE instance (normal)
出現ORA-07445,第一反應就是去看trace,以下是call stack trace:
kslgetl>kclmvreqbg>kclrwrite>kcbbxsv>kcbb_coalesce>kcbbwlru>
在metalink上發現在10.2.0.3上的類似bug,5932514,5879114,最後根據call stack trace資訊發現和unpublished的bug:4637902的描述更靠近。