ASH to the rescue... for now...

After 4 unexplained crashes and a stalled Sev 1 SR I was pulling my hair out. Particularly vexing was the fact that a user was running a distributed query which was crashing a remote database.

I isolated the 4 crash times for the remote database and I wanted to see what users and SQLs were running on the local database up until the crash time. So for a crash happening around 11:16 I would use the following query. Note that access to these queries requires licensing of Diagnostic and Tuning packs :

SELECT DISTINCT SQL_ID, 
       (SELECT USERNAME FROM DBA_USERS WHERE USER_ID=DBA_HIST_ACTIVE_SESS_HISTORY.USER_ID) AS USERNAME 
  FROM dba_hist_active_sess_history 
 WHERE sample_time between to_date('2017-02-21 11:15','YYYY-MM-DD HH24:MI') 
       AND to_date('2017-02-21 11:16','YYYY-MM-DD HH24:MI') ORDER BY 2;

With a minute range I was guaranteed to get at least a few samples of what was going on before the crash. The DISTINCT filters out any duplicates so I'm left with a list of active SQL_IDs and their users.

After running this and adjusting times to coincide with the 4 crashes I was able to find 1 common user and 3 distinct SQL_IDs across the 4 crashes.

I dove into the three SQL_IDs using this:

select * From dba_hist_sqltext where SQL_ID='sql_id_here';

And discovered they were all a very similar format with a huge "IN" dominating the WHERE clause. At least while we work the SR for a final solution we know what SQL _not_ to run. Reminds me of a Henny Youngman quote:

Tags: 

Add new comment