In Oracle-L mailing list a question was asked about under which conditions can the explain plan report a wrong execution plan (not the one which was actually used when a problem happened).
I replied this with the following, but thought to show an example test case of this problem too:
The optimizer statistics the EXPLAIN PLAN ends up using are different from the statistics the other session ended up using
Explain plan does not use bind variable peeking, thus will not optimize for current bind variable values
Explain plan treats all bind variables as VARCHAR2, thus you ma have implicit datatype conversion happening during the plan execution, (meaning to_char,to_number functions are added around variables/columns) and this for example may make optimizer to ignore some indexes if you get unlucky.
Of course explain plan doesn’t really execute the plan so the implicit datatype conversion you see is in the explained plan only, but if you actually execute the statement (with correct bind datatypes) then there’s no implicit datatype conversion. And that’s where the difference comes from.
And here comes an example of condition number 3 above. Lets use a little bit of bad design out there and put numeric values into varchar2 columns:
SQL> create table t (id varchar2(10), name varchar2(100)); Table created. SQL> insert into t select to_char(object_id), object_name from dba_objects; 51449 rows created.
Now we add a little index for lookup performance and gather stats:
SQL> create index i on t(id); Index created. SQL> exec dbms_stats.gather_table_stats(user,'T',cascade=>true); PL/SQL procedure successfully completed.
Now lets define a bind variable of NUMBER type and set a value for it:
SQL> var x number SQL> SQL> exec :x:=99999 PL/SQL procedure successfully completed.
Now lets use “explain plan for” to estimate the execution plan:
SQL> explain plan for 2 select sum(length(name)) from t where id > :x; Explained. SQL> select * from table(dbms_xplan.display) ; PLAN_TABLE_OUTPUT ------------------------------------------------------------------------------------- Plan hash value: 3694077449 ------------------------------------------------------------------------------------- | Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time | ------------------------------------------------------------------------------------- | 0 | SELECT STATEMENT | | 1 | 29 | 56 (0)| 00:00:01 | | 1 | SORT AGGREGATE | | 1 | 29 | | | | 2 | TABLE ACCESS BY INDEX ROWID| T | 2572 | 74588 | 56 (0)| 00:00:01 | |* 3 | INDEX RANGE SCAN | I | 463 | | 3 (0)| 00:00:01 | ------------------------------------------------------------------------------------- Predicate Information (identified by operation id): --------------------------------------------------- 3 - access("ID">:X) 15 rows selected.
Explain plan command nicely reports that we’d be using an index range scan, which would be a good thing to do given my test data and search condition.
Now lets actually run the statement and see the REAL execution plan actually used for the execution. I’ll use dbms_xplan.display_CURSOR
for this. If you don’t pass the SQL_ID and SQL child number into that function it will just report the last SQL statement executed in your current session. But the key difference between the dbms_xplan.DISPLAY
and DISPLAY_CURSOR
is that the latter goes to library cache and fetches the actual SQL plan used from there. The explain plan command just reparses the SQL statement and estimates a plan using current optimizer statistics and environment, ignoring any bind variable values and assuming that all bind variables are of type VARCHAR2:
SQL> select sum(length(name)) from t where id > :x; SUM(LENGTH(NAME)) ----------------- SQL> select * from table(dbms_xplan.display_cursor); PLAN_TABLE_OUTPUT ---------------------------------------------------------------------------------- SQL_ID 7zm570j6kj597, child number 0 ------------------------------------- select sum(length(name)) from t where id > :x Plan hash value: 2966233522 --------------------------------------------------------------------------- | Id | Operation | Name | Rows | Bytes | Cost (%CPU)| Time | --------------------------------------------------------------------------- | 0 | SELECT STATEMENT | | | | 60 (100)| | | 1 | SORT AGGREGATE | | 1 | 29 | | | |* 2 | TABLE ACCESS FULL| T | 2572 | 74588 | 60 (5)| 00:00:01 | --------------------------------------------------------------------------- Predicate Information (identified by operation id): --------------------------------------------------- 2 - filter(TO_NUMBER("ID")>:X) 19 rows selected.
We actually used a full table scan!
Also check this related article by Kerry Osborne.