Oracle: A Brief Look at the Index Skip Scan

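Oracle9i introduced a new feature: the index skip scan (SS). When a table has a composite index and a query filters on columns of that index other than the leading one, and the optimizer mode is CBO, the execution plan may use SS. You can also push the optimizer toward SS with the index_ss hint (under CBO).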


Example:

sql> create table test1 (a number,b char(10),c varchar2(10)); 
  Table created. 
  sql> create index test_idx1 on test1(a,b); 
  Index created. 
  sql> set autotrace on 
  sql> select /*+ index_ss(test1 test_idx1) */ * from test1 
  2 where b ='a'; 
  no rows selected 
  Execution Plan 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=2 Card=1 Bytes=32) 
  1 0 TABLE ACCESS (BY INDEX ROWID) OF 'TEST1' (Cost=2 Card=1 Bytes=32) 
  2 1 INDEX (SKIP SCAN) OF 'TEST_IDX1' (NON-UNIQUE)


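SS is not used in every situation, however. Besides requiring CBO and an analyzed table, the official Oracle documentation also says that the number of distinct values in the leading column must be very small. Here is the passage on SS taken from the official documentation: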

Index skip scans improve index scans by nonprefix columns since it is often faster to scan index blocks than scanning table data blocks.
  In this case a composite index is split logically into smaller subindexes. The number of logical subindexes depends on the cardinality of the initial column. Hence it is now possible to use the index even if the leading column is not used in a where clause.


Oracle has not published further internal details about SS. But note this sentence from the passage above:

In this case a composite index is split logically into smaller subindexes. The number of logical subindexes depends on the cardinality of the initial column.
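That is, Oracle logically splits the composite index into multiple subindexes. One way to read this: Oracle divides the index into a.num_distinct logical subindexes and scans one subindex at a time, so on this reading the index-scan cost of SS is roughly a.num_distinct.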
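The "one probe per logical subindex" reading can be sketched in a few lines of Python. This is only a toy model of the idea, not Oracle's implementation: the sorted list stands in for the leaf entries of the (a, b) index, and `skip_scan` and its variable names are made up for illustration. It also assumes the leading column holds integers.

```python
from bisect import bisect_left


def skip_scan(index_entries, b_value):
    """Toy skip scan over a sorted list of (a, b) keys.

    Returns (matching_entries, number_of_probes). Assumes integer `a` values.
    """
    matches = []
    probes = 0
    pos = 0
    n = len(index_entries)
    while pos < n:
        a = index_entries[pos][0]  # leading value of the current logical subindex
        probes += 1
        # Probe this subindex as if the query had said "a = <current> and b = :b".
        i = bisect_left(index_entries, (a, b_value), pos)
        while i < n and index_entries[i] == (a, b_value):
            matches.append(index_entries[i])
            i += 1
        # Skip ahead to the first entry with a larger leading value.
        pos = bisect_left(index_entries, (a + 1,), i)
    return matches, probes


# 36 distinct leading values, like the mod(i,36) experiment (fewer rows, for speed).
entries = sorted((i % 36, str(i)) for i in range(1, 10001))
matches, probes = skip_scan(entries, "500")
print(matches, probes)
```

The number of probes equals the number of distinct leading values, which is why a small a.num_distinct matters so much in this model.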


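The experiments below look at when Oracle actually chooses SS.
  First, the necessary conditions for SS:
  · The optimizer is CBO
  · The tables involved have accurate statistics
  · The Oracle database version is 9i or later
  On top of these there is one condition specific to SS: the number of distinct values in the leading column must be small enough. How small?
  Using the same table as above (skipping the tedious intermediate steps and testing just the two values on either side of the threshold):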

<strong><span style="color:#ff0000;">With 37 distinct values in the leading column:</span></strong>

sql> truncate table test1; 
  Table truncated. 
  sql> begin 
  2 for i in 1..100000 loop 
  3 insert into test1 values (mod(i,37),to_char(i),to_char(i)); 
  4 end loop; 
  5 commit; 
  6 end; 
  7 / 
  PL/SQL procedure successfully completed. 
  sql> analyze table test1 compute statistics; 
  Table analyzed. 
  sql> set autotrace on explain 
  sql> select * from test1 
  2 where b = '500'; 
  A B C 
  ---------- ---------- ---------- 
  19 500 500 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=37 Card=1 Bytes=17) 
  1 0 TABLE ACCESS (FULL) OF 'TEST1' (Cost=37 Card=1 Bytes=17)

<span style="color:#ff0000;"><strong>Now with 36 distinct values in the leading column:</strong></span>

sql> truncate table test1; 
  Table truncated. 
  sql> begin 
  2 for i in 1..100000 loop 
  3 insert into test1 values (mod(i,36),to_char(i),to_char(i)); 
  4 end loop; 
  5 commit; 
  6 end; 
  7 / 
  PL/SQL procedure successfully completed. 
  sql> analyze table test1 compute statistics; 
  Table analyzed. 
  sql> select * from test1 where b = '500'; 
  A B C 
  ---------- ---------- ---------- 
  32 500 500 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=12 Card=1 Bytes=17) 
  1 0 TABLE ACCESS (BY INDEX ROWID) OF 'TEST1' (Cost=12 Card=1 Bytes=17) 
  2 1 INDEX (SKIP SCAN) OF 'TEST_IDX1' (NON-UNIQUE) (Cost=37 Card=1)


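From the results above, the cost of the FTS (full table scan) is 37. When the number of distinct values in the leading column drops below that value, Oracle chooses SS.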
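The threshold seen in these two runs can be written down as a tiny decision rule. This is only the working model suggested by the experiment (skip scan cost taken as the number of distinct leading values, compared against the FTS cost). Oracle's real cost model is not published and likely takes more inputs, and the function name is hypothetical.

```python
def choose_access_path(leading_num_distinct, full_scan_cost):
    """Toy model of the observed threshold, not Oracle's actual costing."""
    # Assumption from the experiment: skip scan cost ~ distinct leading values.
    skip_scan_cost = leading_num_distinct
    if skip_scan_cost < full_scan_cost:
        return "INDEX SKIP SCAN"
    return "TABLE ACCESS FULL"


for ndv in (36, 37):
    print(ndv, choose_access_path(ndv, full_scan_cost=37))
```

With an FTS cost of 37, the rule picks the skip scan for 36 distinct values and the full scan for 37, which matches the two plans above.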


Continuing the experiment:

sql> select count(*) from test1 
  2 where b <= '1'; 
  COUNT(*) 
  ---------- 
  1 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=12 Card=1 Bytes=10) 
  1 0 SORT (AGGREGATE) 
  2 1 INDEX (SKIP SCAN) OF 'TEST_IDX1' (NON-UNIQUE) (Cost=37 Card=1 Bytes=10)

<span style="color:#ff0000;"><strong>Note: among the values of b, '10' is the smallest value greater than '1' (the column type is char(10)).</strong></span>
    
sql> select count(*) from test1 
  2 where b <= '10'; 
  COUNT(*) 
  ---------- 
  2 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=37 Card=1 Bytes=10) 
  1 0 SORT (AGGREGATE) 
  2 1 TABLE ACCESS (FULL) OF 'TEST1' (Cost=37 Card=773 Bytes=7730)
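The two counts (1 and 2) follow from how blank-padded char(10) values sort. Here is a small Python check, assuming plain byte-wise comparison of space-padded strings (which matches ASCII ordering, where a space sorts before any digit). The helper name `pad` is made up for this sketch.

```python
def pad(value, width=10):
    """Blank-pad a string to `width` characters, like a char(10) column."""
    return value.ljust(width)


# Same b values as the test table: '1' .. '100000', each padded to 10 characters.
b_values = sorted(pad(str(i)) for i in range(1, 100001))

count_le_1 = sum(1 for b in b_values if b <= pad("1"))
count_le_10 = sum(1 for b in b_values if b <= pad("10"))

print(repr(b_values[0]), repr(b_values[1]))
print(count_le_1, count_le_10)
```

The two smallest values are '1' and '10', so moving the bound from '1' to '10' adds exactly one row, yet the estimated cardinality jumps enough to change the plan.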

<span style="color:#ff0000;"><strong>Looking at the result, the deciding factor is now cardinality. The cardinality in the second plan (773) is exactly the estimated cardinality of b<='10':</strong></span>

sql> set autotrace off 
  sql> select 100000 
  2 * (to_number('31302020202020202020','xxxxxxxxxxxxxxxxxxxx') - to_number('31202020202020202020','xxxxxxxxxxxxxxxxxxxx')) 
  3 / (to_number('39393939392020202020','xxxxxxxxxxxxxxxxxxxx') - to_number('31202020202020202020','xxxxxxxxxxxxxxxxxxxx')) 
  4 + 1 from dual; 
  100000*(TO_NUMBER('31302020202020202020','XXXXXXXXXXXXXXXXXXXX')-TO_NUMBER('3120 
  -------------------------------------------------------------------------------- 
  772.791768
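The hex literals in that query are simply the blank-padded char(10) values read as numbers: '10' for the bound, '1' for the column minimum and '99999' for the column maximum. The same arithmetic in Python, as a sketch of the article's formula (rows * (bound - low) / (high - low) + 1). The function names are invented here, and this is not a claim about Oracle's exact internal estimator.

```python
def char_to_number(value, width=10):
    """Blank-pad to `width` bytes and read the bytes as one big-endian integer."""
    return int.from_bytes(value.ljust(width).encode("ascii"), "big")


def estimated_cardinality(num_rows, bound, low, high):
    """Range-selectivity estimate used in the article's query."""
    lo = char_to_number(low)
    hi = char_to_number(high)
    x = char_to_number(bound)
    return num_rows * (x - lo) / (hi - lo) + 1


# '10' padded is 0x31302020202020202020, the first literal in the query.
print(hex(char_to_number("10")))
card = estimated_cardinality(100000, bound="10", low="1", high="99999")
print(card)
```

The result is about 772.79, which rounds up to the Card=773 shown in the full-scan plan.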

<span style="color:#ff0000;"><strong>Now an equivalent statement that also includes a condition on the leading column:</strong></span>

sql> set autotrace on explain 
  sql> select count(*) from test1 
  2 where a>=0 
  3 and b <='1'; 
  COUNT(*) 
  ---------- 
  1 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=12 Card=1 Bytes=12) 
  1 0 SORT (AGGREGATE) 
  2 1 INDEX (SKIP SCAN) OF 'TEST_IDX1' (NON-UNIQUE) (Cost=37 Card=1 Bytes=12)



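A few more interesting experiments. The conditions below do not satisfy SS, but note how the columns the query returns affect the execution plan: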
  sql> truncate table test1; 
  Table truncated. 
  sql> begin 
  2 for i in 1..100000 loop 
  3 insert into test1 values (i,to_char(i),to_char(i)); 
  4 end loop; 
  5 commit; 
  6 end; 
  7 / 
  PL/SQL procedure successfully completed. 
  sql> analyze table test1 compute statistics; 
  Table analyzed. 
  sql> select * from test1 
  2 where b = '500'; 
  A B C 
  ---------- ---------- ---------- 
  500 500 500 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=37 Card=1 Bytes=19) 
  1 0 TABLE ACCESS (FULL) OF 'TEST1' (Cost=37 Card=1 Bytes=19)

  <span style="color:#ff0000;"><strong>Change the returned columns:</strong></span>

  sql> select count(*) from test1 
  2 where b = '500'; 
  COUNT(*) 
  ---------- 
  1 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=34 Card=1 Bytes=10) 
  1 0 SORT (AGGREGATE) 
  2 1 INDEX (FAST FULL SCAN) OF 'TEST_IDX1' (NON-UNIQUE) (Cost=34 Card=1 Bytes=10)

<span style="color:#ff0000;"><strong>And one more variation:</strong></span>

  sql> select a from test1 
  2 where b = '500'; 
  A 
  ---------- 
  500 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=CHOOSE (Cost=34 Card=1 Bytes=14) 
  1 0 INDEX (FAST FULL SCAN) OF 'TEST_IDX1' (NON-UNIQUE) (Cost=34 Card=1 Bytes=14)
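One plausible reading of these three plans is that the index alone can answer a query only when every column the query touches is stored in the index. Here is a minimal sketch of that check, with hypothetical names. It models only column coverage and ignores everything else the optimizer weighs.

```python
INDEX_COLUMNS = {"a", "b"}  # columns of TEST_IDX1 on test1(a, b)


def index_only_possible(referenced_columns, index_columns=INDEX_COLUMNS):
    """True when all referenced columns are available in the index."""
    return set(referenced_columns) <= set(index_columns)


queries = {
    "select * from test1 where b = '500'": {"a", "b", "c"},
    "select count(*) from test1 where b = '500'": {"b"},
    "select a from test1 where b = '500'": {"a", "b"},
}
for sql, cols in queries.items():
    print(index_only_possible(cols), sql)
```

Only the `select *` query needs column c, which lives in the table alone. That is consistent with it getting a full table scan while the other two are served by an index fast full scan.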

<span style="color:#ff0000;"><strong>What about RBO?</strong></span>

  sql> select /*+rule*/a from test1 
  2 where b = '500'; 
  A 
  ---------- 
  500 
  Execution Plan 
  ---------------------------------------------------------- 
  0 SELECT STATEMENT Optimizer=HINT: RULE 
  1 0 TABLE ACCESS (FULL) OF 'TEST1'

It is worth noting that none of the examples above would use the index if run on 8i (whether or not the SS conditions are met).
