GIN指数怎么了，无法避免SEQ扫描？

我创建了一个这样的表，

create table mytable(hash char(40), title varchar(500));
create index name_fts on mytable using gin(to_tsvector('english', 'title'));
CREATE UNIQUE INDEX md5_uniq_idx ON mytable(hash);

当我查询标题时，

test=# explain analyze select * from mytable where to_tsvector('english', title) @@ 'abc | def'::tsquery limit 10;
                                                     QUERY PLAN
--------------------------------------------------------------------------------------------------------------------
 Limit  (cost=0.00..277.35 rows=10 width=83) (actual time=0.111..75.549 rows=10 loops=1)
   ->  Seq Scan on mytable  (cost=0.00..381187.45 rows=13744 width=83) (actual time=0.110..75.546 rows=10 loops=1)
         Filter: (to_tsvector('english'::regconfig, (title)::text) @@ '''abc'' | ''def'''::tsquery)
         Rows Removed by Filter: 10221
 Planning time: 0.176 ms
 Execution time: 75.564 ms
(6 rows)

索引未被使用。什么好主意吗?我有10万行

你的索引定义中有一个错别字，应该是

ON mytable USING gin (to_tsvector('english', title))

不是

ON mytable USING gin (to_tsvector('english', 'title'))

你写它的方式，它是一个常量，而不是一个被索引的字段，这样的索引对于像你执行的这样的搜索确实是无用的。

要查看索引是否可以使用，可以执行

SET enable_seqscan=off;

，然后再次运行查询。
如果该索引仍未被使用，则该索引可能无法使用。

除此之外，你的执行计划还有一些让我感到奇怪的地方。PostgreSQL估计对mytable的顺序扫描将返回13744行，而不是您所说的1000万行。您是否禁用了自动真空功能，还是有其他原因导致您的表统计数据如此不准确?

相关内容

最新更新

热门标签：