由于查询中的限制而丢失信息

我有一个列为col1-col10的表tablename。不是每一行都填充了col4，而是每一行填充了col1、col2和col3。当col4满足条件时，我想获取所有{col1，col2，col3}元组，然后从表名中获取与元组{col1、col2、col3}匹配的所有行。

我有这个问题：

select t.*
from mytable t
where exists (
select 1
from mytable t1
where 
t1.col1 = t.col1 
and t1.col2 = t.col2 
and t1.col3 = t.col3 
and t1.col4 >= 1000
)
LIMIT 1000

表的大小很大，所以我必须添加限制。由于限制，对于某些｛col1、col2、col3｝，无法获取结果数据集中的所有行。然后，我想从表名中获取与元组{col1，col2，col3}匹配的所有行。

我不介意在我的结果中有更少的{col1，col2，col3}元组，但我想要我所拥有的元组的完整信息。

我怎样才能做到这一点？

您没有提到哪个数据库，但下面的查询应该运行得更快。你可以做：

select t.*
from t
join (
select distinct col1, col2, col3 
from t
where col4 >= 1000
limit 100
) x on t.col1 = x.col1 and t.col2 = x.col2 and t.col3 = x.col3;

有了以下索引，查询应该会变得更快：

create index ix1 on t (col4, col1, col2, col3);
create index ix2 on t (col1, col2, col3);

一种更有效的方法是使用窗口函数：

select t.*
from (select t.*,
sum(case when col4 > 1000 then 1 else 0 end) over (partition by col1, col2, col3) as cnt_matches
from mytable t
) t
where cnt_matches > 0;

相关内容

最新更新

热门标签：