我在ROR应用程序中使用"chewy"gem进行弹性搜索。但我没有找到任何关于弹性搜索滚动api的文档。当我跳到记录的最后一页时,我得到了下面的错误。
[500] {"error":{"root_cause":[{"type":"query_phase_execution_exception","reason":"Result window is too
large, from + size must be less than or equal to: [10000] but was [19450]. See the scroll api for a more
efficient way to request large data sets. This limit can be set by changing the [index.max_result_window]
index level parameter."}],"type":"search_phase_execution_exception","reason":"all shards failed",
"phase":"query","grouped":true,"failed_shards":[{"shard":0,"index":"recordings","node":"tgLqH_wwRUG6NmY0PCB0nA",
"reason":{"type":"query_phase_execution_exception","reason":"Result window is too large, from + size must
be less than or equal to: [10000] but was [19450]. See the scroll api for a more efficient way to request
large data sets. This limit can be set by changing the [index.max_result_window] index level
parameter."}}]},"status":500}
有没有办法在chewygem中实现弹性搜索滚动api,或者他们有其他选择?
只需缩小查询大小,就可以批量使用滚动:
# @example Call the `scroll` API until all the documents are returned
#
# # Index 1,000 documents
# client.indices.delete index: 'test'
# 1_000.times do |i| client.index index: 'test', type: 'test', id: i+1, body: {title: "Test #{i}"} end
# client.indices.refresh index: 'test'
#
# # Open the "view" of the index by passing the `scroll` parameter
# # Sorting by `_doc` makes the operations faster
# r = client.search index: 'test', scroll: '1m',
body: {size: 100, sort: ['_doc']}
#
# # Display the initial results
# puts "--- BATCH 0 -------------------------------------------------"
# puts r['hits']['hits'].map { |d| d['_source']['title'] }.inspect
#
# # Call the `scroll` API until empty results are returned
# while r = client.scroll(scroll_id: r['_scroll_id'], scroll: '5m') and not r['hits']['hits'].empty? do
# puts "--- BATCH #{defined?($i) ? $i += 1 : $i = 1} -------------------------------------------------"
# puts r['hits']['hits'].map { |d| d['_source']['title'] }.inspect
# puts
# end
使用Elasticsearch DSL Gem 的示例