Azure Cognitive Search: highlights for the phrase search operator



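We are trying out Azure Cognitive Search to enable full-text search over documents stored in Azure Blob Storage. One of the features we need is to show hit highlights for a particular document.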

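We noticed that although a search for an exact phrase correctly matches only documents containing that exact phrase, the highlights come back for the individual words of the phrase rather than for the phrase as a whole.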

Example

For the phrase search "supply agreement", we get separate highlights for "supply" and "agreement".

Request:

{
"search": "\"supply agreement\"",
"select": "metadata_storage_name,metadata_storage_path,language",
"searchFields": "merged_content",
"highlight": "merged_content"
}
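
The inner double quotes are what make this a phrase query, and they have to be escaped inside the JSON string. A minimal Python sketch of building the same body so that the escaping is handled by the serializer; the helper name `build_phrase_query` is made up for illustration, and the field names are simply the ones from the request above:

```python
import json


def build_phrase_query(phrase: str) -> str:
    """Return a JSON request body that searches for `phrase` as an exact phrase.

    Hypothetical helper, not part of any Azure SDK.
    """
    body = {
        # Wrapping the text in double quotes turns it into a phrase query;
        # json.dumps escapes those inner quotes as \" in the output.
        "search": f'"{phrase}"',
        "select": "metadata_storage_name,metadata_storage_path,language",
        "searchFields": "merged_content",
        "highlight": "merged_content",
    }
    return json.dumps(body, indent=2)


request_body = build_phrase_query("supply agreement")
print(request_body)
```

This only produces the body; sending it to the service (endpoint, index name, API key) is left out because those details are not given in the question.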

Response:

{
"@odata.context": "https://....search.windows.net/indexes('...')/$metadata#docs(*)",
"value": [
{
"@search.score": 0.047654618,
"@search.highlights": {
"merged_content": [
"Customer has agreed to engage Supplier to <em>supply</em> the Products and Supplier has agreed to accept the engagement on the terms set out in this <em>Agreement</em>.",
"<em>Agreement</em>\n1.",
"Tax means goods and services, value added or similar consumption based tax applicable to the <em>supply</em> of the Products under this <em>agreement</em>.",
...
]
},
"metadata_storage_name": "a2b23e30-c1e0-4c52-a659-d8705662d699.docx",
"metadata_storage_path": "...",
"language": "en"
},
...
]
}

Is this a known issue in the current version of the Azure Cognitive Search API?

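There is currently no way to highlight the entire phrase, but I have some good news. We are tracking this and plan to release support for phrase highlighting, although I do not have a specific date to announce yet.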

Luis Cabrera, Principal Program Manager, Azure Cognitive Search
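
Until phrase highlighting is available in the service, one possible client-side workaround is to re-highlight the returned fragments yourself. The sketch below is not a service feature and is not taken from the answer above. It assumes the default `<em>`/`</em>` highlight tags and plain case-insensitive matching, so it will miss variants that only match after analysis (stemming, for example). The function name `highlight_phrase` and the second sample fragment are made up for illustration.

```python
import re
from typing import Optional


def highlight_phrase(fragment: str, phrase: str,
                     pre: str = "<em>", post: str = "</em>") -> Optional[str]:
    """Re-highlight `phrase` as one unit, or return None if it is absent."""
    words = phrase.split()
    if not words:
        return None
    # Drop the per-term tags the service returned.
    plain = fragment.replace(pre, "").replace(post, "")
    # Match the words in order, separated by whitespace, ignoring case.
    pattern = re.compile(r"\s+".join(re.escape(w) for w in words),
                         re.IGNORECASE)
    if not pattern.search(plain):
        return None
    return pattern.sub(lambda m: f"{pre}{m.group(0)}{post}", plain)


fragments = [
    # First fragment from the response in the question.
    "Customer has agreed to engage Supplier to <em>supply</em> the Products "
    "and Supplier has agreed to accept the engagement on the terms set out "
    "in this <em>Agreement</em>.",
    # Made-up fragment in which the two words are adjacent.
    "This <em>Supply</em> <em>Agreement</em> is made between the parties.",
]

rehighlighted = [h for f in fragments
                 if (h := highlight_phrase(f, "supply agreement"))]
for h in rehighlighted:
    print(h)
```

Fragments in which the words are not adjacent are dropped rather than shown with per-word highlights; whether that is the right behaviour depends on how the highlights are displayed.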
