Sitecore Rich Text field is not indexed correctly in Solr, only in Production

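I have a field whose type is set to Rich Text in Sitecore. In the lower environments the value is indexed correctly, and the HTML text is stored in Solr as expected.

In Production, the HTML is stripped completely for all items.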

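Update: one difference is that the lower environments use on-prem Solr, whereas Production uses SolrCloud.

Update: I have checked the CM and CD servers, and both have the field reader for the Body Copy field.

Update: this now happens for all items. Earlier, the other items had probably not been updated and published, so they still showed the HTML correctly.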

What could be the issue?

It only happens in Production.

I have verified that the configuration is as expected. The field is Body Copy.

<fieldReaders type="Sitecore.ContentSearch.FieldReaders.FieldReaderMap, Sitecore.ContentSearch">
  <param desc="id">defaultFieldReaderMap</param>
  <mapFieldByTypeName hint="raw:AddFieldReaderByFieldTypeName">
    <fieldReader fieldTypeName="html|rich text" fieldReaderType="Sitecore.ContentSearch.FieldReaders.RichTextFieldReader, Sitecore.ContentSearch" />
  </mapFieldByTypeName>
  <mapFieldByFieldName hint="raw:AddFieldReaderByFieldName">
    <fieldReader fieldName="Body Copy" fieldReaderType="Sitecore.ContentSearch.FieldReaders.DefaultFieldReader, Sitecore.ContentSearch" />
  </mapFieldByFieldName>
</fieldReaders>

This is now happening for all content.

I have fixed the HTML errors in the fields that reported them, but that did not resolve it either.

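If it happens uniformly across all fields, then my guess is that it must be your Solr configuration.

Take a look at the managed-schema.xml file of the relevant Solr core on both the test and production systems.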

You can do this via the file system (on-prem):
i.e. C:\Solr\server\Solr\web_index\conf\managed-schema.xml

Or via the Solr dashboard (cloud and on-prem):
i.e. https://solr-domain:8983/solr/#/web_index/files?file=managed-schema

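I suspect you will find a difference in the filters.

See HTMLStripCharFilterFactory at this link for an example of a filter that could cause the problem you describe: https://solr.apache.org/guide/8_1/charfilterfactories.html#solr-htmlstripcharfilterfactory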
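
For illustration, this is roughly what a field type using that char filter looks like in a managed schema. It is only a sketch: the field type name `text_html_stripped` and the field name `body_copy_t` are made up for the example and are not taken from your schema. Only the Solr class names are the standard ones.

```xml
<!-- Hypothetical field type; the name is illustrative only -->
<fieldType name="text_html_stripped" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Char filters run before the tokenizer; this one removes HTML markup from the input -->
    <charFilter class="solr.HTMLStripCharFilterFactory"/>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

<!-- Hypothetical field that uses the type above -->
<field name="body_copy_t" type="text_html_stripped" indexed="true" stored="true"/>
```

If the production schema has a `charFilter` entry like this on the type used by your Body Copy field and the lower environments do not, that is the kind of difference I would look at first.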

Let me know if this helps.

Regards,
Dean
