我们知道。在很多搜索网站上都有这样的一个功能,就是竞价排名,绑定某个关键字,给钱了那我给你排前面。
包括很多视频网站的一些站内搜索功能也有这样的需求;
最近在看solr官网,solr的查询功能中就包含了这一功能。参考地址如下:
https://wiki.apache.org/solr/QueryElevationComponent#elevateIds.2FexcludeIds
下面这个网址包含了solr所有功能;对solr感兴趣的可以收藏下!
https://cwiki.apache.org/confluence/display/solr/Spell+Checking
那么接下来我们来实现我们的功能把。
<searchComponent name="elevator" class="org.apache.solr.handler.component.QueryElevationComponent"> <!-- pick a fieldType to analyze queries --> <str name="queryFieldType">string</str> <str name="config-file">elevate.xml</str> </searchComponent> <!-- A request handler for demonstrating the elevator component --> <requestHandler name="/elevate" class="org.apache.solr.handler.component.SearchHandler" startup="lazy"> <lst name="defaults"> <str name="echoParams">explicit</str> <!-- <str name="df">text</str> --> </lst> <arr name="last-components"> <str>elevator</str> </arr> </requestHandler>
config-file:elevate.xml,这个文件solr默认已经存在,需要自己配置;默认跟solrconfig.xml同一目录,如无特殊要求可不做修改
searchComponent的name属性是和requestHandler有一个绑定关系的,大家注意一下;
还有就是,requestHandler定义的名称/elevate是我们后面需要用到的节点名,就像selelct一样。使用这一功能的时候把select换成elevate。
requestHandler标签中的defaults-->df是定义了你要做竞价排名,或者说人工干预的查询字段;如果此处不指定的话,需要在查询的时候指定;
<elevate> <query text="isle"> <doc id="6493" /> <!-- put the actual ipod at the top --> <doc id="3971" exclude="true" /> <!-- exclude this cable --> </query> </elevate>
以上是一个简单的示例,当用户查询isle时,docID为6493的文档会被提到最前。而这里的docID为3971的文档将会被提出到查询的结果之外,因为配置了exclude属性为true。
如果不配置默认为false,即不排除!
接下来我们查询下看看。这里先看下正常的查询结果。这里我们把enableElevation设置为false即可
<response> <lst name="responseHeader"> <int name="status">0</int> <int name="QTime">1</int> <lst name="params"> <str name="enableElevation">false</str> <str name="df">name</str> <str name="q">isle</str> <str name="forceElevation">true</str> </lst> </lst> <result name="response" numFound="6" start="0"> <doc> <str name="id">302</str> <str name="name">The Isle</str> <str name="year">2001</str> <long name="_version_">1480190152307376130</long> </doc> <doc> <str name="id">2</str> <str name="name">Isle of Man TT 2004 Review</str> <str name="year">2004</str> <long name="_version_">1480190152202518528</long> </doc> <doc> <str name="id">6493</str> <str name="name">Val Lewton: Isle of the Dead / Bedlam</str> <str name="year">1945</str> <long name="_version_">1480190153170354189</long> </doc> <doc> <str name="id">3871</str> <str name="name">The Who: Live at the Isle of Wight Festival</str> <str name="year">1970</str> <long name="_version_">1480190152951201796</long> </doc> <doc> <str name="id">9409</str> <str name="name">Isle of Wight Festival: Message to Love: 1970</str> <str name="year">1970</str> <long name="_version_">1480190153367486468</long> </doc> <doc> <str name="id">13617</str> <str name="name"> Jethro Tull: Nothing Is Easy: Live at the Isle of Wight 1970 </str> <str name="year">2005</str> <long name="_version_">1480190153579298828</long> </doc> </result> </response>
接下来我们把enableElevation设置为true,查询做下对比
http://localhost:8081/solr/collection1/elevate?q=isle&enableElevation=true&forceElevation=true
查询结果如下:’
<response> <lst name="responseHeader"> <int name="status">0</int> <int name="QTime">18</int> <lst name="params"> <str name="enableElevation">true</str> <str name="df">name</str> <str name="q">isle</str> <str name="forceElevation">true</str> </lst> </lst> <result name="response" numFound="6" start="0"> <doc> <str name="id">6493</str> <str name="name">Val Lewton: Isle of the Dead / Bedlam</str> <str name="year">1945</str> <long name="_version_">1480190153170354189</long> </doc> <doc> <str name="id">302</str> <str name="name">The Isle</str> <str name="year">2001</str> <long name="_version_">1480190152307376130</long> </doc> <doc> <str name="id">2</str> <str name="name">Isle of Man TT 2004 Review</str> <str name="year">2004</str> <long name="_version_">1480190152202518528</long> </doc> <doc> <str name="id">3871</str> <str name="name">The Who: Live at the Isle of Wight Festival</str> <str name="year">1970</str> <long name="_version_">1480190152951201796</long> </doc> <doc> <str name="id">9409</str> <str name="name">Isle of Wight Festival: Message to Love: 1970</str> <str name="year">1970</str> <long name="_version_">1480190153367486468</long> </doc> <doc> <str name="id">13617</str> <str name="name"> Jethro Tull: Nothing Is Easy: Live at the Isle of Wight 1970 </str> <str name="year">2005</str> <long name="_version_">1480190153579298828</long> </doc> </result> </response>
可以看到docID为6493的文档被提到最前。而docID为3971的文档被剔除结果集。
好了,到这基本上功能就完成了。有什么不对的地方请多加指点;