Sphinx Full-Text Search Engine Quick-Start Guide

Author: sadly   Published: 2008-6-02   Views: 1078   Source: http://www.phpx.com

weight desc,hits asc';

offset: starting position within the result set; the default is 0.

limit: number of matches to fetch from the result set; the default is 20.

index: names of the indexes to search.

    ... where query='test;index=cgfinal';
    ... where query='test;index=test1,test2,test3;';

minid, maxid: minimum and maximum document ID to match.

weights: comma-separated list of weights assigned to the Sphinx full-text fields.

    ... where query='test;weights=1,2,3;';

filter, !filter: comma-separated attribute name followed by the set of values to match.

    # only include groups 1, 5 and 19
    ... where query='test;filter=group_id,1,5,19;';
    # exclude groups 3 and 11
    ... where query='test;!filter=group_id,3,11';

range, !range: comma-separated attribute name followed by the minimum and maximum values to match.

    # include groups 3 through 7
    ... where query='test;range=group_id,3,7;';
    # exclude groups 5 through 25
    ... where query='test;!range=group_id,5,25;';

maxmatches: maximum number of matches for this query.

    ... where query='test;maxmatches=2000;';

groupby: group-by function and attribute.

    ... where query='test;groupby=day:published_ts;';
    ... where query='test;groupby=attr:group_id;';

groupsort: sorting clause applied to the groups.

    ... where query='test;groupsort=@count desc;';

One important point: letting Sphinx sort, filter and slice the result set is more efficient than doing the same with MySQL WHERE, ORDER BY and LIMIT. There are two reasons. First, Sphinx applies a number of optimizations and handles these tasks better than MySQL. Second, less data has to be packed by searchd, then transferred to and unpacked by SphinxSE.

You can use JOIN to combine the SphinxSE search table with tables that use other storage engines. Here is an example using the documents table from example.sql:

    mysql> SELECT content, date_added FROM test.documents docs
        -> JOIN t1 ON (docs.id=t1.id)
        -> WHERE query="one document;mode=any";
    +-------------------------------------+---------------------+
    | content                             | date_added          |
    +-------------------------------------+---------------------+
    | this is my test document number two | 2006-06-17 14:04:28 |
    | this is my test document number one | 2006-06-17 14:04:28 |
    +-------------------------------------+---------------------+
    2 rows in set (0.00 sec)

    mysql> SHOW ENGINE SPHINX STATUS;
    +--------+-------+---------------------------------------------+
    | Type   | Name  | Status                                      |
    +--------+-------+---------------------------------------------+
    | SPHINX | stats | total: 2, total found: 2, time: 0, words: 2 |
    | SPHINX | words | one:1:2 document:2:2                        |
    +--------+-------+---------------------------------------------+
    2 rows in set (0.00 sec)
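The option syntax described above (keywords first, then name=value pairs separated by semicolons, with comma-separated value lists) can be assembled programmatically. The following is only a minimal sketch: the helper name build_sphinxse_query and its argument layout are hypothetical, not part of Sphinx or SphinxSE, and it does no escaping of semicolons or quotes inside the keywords.

```python
def build_sphinxse_query(keywords, options=()):
    """Assemble the string that goes into `query='...'` for SphinxSE.

    Hypothetical helper for illustration only; not a Sphinx API.
    `options` is a sequence of (name, value) pairs so that an option
    such as filter can appear more than once and names such as
    "!filter" are allowed.
    """
    parts = [keywords]
    for name, value in options:
        # Lists and tuples become comma-separated values,
        # e.g. ("group_id", 1, 5, 19) -> "group_id,1,5,19".
        if isinstance(value, (list, tuple)):
            value = ",".join(str(v) for v in value)
        parts.append(f"{name}={value}")
    # Keywords and options are separated by semicolons.
    return ";".join(parts)


query = build_sphinxse_query(
    "test",
    [
        ("filter", ("group_id", 1, 5, 19)),   # only groups 1, 5, 19
        ("!range", ("group_id", 5, 25)),      # exclude groups 5..25
        ("maxmatches", 2000),
    ],
)
print(query)  # test;filter=group_id,1,5,19;!range=group_id,5,25;maxmatches=2000
```

When the resulting string is used from application code, it is safer to pass it to the MySQL client as a bound parameter for the query column than to paste it into the SQL text.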
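8. SphinxSE SQL query walkthrough

Find the records in eht_articles whose title contains the keyword "动画" ("animation"):

    SELECT c.* FROM eht_articles AS c, sphinx AS t
    WHERE c.articlesid=t.id AND query='@title 动画;mode=extended'

Note: to search a specific field, write @ plus the field name, a space, the keyword, a semicolon, and then mode=extended. If no field is given, both TITLE and CONTENTS are searched. Which fields are full-text indexed depends on the (text-type) columns selected by sql_query in sphinx.conf.

Find the records in eht_articles whose content or title contains the keyword "CGArt":

    SELECT c.* FROM eht_articles AS c, sphinx AS t
    WHERE c.articlesid=t.id AND query='CGArt'

If AUTHOR, TITLE and CONTENTS are all full-text indexed but you only want articles whose title or contents contain "动画":

    SELECT c.* FROM eht_articles AS c, sphinx AS t
    WHERE c.articlesid=t.id AND query='@title 动画 | @contents 动画;mode=extended'

Find the records whose title contains "动画", with catalogid 7 and edituserid 1:

    SELECT c.* FROM eht_articles AS c, sphinx AS t
    WHERE c.articlesid=t.id AND query='@title 动画;filter=edituserid,1;filter=catalogid,7;mode=extended'

Note: filter=attribute,value is equivalent to attribute=value in a WHERE clause. Every attribute used in a filter must be declared as an attribute in the source section of sphinx.conf, for example:

    sql_attr_uint = CATALOGID
    sql_attr_uint = EDITUSERID
    sql_attr_uint = HITS
    sql_attr_timestamp = ADDTIME

Find the records whose title contains "动画", sorted by popularity (hits) in descending order, then by catalog ID in descending order:

    SELECT c.* FROM eht_articles AS c, sphinx AS t
    WHERE c.articlesid=t.id AND query='@title 动画;mode=extended;sort=extended:hits desc,catalogid desc'

By default Sphinx returns rows ordered by weight, highest first. The weight is computed by Sphinx's internal ranking algorithm, and a larger value means a better match. To request ordering by relevance explicitly, name the extended sort mode before the clause:

    SELECT c.* FROM eht_articles AS c, sphinx AS t
    WHERE c.articlesid=t.id AND query='@title 动画;mode=extended;sort=extended:@weight desc'

Find the records whose content or title contains "优秀" ("excellent"), "Icon" or "设计" ("design"), grouped by catalogid and sorted by relevance from highest to lowest:

    SELECT t.*, c.* FROM eht_articles AS c, sphinx AS t
    WHERE c.articlesid=t.id AND query='优秀 | Icon | 设计;mode=extended;groupby=attr:catalogid;groupsort=@weight desc;'

9. How to rebuild the index automatically

10. Related resources

Building a custom search engine with PHP
Official manual
The sphinx.conf configuration file mentioned in this article (view it with GBK encoding)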
