HanLP Chinese word segmentation plugin for Lucene. It supports Lucene-based systems, including Solr.
<dependency>
    <groupId>com.hankcs.nlp</groupId>
    <artifactId>hanlp-lucene-plugin</artifactId>
    <version>1.1.6</version>
</dependency>
<dependency>
    <groupId>com.hankcs.nlp</groupId>
    <artifactId>hanlp-lucene-plugin</artifactId>
    <version>1.1.5</version>
</dependency>
<dependency>
    <groupId>com.hankcs.nlp</groupId>
    <artifactId>hanlp-lucene-plugin</artifactId>
    <version>1.1.4</version>
</dependency>
<dependency>
    <groupId>com.hankcs.nlp</groupId>
    <artifactId>hanlp-lucene-plugin</artifactId>
    <version>1.1.3</version>
</dependency>
<dependency>
    <groupId>com.hankcs.nlp</groupId>
    <artifactId>hanlp-lucene-plugin</artifactId>
    <version>1.1.2</version>
</dependency>
Fixed misaligned highlighting caused by Windows CRLF line endings (\r\n): https://github.com/hankcs/HanLP/issues/222
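Added configuration options for Traditional Chinese segmentation, user-defined dictionaries, stop words, person name recognition, place name recognition, organization name recognition, and more.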
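As a rough illustration of how these options might be set in a Solr schema, the sketch below configures a field type that uses the plugin's tokenizer factory. The field type name `text_cn` and both file paths are hypothetical. The attribute names are recalled from the plugin's README and are an assumption here, so verify them against the version you install before relying on them.

```xml
<!-- Sketch only: "text_cn" and the dictionary paths are hypothetical placeholders. -->
<!-- Attribute names are assumed from the plugin README; check them for your version. -->
<fieldType name="text_cn" class="solr.TextField">
  <analyzer type="index">
    <tokenizer class="com.hankcs.lucene.HanLPTokenizerFactory"
               enableIndexMode="true"
               enableTraditionalChineseMode="true"
               customDictionaryPath="/path/to/custom-dictionary.txt"
               stopWordDictionaryPath="/path/to/stopwords.txt"
               enableNameRecognize="true"
               enablePlaceRecognize="true"
               enableOrganizationRecognize="true"/>
  </analyzer>
  <analyzer type="query">
    <!-- Index mode is switched off at query time in this sketch. -->
    <tokenizer class="com.hankcs.lucene.HanLPTokenizerFactory"
               enableIndexMode="false"/>
  </analyzer>
</fieldType>
```

Using separate index and query analyzers lets fine-grained segmentation apply only when indexing, which is a common choice for Chinese search but not a requirement of the plugin.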
Highlighting of search results is now supported. See the demo: https://github.com/hankcs/hanlp-solr-plugin/blob/91eefb5f00192c31f0fc26f0215556b808791785/src/test/java/com/hankcs/lucene/HighLighterTest.java
Hotfix: fixed a bug where consecutive \n characters caused text not to be segmented.