{"id":15025244,"url":"https://github.com/magese/ik-analyzer-solr","last_synced_at":"2025-05-15T10:05:34.819Z","repository":{"id":43114236,"uuid":"132854880","full_name":"magese/ik-analyzer-solr","owner":"magese","description":"ik-analyzer for solr 7.x-8.x","archived":false,"fork":false,"pushed_at":"2024-02-19T01:29:26.000Z","size":11110,"stargazers_count":1148,"open_issues_count":14,"forks_count":172,"subscribers_count":28,"default_branch":"master","last_synced_at":"2025-04-14T16:55:20.116Z","etag":null,"topics":["ik-analysis","ik-analyzer","java","lucene","solr","solrcloud"],"latest_commit_sha":null,"homepage":"","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/magese.png","metadata":{"files":{"readme":"README-CLOUD.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-05-10T05:53:09.000Z","updated_at":"2025-04-14T08:24:16.000Z","dependencies_parsed_at":"2024-10-14T08:42:47.541Z","dependency_job_id":"bed77d26-6c4f-469f-8114-3e404eff65e8","html_url":"https://github.com/magese/ik-analyzer-solr","commit_stats":{"total_commits":130,"total_committers":4,"mean_commits":32.5,"dds":0.0692307692307692,"last_synced_commit":"f0059e2c78d82bbd12630388378073de4ffcc7db"},"previous_names":["magese/ik-analyzer-solr7"],"tags_count":11,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/magese%2Fik-analyzer-solr","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/magese%2Fik-analyzer-solr/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/magese%2Fik-analyzer-solr/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/magese%2Fik-analyzer-solr/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/magese","download_url":"https://codeload.github.com/magese/ik-analyzer-solr/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254319718,"owners_count":22051072,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ik-analysis","ik-analyzer","java","lucene","solr","solrcloud"],"created_at":"2024-09-24T20:01:52.823Z","updated_at":"2025-05-15T10:05:34.714Z","avatar_url":"https://github.com/magese.png","language":"Java","funding_links":[],"categories":["人工智能"],"sub_categories":[],"readme":"\n## Solr-Cloud说明\n\n##### 因为`Solr-Cloud`中的配置文件是交由`zookeeper`进行管理的， 所以为了方便更新动态词典， 所以也要将动态词典文件上传至`zookeeper`中，目录与`solr`的配置文件目录一致。\n\n#### 注意：因为`zookeeper`中的配置文件大小不能超过`1m`，当词典列表过多时，需将词典文件切分成多个。\n\n\n1. 将jar包放入每台服务器的Solr服务的`Jetty`或`Tomcat`的`webapp/WEB-INF/lib/`目录下；\n\n2. 将`resources`目录下的`IKAnalyzer.cfg.xml`、`ext.dic`、`stopword.dic`放入solr服务的`Jetty`或`Tomcat`的`webapp/WEB-INF/classes/`目录下；\n    ```console\n    ① IKAnalyzer.cfg.xml (IK默认的配置文件，用于配置自带的扩展词典及停用词典)\n    ② ext.dic (默认的扩展词典)\n    ③ stopword.dic (默认的停词词典)\n    ```\n    注意：与单机版不同，`ik.conf`及`dynamicdic.txt`请不要放在`classes`目录下！\n\n3. 将`resources`目录下的`ik.conf`及`dynamicdic.txt`放入solr配置文件夹中，与solr的`managed-schema`文件同目录中；\n    ```console\n        ① ik.conf (动态词典配置文件)\n            files (动态词典列表，可以设置多个词典表，用逗号进行分隔，默认动态词典表为dynamicdic.txt)\n            lastupdate (默认值为0，每次对动态词典表修改后请修改该值，必须大于上次的值，不然不会将词典表中新的词语添加到内存中。)\n        ② dynamicdic.txt (默认的动态词典，在此文件配置的词语不需重启服务即可加载进内存中。以#开头的词语视为注释，将不会加载到内存中。)\n    ```\n\n4. 配置Solr的`managed-schema`，添加`ik分词器`，示例如下；\n   ```xml\n   \u003c!-- ik分词器 --\u003e\n   \u003cfieldType name=\"text_ik\" class=\"solr.TextField\"\u003e\n     \u003canalyzer type=\"index\"\u003e\n         \u003ctokenizer class=\"org.wltea.analyzer.lucene.IKTokenizerFactory\" useSmart=\"false\" conf=\"ik.conf\"/\u003e\n         \u003cfilter class=\"solr.LowerCaseFilterFactory\"/\u003e\n     \u003c/analyzer\u003e\n     \u003canalyzer type=\"query\"\u003e\n         \u003ctokenizer class=\"org.wltea.analyzer.lucene.IKTokenizerFactory\" useSmart=\"true\" conf=\"ik.conf\"/\u003e\n         \u003cfilter class=\"solr.LowerCaseFilterFactory\"/\u003e\n     \u003c/analyzer\u003e\n   \u003c/fieldType\u003e\n   ```\n\n5. 将配置文件上传至`zookeeper`中，首次使用请重启服务或reload Collection。\n\n6. 测试分词：\n    * 此时的动态词典文件为空\n    ![](./img/dic.png)\n    * 配置文件lastupdate为0\n    ![](./img/conf.png)\n    * 测试分词\n    ![](./img/cloud-analyzer.png)\n\n7. 测试动态词典：\n    * 增加动态词典词语并上传至`zookeeper`\n    ![](./img/dic2.png)\n    * 修改配置文件并上传至`zookeeper`\n    ![](./img/conf2.png)\n    * 测试分词\n    ![](./img/cloud-analyzer2.png)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmagese%2Fik-analyzer-solr","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmagese%2Fik-analyzer-solr","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmagese%2Fik-analyzer-solr/lists"}