温馨提示×

温馨提示×

您好,登录后才能下订单哦!

密码登录×
登录注册×
其他方式登录
点击 登录注册 即表示同意《亿速云用户服务条款》

elasticsearch分词器怎么使用

发布时间:2022-03-22 16:10:20 阅读:397 作者:iii 栏目:大数据
开发者测试专用服务器限时活动,0元免费领,库存有限,领完即止! 点击查看>>

本篇内容主要讲解“elasticsearch分词器怎么使用”,感兴趣的朋友不妨来看看。本文介绍的方法操作简单快捷,实用性强。下面就让小编来带大家学习“elasticsearch分词器怎么使用”吧!

1、在Elasticsearch的plugins目录下建一个ik目录,将ik安装包解压到该目录 或 ./bin/elasticsearch-plugin install 

2、重启Elasticsearch

3、测试分词器

GET /_analyze
{
  "text""中华人民共和国国徽",
  # 两种:ik_smart较粗粒度的拆分 和 ik_max_word较细粒度的拆分
  "analyzer""ik_max_word"
}

4、使用分词器 在模版中定义好分词

"settings": {
    "index": {
      "number_of_shards""1",
      "number_of_replicas""0"
    },
    "analysis": {
      "analyzer": {
        "ik":{
          "tokenizer" : "ik_max_word"
        }
      }
    }
  },
  .......
  "mappings": {
      "_doc" : {   
        "dynamic_templates": [
          {
            "integers": {
              "mapping": {
                "type""long"
              },
              "match_mapping_type""long",
              "match""*"
            }
          },
          {
            "doubles": {
              "mapping": {
                "type""double"
              },
              "match_mapping_type""double",
              "match""*"
            }
          },
          {
            "time_mapping": {
              "match_pattern""regex",
              "mapping": {
                "format""yyyy-MM-dd HH:mm:ss||strict_date_optional_time||epoch_millis",
                "type""date"
              },
              "match_mapping_type""string",
              "match"".+(time|Time|TIME|date|Date|DATE)"
            }
          },
          {
            "ip_match": {
              "match_pattern""regex",
              "mapping": {
                "type""ip"
              },
              "match_mapping_type""string",
              "match"".+(IP|Ip|ip|iP)"
            }
          },
          {
            "port_match": {
              "match_pattern""regex",
              "mapping": {
                "type""long"
              },
              "match_mapping_type""string",
              "match"".+(Port|port|PORT)"
            }
          },
          {
            "name_mapping": {
              "match_pattern""regex",
              "mapping": {
                "fields": {
                  "keyword": {
                    "ignore_above": 2560,
                    "type""keyword"
                  }
                },
                "type""text"
              },
              "match_mapping_type""string",
              "match"".+(Name|name)"
            }
          },
          {
            "strings": {
              "mapping": {
                "type""keyword"
              },
              "match_mapping_type""string",
              "match""*"
            }
          }
        ],
        "date_detection"false,
        "properties": {
          "content": {
            "type" : "text",
            "analyzer""ik_max_word",
            "fields" : {
              "keyword" : {
                "type" : "keyword",
                "ignore_above" : 256
              }
            }
          },
          "level": {
            "type""keyword"
          },
          "target": {
            "type""keyword"
          },
          "source": {
            "type""keyword"
          },
          "alerttime": {
            "type""date",
            "format""yyyy-MM-dd HH:mm:ss"
          }
        }
      }
     }

到此,相信大家对“elasticsearch分词器怎么使用”有了更深的了解,不妨来实际操作一番吧!这里是亿速云网站,更多相关内容可以进入相关频道进行查询,关注我们,继续学习!

亿速云「云服务器」,即开即用、新一代英特尔至强铂金CPU、三副本存储NVMe SSD云盘,价格低至29元/月。点击查看>>

向AI问一下细节

免责声明:本站发布的内容(图片、视频和文字)以原创、转载和分享为主,文章观点不代表本网站立场,如果涉及侵权请联系站长邮箱:is@yisu.com进行举报,并提供相关证据,一经查实,将立刻删除涉嫌侵权内容。

原文链接:https://my.oschina.net/zlhblogs/blog/4782805

AI

开发者交流群×