Elastic Stack如何使用

发表于 2021-03-26 更新于 2023-02-22 分类于 op Changyan：本文字数： 6.8k 阅读时长 ≈ 25 分钟

如果你没有听说过Elastic Stack，那你一定听说过ELK，实际上ELK是三款软件的简称，分别是Elasticsearch、 Logstash、Kibana组成，在发展的过程中，又有新成员Beats的加入，所以就形成了Elastic Stack。所以说，ELK是旧的称呼，Elastic Stack是新的名字。

索引

创建索引

创建默认索引

创建索引api接口地址：127.0.0.1:9200/articles?pretty(创建articles索引)

请求方式：put

查看刚才创建好了articles状态

127.0.0.1:9200/articles/?pretty

{
    "articles": {
        "aliases": {},
        "mappings": {},
        "settings": {
            "index": {
                "routing": {
                    "allocation": {
                        "include": {
                            "_tier_preference": "data_content"
                        }
                    }
                },
                "number_of_shards": "1",
                "provided_name": "articles",
                "creation_date": "1625205220230",
                "number_of_replicas": "1",
                "uuid": "LoE8nBAVRnaLA4GPJfiBuA",
                "version": {
                    "created": "7130299"
                }
            }
        }
    }
}

number_of_shards 是指索引要做多少个分片，只能在创建索引时指定，后期无法修改。(创建时未指定，默认为1)
number_of_replicas 是指每个分片有多少个副本，后期可以动态修改。(创建时未指定，默认为1)

primary shard：主分片，每个文档都存储在一个分片中，当你存储一个文档的时候，系统会首先存储在主分片中，然后会复制到不同的副本中。默认情况下，一个索引有5个主分片。你可以在事先制定分片的数量，当分片一旦建立，分片的数量则不能修改。

replica shard：副本分片，每一个分片有零个或多个副本。副本主要是主分片的复制，可以增加高可用性，提高性能。
默认情况下，一个主分配有一个副本，但副本的数量可以在后面动态的配置增加。
副本必须部署在不同的节点上，不能部署在和主分片相同的节点上。

创建索引时并设置分片

创建索引api接口地址：127.0.0.1:9200/articles?pretty(创建articles索引)

请求方式：put

请求体：

 {
     "settings":{
         "index.number_of_shards":2,
         "index.number_of_replicas":1
     }
}

查看索引状态

{
    "articles": {
        "aliases": {},
        "mappings": {},
        "settings": {
            "index": {
                "routing": {
                    "allocation": {
                        "include": {
                            "_tier_preference": "data_content"
                        }
                    }
                },
                "number_of_shards": "2",
                "provided_name": "articles",
                "creation_date": "1625200722765",
                "number_of_replicas": "1",
                "uuid": "7cl2XAMLSWegRYMXFmI87Q",
                "version": {
                    "created": "7130299"
                }
            }
        }
    }
}

创建索引时并设置映射

创建用户索引api地址：127.0.0.1:9200/users?pretty

请求方式：put

请求体：

 {
     "mappings":{
         "properties":{
             "name":{
                 "type":"text",
                 "analyzer": "ik_max_word"
             },
             "age":{
                 "type":"integer"
             },
             "createtime":{
                 "type":"date"
             },
             "position":{
                 "type":"text",
                 "analyzer": "ik_max_word"
             },
             "url": {
                    "type": "keyword",
                    "index": false,
                    "doc_values": false
                }
         }
     }
}

type：字段类型

analyzer：分析器(这里使用了ik中文分词器，第三方插件需要安装)；不设置默认使用standard标准分析器，即逐个字符拆分。

index：禁用索引，这个字段不能被搜索，但是它并不妨碍做聚合。

doc_values：对一个字段进行排序；对一个字段进行聚合；某些过滤，比如地理位置过滤某些与字段相关的脚本计算；使用 docvalue_fields 返回搜索结果部分字段值

查询

查询所有文档

语法：elasticsearch服务地址/索引/_search

可选参数

_source：只获取 _source 部分参数，类似数据库查询中的指定字段，而不是 select * 返回所有字段(多个字段之间使用逗号分隔)

size: 要返回的结果数量，默认为 10

from: 要跳过的结果数量，默认为 0

查询5篇文章，从第10条开始查询，只显示id和title

使用get带参数请求查询

请求方式：get

请求地址：http://127.0.0.1:9903/articles/_search?_source=title,id&size=5&from=10

返回结果：

{
    "took": 5,
    "timed_out": false,
    "_shards": {
        "total": 3,
        "successful": 3,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 2222,
            "relation": "eq"
        },
        "max_score": 1.0,
        "hits": [
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "442",
                "_score": 1.0,
                "_source": {
                    "id": 442,
                    "title": "深入学习HTML5的history API"
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "450",
                "_score": 1.0,
                "_source": {
                    "id": 450,
                    "title": "想让百度删除不想收录的域名或快照的最快解决方法"
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "466",
                "_score": 1.0,
                "_source": {
                    "id": 466,
                    "title": "PHP采集远程图片保存本地"
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "490",
                "_score": 1.0,
                "_source": {
                    "id": 490,
                    "title": "8个最佳Web开发资源推荐"
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "530",
                "_score": 1.0,
                "_source": {
                    "id": 530,
                    "title": "前方高能反应！设计师最常见的五个设计误区"
                }
            }
        ]
    }
}

使用get/post带请求体查询

请求方式：post/get

请求地址：http://127.0.0.1:9903/articles/_search

请求体：

查询所有，返回指定fields字段，不返回_source，请求条数为5，从第10条开始获取。

{
    "query": {
        "match_all": {}
    },
    "fields": [
        "id",
        "title"
    ],
    "_source": false,
    "size": 5,
    "from": 10
}

返回结果：

{
    "took": 4,
    "timed_out": false,
    "_shards": {
        "total": 3,
        "successful": 3,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 2222,
            "relation": "eq"
        },
        "max_score": 1.0,
        "hits": [
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "442",
                "_score": 1.0,
                "fields": {
                    "title": [
                        "深入学习HTML5的history API"
                    ],
                    "id": [
                        442
                    ]
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "450",
                "_score": 1.0,
                "fields": {
                    "title": [
                        "想让百度删除不想收录的域名或快照的最快解决方法"
                    ],
                    "id": [
                        450
                    ]
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "466",
                "_score": 1.0,
                "fields": {
                    "title": [
                        "PHP采集远程图片保存本地"
                    ],
                    "id": [
                        466
                    ]
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "490",
                "_score": 1.0,
                "fields": {
                    "title": [
                        "8个最佳Web开发资源推荐"
                    ],
                    "id": [
                        490
                    ]
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "530",
                "_score": 1.0,
                "fields": {
                    "title": [
                        "前方高能反应！设计师最常见的五个设计误区"
                    ],
                    "id": [
                        530
                    ]
                }
            }
        ]
    }
}

根据文档id查询

GET <index>/_doc/<_id>      查询指定文档id的文档信息
HEAD <index>/_doc/<_id>     查询指定文档id的文档是否存在，只判断文档是否存在，head 返回的信息更少、 性能更高，满足特殊业务场景使用:
GET <index>/_source/<_id>   查询指定文档id，只返回 _source 信息
HEAD <index>/_source/<_id>  查询指定文档id的文档是否存在，只判断文档是否存在，head 返回的信息更少、 性能更高，满足特殊业务场景使用:

语法：GET elasticsearch服务器地址/索引/_doc/文档id

可选参数：

_source：只获取 _source 部分参数，类似数据库查询中的指定字段，而不是 select * 返回所有字段(多个字段之间使用逗号分隔)；默认返回所有字段；设为false不返回任何字段

查询id为530的文档，只显示id和title

使用_doc查询，返回文档信息

请求方式：GET

请求地址：http://127.0.0.1:9903/articles/_doc/530?_source=title,id

返回结果：

{
    "_index": "articles",
    "_type": "_doc",
    "_id": "530",
    "_version": 1,
    "_seq_no": 205,
    "_primary_term": 4,
    "found": true,
    "_source": {
        "id": 530,
        "title": "前方高能反应！设计师最常见的五个设计误区"
    }
}

使用_source查询，只返回source
请求方式：GET
请求地址：http://127.0.0.1:9903/articles/_source/530?_source=title,id 或者http://127.0.0.1:9903/articles/_source/530?_source_includes=title,id
返回结果：
1
2
3
4
{
"id": 530,
"title": "前方高能反应！设计师最常见的五个设计误区"
}

批量查询

Mutil get：ES 同时支持批量查询，需要使用 _mget API

查询文档 ID 等于 466 和 490 的文档信息

内容太长，此处只取id和title

请求方式：get/post

请求地址：http://127.0.0.1:9903/articles/_mget?_source=title,id

请求体：

{
    "docs": [
        {
            "_id": "466"
        },
        {
            "_id": "490"
        }
    ]
}

返回结果：

{
    "docs": [
        {
            "_index": "articles",
            "_type": "_doc",
            "_id": "466",
            "_version": 1,
            "_seq_no": 195,
            "_primary_term": 4,
            "found": true,
            "_source": {
                "id": 466,
                "title": "PHP采集远程图片保存本地"
            }
        },
        {
            "_index": "articles",
            "_type": "_doc",
            "_id": "490",
            "_version": 1,
            "_seq_no": 198,
            "_primary_term": 4,
            "found": true,
            "_source": {
                "id": 490,
                "title": "8个最佳Web开发资源推荐"
            }
        }
    ]
}

Query DSL

查询索引包括全文本查询、组合查询、结构化查询等。

Search和Filter区别

Query 查询
用于解答文档是否存在，并且告知返回文档与查询条件的匹配度，返回 _score 评分供用户选择。
Filter 查询
只用于返回文档是否与查询匹配，但是不会告诉你匹配度，即不进行评分。在做聚合查询时，filter 经常发挥更大的作用。因为没有评分 Elasticsearch 的处理速度就会提高，提升了整体响应时间。同时 filter 可以缓存查询结果，而 Query 则不能缓存。

使用场景

如果涉及到全文检索以及评分相关业务使用 Query，其他场景推荐使用 Filter 查询。

组合查询

Boolean 查询

Boolean 查询包含 must、filter、should、must_not。

must :必须匹配并且返回评分（文档必须匹配这些条件才能被包含进来。）；

filter 忽略评分，(必须匹配，但它以不评分、过滤模式来进行。这些语句对评分没有贡献，只是根据过滤标准来排除或包含文档。)

should 相当于数据库查询中的 or，针对 should 有一个特殊的情况，也就是所有的搜索只有 should ，那么必须满足should 里的其中一个才会被搜索到。(如果满足这些语句中的任意语句，将增加 _score ，否则，无任何影响。它们主要用于修正每个文档的相关性得分。)

must_not 为不匹配，相当于不等于(文档 必须不 匹配这些条件才能被包含进来。)。

查询作者为2；类别为3；浏览量不在2000-8000之间的文档
请求方式：get/post
请求地址：http://127.0.0.1:9903/articles/_search?_source=title,id,author,views,cat
请求体：

{
    "query": {
        "bool": {
            "must": {
                "term": {
                    "author": 2
                }
            },
            "filter": {
                "term": {
                    "cat": 4
                }
            },
            "must_not": [
                {
                    "range": {
                        "views": {
                            "gte": 2000,
                            "lte": 8000
                        }
                    }
                }
            ]
        }
    }
}

返回结果：

{
    "took": 7,
    "timed_out": false,
    "_shards": {
        "total": 3,
        "successful": 3,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 7,
            "relation": "eq"
        },
        "max_score": 1.0,
        "hits": [
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "2047",
                "_score": 1.0,
                "_source": {
                    "author": 2,
                    "cat": 4,
                    "id": 2047,
                    "title": "虚拟机使用lvm管理新增磁盘",
                    "views": 8848
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "927",
                "_score": 1.0,
                "_source": {
                    "author": 2,
                    "cat": 4,
                    "id": 927,
                    "title": "为什么说编程是有史以来最好的工作",
                    "views": 1237
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "457",
                "_score": 1.0,
                "_source": {
                    "author": 2,
                    "cat": 4,
                    "id": 457,
                    "title": "Flex 布局语法教程",
                    "views": 1168
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "597",
                "_score": 1.0,
                "_source": {
                    "author": 2,
                    "cat": 4,
                    "id": 597,
                    "title": "从浏览器多进程到JS单线程，JS运行机制最全面的一次梳理",
                    "views": 8824
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "2028",
                "_score": 1.0,
                "_source": {
                    "author": 2,
                    "cat": 4,
                    "id": 2028,
                    "title": "Docker 入门教程03 使用容器工作",
                    "views": 9479
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "246",
                "_score": 1.0,
                "_source": {
                    "author": 2,
                    "cat": 4,
                    "id": 246,
                    "title": "Unicode与JavaScript详解",
                    "views": 42
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "1541",
                "_score": 1.0,
                "_source": {
                    "author": 2,
                    "cat": 4,
                    "id": 1541,
                    "title": " 你还在用 os.path？快来感受一下 pathlib 给你带来的便捷吧！ ",
                    "views": 9638
                }
            }
        ]
    }
}

删除

删除所有文档

请求路径：POST /索引名/_delete_by_query

请求体：

{
  "query": {
    "match_all": {}
  }
}

示例

查询标题包含python web的文档

请求路径：GET http://127.0.0.1:9903/articles/_search?_source=title,id,author,views,cat&size=5&q=title:python web

返回结果：

{
    "took": 11,
    "timed_out": false,
    "_shards": {
        "total": 3,
        "successful": 3,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 816,
            "relation": "eq"
        },
        "max_score": 4.800988,
        "hits": [
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "1385",
                "_score": 4.800988,
                "_source": {
                    "author": 13,
                    "cat": 2,
                    "id": 1385,
                    "title": " Python爬虫利器四之PhantomJS的用法 ",
                    "views": 7790
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "1392",
                "_score": 4.795393,
                "_source": {
                    "author": 20,
                    "cat": 2,
                    "id": 1392,
                    "title": " Python爬虫进阶一之爬虫框架概述 ",
                    "views": 5728
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "459",
                "_score": 4.630121,
                "_source": {
                    "author": 18,
                    "cat": 7,
                    "id": 459,
                    "title": "web前端规范",
                    "views": 1731
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "1329",
                "_score": 4.488963,
                "_source": {
                    "author": 19,
                    "cat": 3,
                    "id": 1329,
                    "title": " [Python3网络爬虫开发实战] 1.6-Web库的安装 ",
                    "views": 1492
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "342",
                "_score": 4.456541,
                "_source": {
                    "author": 5,
                    "cat": 7,
                    "id": 342,
                    "title": "浅谈大型web系统架构",
                    "views": 4317
                }
            }
        ]
    }
}

请求所有字段中包含python web的文档

请求路径：GET http://127.0.0.1:9903/articles/_search?_source=title,id,author,views,cat&size=5&q=_all:python web

返回结果：

{
    "took": 7,
    "timed_out": false,
    "_shards": {
        "total": 3,
        "successful": 3,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 576,
            "relation": "eq"
        },
        "max_score": 4.630121,
        "hits": [
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "459",
                "_score": 4.630121,
                "_source": {
                    "author": 18,
                    "cat": 7,
                    "id": 459,
                    "title": "web前端规范",
                    "views": 1731
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "342",
                "_score": 4.456541,
                "_source": {
                    "author": 5,
                    "cat": 7,
                    "id": 342,
                    "title": "浅谈大型web系统架构",
                    "views": 4317
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "812",
                "_score": 4.456541,
                "_source": {
                    "author": 1,
                    "cat": 3,
                    "id": 812,
                    "title": "想做web开发 就学JavaScript",
                    "views": 8885
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "410",
                "_score": 4.3550134,
                "_source": {
                    "author": 16,
                    "cat": 4,
                    "id": 410,
                    "title": "Web开发初学指南",
                    "views": 4300
                }
            },
            {
                "_index": "articles",
                "_type": "_doc",
                "_id": "578",
                "_score": 4.3550134,
                "_source": {
                    "author": 10,
                    "cat": 1,
                    "id": 578,
                    "title": "Web Worker 使用教程",
                    "views": 2814
                }
            }
        ]
    }
}

全文搜索标题包含python或web的文档，使用请求体的方式