https://www.elastic.co/downloads/elasticsearch
wget https://download.elasticsearch.org/elasticsearch/release/org/elasticsearch/distribution/tar/elasticsearch/2.0.0/elasticsearch-2.0.0.tar.gz
在项目根目录下新建 data 文件夹
#mkdir data
修改文件 elasticsearch.yml
#pwd
/data2/elasticsearch-2.0.0/config
[root@10 config]# vim elasticsearch.yml
修改成服务 ip 和 port
network.host: 10.13.1.139
http.port: 54321
在项目根目录下没有 plugins 文件夹,需通过安装插件新建该文件夹,
the core plugins can be installed as follows:
[root@10 bin]# pwd
/data2/elasticsearch-2.0.0/bin
[root@10 bin]# ./plugin install analysis-icu
-> Installing analysis-icu...
Plugins directory [/data2/elasticsearch-2.0.0/plugins] does not exist. Creating...
Trying https://download.elastic.co/elasticsearch/release/org/elasticsearch/plugin/analysis-icu/2.0.0/analysis-icu-2.0.0.zip ...
Downloading
非 Root 账号启动服务时,会新建文件
[root@10 elasticsearch-2.0.0]# chmod 777 *
切换到非 Root 用户,启动服务
[root@10 ~]# su - hugang
[hugang@10 ~]$ cd /data2/elasticsearch-2.0.0/bin
[hugang@10 bin]$ ./elasticsearch -d
验证服务
[hugang@10 bin]$ curl -X GET http://10.13.1.139:54321
{
"name" : "Madame Hydra",
"cluster_name" : "elasticsearch",
"version" : {
"number" : "2.0.0",
"build_hash" : "de54438d6af8f9340d50c5c786151783ce7d6be5",
"build_timestamp" : "2015-10-22T08:09:48Z",
"build_snapshot" : false,
"lucene_version" : "5.2.1"
},
"tagline" : "You Know, for Search"
}
Logstash:Collect, Enrich & Transport Data;Logstash 负责收集,丰富和传输 log 数据,Pipeline 由三部分组成:
在配置文件指定这三个元素,运行 Logstash 时,需在执行脚本 bin/logstash 加上该配置文件作为脚本参数。
运行环境要求:JDK minimum version 1.7.0_51
https://www.elastic.co/downloads/logstash
wget https://download.elastic.co/logstash/logstash/logstash-2.0.0.tar.gz
根据自己业务需求,新建 Pipeline 配置文件,文件名自定义,但是内容必须含有 input, filter, output 三元素。
collectlogtoredis.conf
input {
file {
path => "/data1/weibo8074/logs/exposure.log"
}
}
filter {
grok {
match => { "message" => "(?<uida>[0-9]{10}) (?<uidb>[0-9]{10}) (?<idList>([0-9]{16},){1,}[0-9]{16})" }
}
}
output {
redis { host => "10.13.1.139" port => 6379 data_type => "list" key => "logstash:collect:exposurelog" }
}
input: 指定 Logstash 收集数据的来源,支持多种来源,比如:file, stdin, syslog, elasticsearch 等,具体可参照:https://www.elastic.co/guide/en/logstash/current/input-plugins.html
filter: 提取有效信息的规则,支持如下:grok, json, xml, csv 等,具体可参照: https://www.elastic.co/guide/en/logstash/current/filter-plugins.html
output: 将数据发送到特定目的地,支持如下:csv, kafka, syslog, stdout, redis 等,具体可参照: https://www.elastic.co/guide/en/logstash/current/output-plugins.html
collectlogtoredis.conf 分析
file {
path => ...
}
path 为配置选项(必填), 类型为 array, 可以如下定义, 支持通配符:
path => [ "/var/log/messages", "/var/log/*.log" ]
path => "/data/mysql/mysql.log"
stat_interval 配置选项:How often (in seconds) we stat files to see if they have been modified. 默认 1s.
match 为配置选项,Value type is hash, Default value is {};A hash of matches of field ⇒ value
For example:
filter {
grok { match => { "message" => "Duration: %{NUMBER:duration}" } }
}
If you need to match multiple patterns against a single field, the value can be an array of patterns
filter {
grok { match => { "message" => [ "Duration: %{NUMBER:duration}", "Speed: %{NUMBER:speed}" ] } }
}
执行 bin/logstash agent -f collectlogtoredis.conf
https://www.elastic.co/downloads/kibana
wget https://download.elastic.co/kibana/kibana/kibana-4.2.1-linux-x64.tar.gz
在 config/kibana.yml 修改成你本地 elasticsearch 服务地址
elasticsearch.url: "http://10.13.1.139:54321"
./bin/kibana
访问 kibana 服务:
ip:5601
具体配置请参照:https://www.elastic.co/webinars/getting-started-with-kibana?baymax=rtp&elektra=blog&iesrc=ctr