Flume监控之Ganglia
原创
©著作权归作者所有:来自51CTO博客作者年轻即出发的原创作品,请联系作者获取转载授权,否则将追究法律责任
1 Ganglia的安装与部署
1) 安装httpd服务与php
# yum -y install httpd php |
2) 安装其他依赖
# yum -y install rrdtool perl-rrdtool rrdtool-devel # yum -y install apr-devel |
3) 安装ganglia
# rpm -Uvh http://dl.fedoraproject.org/pub/epel/6/x86_64/epel-release-6-8.noarch.rpm # yum -y install ganglia-gmetad # yum -y install ganglia-web # yum install -y ganglia-gmond |
4) 修改配置文件
文件ganglia.conf:
# vi /etc/httpd/conf.d/ganglia.conf 修改为: # # Ganglia monitoring system php web frontend # Alias /ganglia /usr/share/ganglia <Location /ganglia> Order deny,allow Deny from all Allow from all # Allow from 127.0.0.1 # Allow from ::1 # Allow from .example.com </Location> |
文件gmetad.conf:
# vi /etc/ganglia/gmetad.conf 修改为: data_source "linux" 192.168.216.20 |
文件gmond.conf:
# vi /etc/ganglia/gmond.conf 修改为: cluster { name = "linux" owner = "unspecified" latlong = "unspecified" url = "unspecified" } udp_send_channel { #bind_hostname = yes # Highly recommended, soon to be default. # This option tells gmond to use a source address # that resolves to the machine's hostname. Without # this, the metrics may appear to come from any # interface and the DNS names associated with # those IPs will be used to create the RRDs. # mcast_join = 239.2.11.71 host = 192.168.216.20 port = 8649 ttl = 1 } udp_recv_channel { # mcast_join = 239.2.11.71 port = 8649 bind = 192.168.216.20 retry_bind = true # Size of the UDP buffer. If you are handling lots of metrics you really # should bump it up to e.g. 10MB or even higher. # buffer = 10485760 } |
文件config:
# vi /etc/selinux/config 修改为: # This file controls the state of SELinux on the system. # SELINUX= can take one of these three values: # enforcing - SELinux security policy is enforced. # permissive - SELinux prints warnings instead of enforcing. # disabled - No SELinux policy is loaded. SELINUX=disabled # SELINUXTYPE= can take one of these two values: # targeted - Targeted processes are protected, # mls - Multi Level Security protection. SELINUXTYPE=targeted |
尖叫提示:selinux本次生效关闭必须重启,如果此时不想重启,可以临时生效之:
5) 启动ganglia
$ sudo service httpd start $ sudo service gmetad start $ sudo service gmond start |
6) 打开网页浏览ganglia页面
尖叫提示:如果完成以上操作依然出现权限不足错误,请修改/var/lib/ganglia目录的权限:
$ sudo chmod -R 777 /var/lib/ganglia |
2 操作Flume测试监控
1) 修改flume-env.sh配置:
JAVA_OPTS="-Dflume.monitoring.type=ganglia -Dflume.monitoring.hosts=192.168.216.20:8649 -Xms100m -Xmx200m" |
2) 启动flume任务
$ bin/flume-ng agent \ --conf conf/ \ --name a1 \ --conf-file job/group-job0/flume-telnet.conf \ -Dflume.root.logger==INFO,console \ -Dflume.monitoring.type=ganglia \ -Dflume.monitoring.hosts=192.168.216.20:8649 |
3) 发送数据观察ganglia监测图
样式如图:
图例说明:
字段(图表名称) | 字段含义 |
EventPutAttemptCount | source尝试写入channel的事件总数量 |
EventPutSuccessCount | 成功写入channel且提交的事件总数量 |
EventTakeAttemptCount | sink尝试从channel拉取事件的总数量。这不意味着每次事件都被返回,因为sink拉取的时候channel可能没有任何数据。 |
EventTakeSuccessCount | sink成功读取的事件的总数量 |
StartTime | channel启动的时间(毫秒) |
StopTime | channel停止的时间(毫秒) |
ChannelSize | 目前channel中事件的总数量 |
ChannelFillPercentage | channel占用百分比 |
ChannelCapacity | channel的容量 |