分享

ganglia gmond sent message 'heartbeat' of length 52 with 1 errors

Joker 发表于 2016-3-28 17:01:10 [显示全部楼层] 回帖奖励 阅读模式 关闭右栏 5 13056
在部署完ganglia的时候,访问界面发现ganglia没有客户端连接上,
gomnd配置如下:[mw_shl_code=html,true]lobals {
  daemonize = yes
  setuid = yes
  user = xxx
  debug_level = 10
  max_udp_msg_len = 1472
  mute = no
  deaf = no
  allow_extra_data = yes
  host_dmax = 86400 /*secs. Expires (removes from web interface) hosts in 1 day */
  host_tmax = 20 /*secs */
  cleanup_threshold = 300 /*secs */
  gexec = no
  # By default gmond will use reverse DNS resolution when displaying your hostname
  # Uncommeting following value will override that value.
  # override_hostname = "mywebserver.domain.com"
  # If you are not using multicast this value should be set to something other than 0.
  # Otherwise if you restart aggregator gmond you will get empty graphs. 60 seconds is reasonable
  send_metadata_interval = 0 /*secs */

}


cluster {
  name = "hadoop"
  owner = "xxx"
  latlong = "unspecified"
  url = "unspecified"
}


host {
  location = "vmlin0427"
}

/* Feel free to specify as many udp_send_channels as you like.  Gmond
   used to only support having a single channel */
udp_send_channel {
  #bind_hostname = yes # Highly recommended, soon to be default.
                       # This option tells gmond to use a source address
                       # that resolves to the machine's hostname.  Without
                       # this, the metrics may appear to come from any
                       # interface and the DNS names associated with
                       # those IPs will be used to create the RRDs.
  #mcast_join = 239.2.11.71
  host = xx.xx.xx.xxx #我hadoop主节点
  port = 8653
  ttl = 1
}


/* You can specify as many udp_recv_channels as you like as well. */
udp_recv_channel {
  #mcast_join = 239.2.11.71
  port = 8653
  #bind = 239.2.11.71
  #retry_bind = true
  # Size of the UDP buffer. If you are handling lots of metrics you really
  # should bump it up to e.g. 10MB or even higher.
  # buffer = 10485760
}

/* You can specify as many tcp_accept_channels as you like to share
   an xml description of the state of the cluster */
tcp_accept_channel {
  port = 8653
  # If you want to gzip XML output
  gzip_output = no
}[/mw_shl_code]

gmond的日志如下  [mw_shl_code=html,true]        metric 'cpu_user' being collected now
[tcp] Starting TCP listener thread...
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        metric 'proc_run' being collected now
        metric 'proc_run' has value_threshold 1.000000
        metric 'proc_total' being collected now
        metric 'proc_total' has value_threshold 1.000000
        metric 'mem_free' being collected now
        metric 'mem_free' has value_threshold 1024.000000
        metric 'mem_shared' being collected now
        metric 'mem_shared' has value_threshold 1024.000000
        metric 'mem_buffers' being collected now
        metric 'mem_buffers' has value_threshold 1024.000000
        metric 'mem_cached' being collected now
        metric 'mem_cached' has value_threshold 1024.000000
        metric 'swap_free' being collected now
        metric 'swap_free' has value_threshold 1024.000000
        metric 'bytes_out' being collected now
********** bytes_out:  125.396408
        metric 'bytes_out' has value_threshold 4096.000000
        metric 'bytes_in' being collected now
********** bytes_in:  114.593567
        metric 'bytes_in' has value_threshold 4096.000000
        metric 'pkts_in' being collected now
********** pkts_in:  1.465350
        metric 'pkts_in' has value_threshold 256.000000
        metric 'pkts_out' being collected now
********** pkts_out:  1.487917
        metric 'pkts_out' has value_threshold 256.000000
        metric 'disk_total' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'disk_total' has value_threshold 1.000000
        metric 'disk_free' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'disk_free' has value_threshold 1.000000
        metric 'part_max_used' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'part_max_used' has value_threshold 1.000000
        sending metadata for metric: heartbeat
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: heartbeat

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'heartbeat' of length 52 with 0 errors
        sending metadata for metric: cpu_num
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_num

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_num' of length 48 with 0 errors
        sending metadata for metric: cpu_speed
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_speed

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_speed' of length 52 with 0 errors
        sending metadata for metric: mem_total
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: mem_total

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'mem_total' of length 52 with 0 errors
        sending metadata for metric: swap_total
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: swap_total

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'swap_total' of length 52 with 0 errors
        sending metadata for metric: boottime
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: boottime

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'boottime' of length 48 with 0 errors
        sending metadata for metric: machine_type
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: machine_type

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'machine_type' of length 60 with 0 errors
        sending metadata for metric: os_name
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: os_name

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'os_name' of length 56 with 0 errors
        sending metadata for metric: os_release
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: os_release

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'os_release' of length 76 with 0 errors
        sending metadata for metric: location
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: location

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'location' of length 60 with 0 errors
        sending metadata for metric: gexec
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: gexec

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'gexec' of length 52 with 0 errors
        sending metadata for metric: cpu_user
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_user

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_user' of length 48 with 0 errors
        sending metadata for metric: cpu_system
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_system

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_system' of length 52 with 0 errors
        sending metadata for metric: cpu_idle
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_idle

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_idle' of length 48 with 0 errors
        sending metadata for metric: cpu_nice
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_nice

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_nice' of length 48 with 0 errors
        sending metadata for metric: cpu_aidle
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_aidle

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_aidle' of length 52 with 0 errors
        sending metadata for metric: cpu_wio
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_wio

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_wio' of length 48 with 0 errors
        sending metadata for metric: cpu_steal
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: cpu_steal

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'cpu_steal' of length 52 with 0 errors
        sending metadata for metric: load_one
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: load_one

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'load_one' of length 48 with 0 errors
        sending metadata for metric: load_five
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: load_five

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'load_five' of length 52 with 0 errors
        sending metadata for metric: load_fifteen
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: load_fifteen

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'load_fifteen' of length 52 with 0 errors
        sending metadata for metric: proc_run
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: proc_run

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'proc_run' of length 48 with 0 errors
        sending metadata for metric: proc_total
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: proc_total

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'proc_total' of length 52 with 0 errors
        sending metadata for metric: mem_free
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: mem_free

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'mem_free' of length 48 with 0 errors
        sending metadata for metric: mem_shared
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: mem_shared

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'mem_shared' of length 52 with 0 errors
        sending metadata for metric: mem_buffers
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: mem_buffers

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'mem_buffers' of length 52 with 0 errors
        sending metadata for metric: mem_cached
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: mem_cached

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'mem_cached' of length 52 with 0 errors
        sending metadata for metric: swap_free
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: swap_free

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'swap_free' of length 52 with 0 errors
        sending metadata for metric: bytes_out
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: bytes_out

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'bytes_out' of length 52 with 0 errors
        sending metadata for metric: bytes_in
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: bytes_in

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'bytes_in' of length 48 with 0 errors
        sending metadata for metric: pkts_in
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: pkts_in

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'pkts_in' of length 48 with 0 errors
        sending metadata for metric: pkts_out
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: pkts_out

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'pkts_out' of length 48 with 0 errors
        sending metadata for metric: disk_total
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: disk_total

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'disk_total' of length 56 with 0 errors
        sending metadata for metric: disk_free
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: disk_free

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'disk_free' of length 56 with 0 errors
        sending metadata for metric: part_max_used
  msg.Ganglia_metadata_msg_u.gfull.metric_id.name: part_max_used

  msg.Ganglia_metadata_msg_u.gfull.metric_id.host: vmlin0427

        sent message 'part_max_used' of length 56 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'mem_free' being collected now
        metric 'mem_free' has value_threshold 1024.000000
        metric 'mem_shared' being collected now
        metric 'mem_shared' has value_threshold 1024.000000
        metric 'mem_buffers' being collected now
        metric 'mem_buffers' has value_threshold 1024.000000
        metric 'mem_cached' being collected now
        metric 'mem_cached' has value_threshold 1024.000000
        metric 'swap_free' being collected now
        metric 'swap_free' has value_threshold 1024.000000
        metric 'bytes_out' being collected now
********** bytes_out:  4818.360840
        metric 'bytes_out' has value_threshold 4096.000000
        metric 'bytes_in' being collected now
********** bytes_in:  887.479736
        metric 'bytes_in' has value_threshold 4096.000000
        metric 'pkts_in' being collected now
********** pkts_in:  5.571890
        metric 'pkts_in' has value_threshold 256.000000
        metric 'pkts_out' being collected now
********** pkts_out:  25.011042
        metric 'pkts_out' has value_threshold 256.000000
        metric 'disk_free' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'disk_free' has value_threshold 1.000000
        metric 'part_max_used' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'part_max_used' has value_threshold 1.000000
        sent message 'bytes_out' of length 52 with 1 errors
        sent message 'bytes_in' of length 48 with 0 errors
        sent message 'pkts_in' of length 48 with 0 errors
        sent message 'pkts_out' of length 48 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'proc_run' being collected now
        metric 'proc_run' has value_threshold 1.000000
        metric 'proc_total' being collected now
        metric 'proc_total' has value_threshold 1.000000
        sent message 'proc_run' of length 48 with 1 errors
        sent message 'proc_total' of length 52 with 0 errors
        metric 'mem_free' being collected now
        metric 'mem_free' has value_threshold 1024.000000
        metric 'mem_shared' being collected now
        metric 'mem_shared' has value_threshold 1024.000000
        metric 'mem_buffers' being collected now
        metric 'mem_buffers' has value_threshold 1024.000000
        metric 'mem_cached' being collected now
        metric 'mem_cached' has value_threshold 1024.000000
        metric 'swap_free' being collected now
        metric 'swap_free' has value_threshold 1024.000000
        metric 'bytes_out' being collected now
********** bytes_out:  4335.299805
        metric 'bytes_out' has value_threshold 4096.000000
        metric 'bytes_in' being collected now
********** bytes_in:  870.299927
        metric 'bytes_in' has value_threshold 4096.000000
        metric 'pkts_in' being collected now
********** pkts_in:  5.574999
        metric 'pkts_in' has value_threshold 256.000000
        metric 'pkts_out' being collected now
********** pkts_out:  23.274998
        metric 'pkts_out' has value_threshold 256.000000
        metric 'disk_free' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'disk_free' has value_threshold 1.000000
        metric 'part_max_used' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'part_max_used' has value_threshold 1.000000
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        sent message 'cpu_user' of length 48 with 0 errors
        sent message 'cpu_system' of length 52 with 0 errors
        sent message 'cpu_idle' of length 48 with 0 errors
        sent message 'cpu_nice' of length 48 with 0 errors
        sent message 'cpu_aidle' of length 52 with 0 errors
        sent message 'cpu_wio' of length 48 with 0 errors
        sent message 'cpu_steal' of length 52 with 0 errors
        sent message 'load_one' of length 48 with 0 errors
        sent message 'load_five' of length 52 with 0 errors
        sent message 'load_fifteen' of length 52 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'mem_free' being collected now
        metric 'mem_free' has value_threshold 1024.000000
        metric 'mem_shared' being collected now
        metric 'mem_shared' has value_threshold 1024.000000
        metric 'mem_buffers' being collected now
        metric 'mem_buffers' has value_threshold 1024.000000
        metric 'mem_cached' being collected now
        metric 'mem_cached' has value_threshold 1024.000000
        metric 'swap_free' being collected now
        metric 'swap_free' has value_threshold 1024.000000
        metric 'bytes_out' being collected now
********** bytes_out:  4339.208984
        metric 'bytes_out' has value_threshold 4096.000000
        metric 'bytes_in' being collected now
********** bytes_in:  869.526367
        metric 'bytes_in' has value_threshold 4096.000000
        metric 'pkts_in' being collected now
********** pkts_in:  5.623715
        metric 'pkts_in' has value_threshold 256.000000
        metric 'pkts_out' being collected now
********** pkts_out:  23.369661
        metric 'pkts_out' has value_threshold 256.000000
        metric 'disk_free' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'disk_free' has value_threshold 1.000000
        metric 'part_max_used' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'part_max_used' has value_threshold 1.000000
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'proc_run' being collected now
        metric 'proc_run' has value_threshold 1.000000
        metric 'proc_total' being collected now
        metric 'proc_total' has value_threshold 1.000000
        metric 'mem_free' being collected now
        metric 'mem_free' has value_threshold 1024.000000
        metric 'mem_shared' being collected now
        metric 'mem_shared' has value_threshold 1024.000000
        metric 'mem_buffers' being collected now
        metric 'mem_buffers' has value_threshold 1024.000000
        metric 'mem_cached' being collected now
        metric 'mem_cached' has value_threshold 1024.000000
        metric 'swap_free' being collected now
        metric 'swap_free' has value_threshold 1024.000000
        metric 'bytes_out' being collected now
********** bytes_out:  4313.359375
        metric 'bytes_out' has value_threshold 4096.000000
        metric 'bytes_in' being collected now
********** bytes_in:  960.271423
        metric 'bytes_in' has value_threshold 4096.000000
        metric 'pkts_in' being collected now
********** pkts_in:  6.474639
        metric 'pkts_in' has value_threshold 256.000000
        metric 'pkts_out' being collected now
********** pkts_out:  23.148708
        metric 'pkts_out' has value_threshold 256.000000
        metric 'disk_free' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'disk_free' has value_threshold 1.000000
        metric 'part_max_used' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'part_max_used' has value_threshold 1.000000
        sent message 'proc_run' of length 48 with 1 errors
        sent message 'proc_total' of length 52 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        sent message 'cpu_user' of length 48 with 1 errors
        sent message 'cpu_system' of length 52 with 0 errors
        sent message 'cpu_idle' of length 48 with 0 errors
        sent message 'cpu_nice' of length 48 with 0 errors
        sent message 'cpu_aidle' of length 52 with 0 errors
        sent message 'cpu_wio' of length 48 with 0 errors
        sent message 'cpu_steal' of length 52 with 0 errors
        sent message 'load_one' of length 48 with 0 errors
        sent message 'load_five' of length 52 with 0 errors
        sent message 'load_fifteen' of length 52 with 0 errors
        sent message 'mem_free' of length 48 with 0 errors
        sent message 'mem_shared' of length 52 with 0 errors
        sent message 'mem_buffers' of length 52 with 1 errors
        sent message 'mem_cached' of length 52 with 1 errors
        sent message 'swap_free' of length 52 with 0 errors
        sent message 'disk_free' of length 56 with 0 errors
        sent message 'part_max_used' of length 56 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'mem_free' being collected now
        metric 'mem_free' has value_threshold 1024.000000
        metric 'mem_shared' being collected now
        metric 'mem_shared' has value_threshold 1024.000000
        metric 'mem_buffers' being collected now
        metric 'mem_buffers' has value_threshold 1024.000000
        metric 'mem_cached' being collected now
        metric 'mem_cached' has value_threshold 1024.000000
        metric 'swap_free' being collected now
        metric 'swap_free' has value_threshold 1024.000000
        metric 'bytes_out' being collected now
********** bytes_out:  4362.875000
        metric 'bytes_out' has value_threshold 4096.000000
        metric 'bytes_in' being collected now
********** bytes_in:  862.837891
        metric 'bytes_in' has value_threshold 4096.000000
        metric 'pkts_in' being collected now
********** pkts_in:  5.373368
        metric 'pkts_in' has value_threshold 256.000000
        metric 'pkts_out' being collected now
********** pkts_out:  23.467873
        metric 'pkts_out' has value_threshold 256.000000
        metric 'disk_free' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'disk_free' has value_threshold 1.000000
        metric 'part_max_used' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'part_max_used' has value_threshold 1.000000
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'proc_run' being collected now
        metric 'proc_run' has value_threshold 1.000000
        metric 'proc_total' being collected now
        metric 'proc_total' has value_threshold 1.000000
        metric 'mem_free' being collected now
        metric 'mem_free' has value_threshold 1024.000000
        metric 'mem_shared' being collected now
        metric 'mem_shared' has value_threshold 1024.000000
        metric 'mem_buffers' being collected now
        metric 'mem_buffers' has value_threshold 1024.000000
        metric 'mem_cached' being collected now
        metric 'mem_cached' has value_threshold 1024.000000
        metric 'swap_free' being collected now
        metric 'swap_free' has value_threshold 1024.000000
        metric 'bytes_out' being collected now
********** bytes_out:  4307.301270
        metric 'bytes_out' has value_threshold 4096.000000
        metric 'bytes_in' being collected now
********** bytes_in:  875.875244
        metric 'bytes_in' has value_threshold 4096.000000
        metric 'pkts_in' being collected now
********** pkts_in:  5.525002
        metric 'pkts_in' has value_threshold 256.000000
        metric 'pkts_out' being collected now
********** pkts_out:  23.125006
        metric 'pkts_out' has value_threshold 256.000000
        metric 'disk_free' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'disk_free' has value_threshold 1.000000
        metric 'part_max_used' being collected now
Counting device /dev/mapper/VolGroup-lv_root (20.34 %)
Counting device /dev/sda1 (11.62 %)
Counting device /dev/mapper/VolGroup-LogVol02 (5.47 %)
For all disks: 116.766 GB total, 108.774 GB free for users.
        metric 'part_max_used' has value_threshold 1.000000
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 52 with 1 errors
        sent message 'heartbeat' of length 52 with 0 errors
        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'cpu_steal' being collected now
        metric 'cpu_steal' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'cpu_user' of length 48 with 1 errors
        sent message 'cpu_system' of length 52 with 0 errors
        sent message 'cpu_idle' of length 48 with 0 errors
        sent message 'cpu_nice' of length 48 with 0 errors
        sent message 'cpu_aidle' of length 52 with 0 errors
        sent message 'cpu_wio' of length 48 with 0 errors
        sent message 'cpu_steal' of length 52 with 0 errors
        sent message 'load_one' of length 48 with 0 errors
        sent message 'load_five' of length 52 with 0 errors
        sent message 'load_fifteen' of length 52 with 0 errors
        sent message 'heartbeat' of length 52 with 1 errors[/mw_shl_code]




已有(5)人评论

跳转到指定楼层
Joker 发表于 2016-3-28 17:04:31
gmetad的datasource如下
data_source "hadoop" xx.xx.xx.xx:8653
回复

使用道具 举报

bioger_hit 发表于 2016-3-28 17:50:48
Joker 发表于 2016-3-28 17:04
gmetad的datasource如下
data_source "hadoop" xx.xx.xx.xx:8653
没看到错误日志
回复

使用道具 举报

Joker 发表于 2016-3-28 18:01:04

gmond 日志
[mw_shl_code=html,true]Mar 28 15:19:22 vmlin0427 /usr/local/ganglia/sbin/gmond[12637]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 15:19:22 vmlin0427 /usr/local/ganglia/sbin/gmond[12637]: Module python_module failed to initialize.#012
Mar 28 15:26:05 vmlin0427 /usr/local/ganglia/sbin/gmond[12686]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 15:26:05 vmlin0427 /usr/local/ganglia/sbin/gmond[12686]: Module python_module failed to initialize.#012
Mar 28 15:26:05 vmlin0427 /usr/local/ganglia/sbin/gmond[12686]: Error 1 sending the modular data for gexec#012
Mar 28 15:44:39 vmlin0427 /usr/local/ganglia/sbin/gmond[12812]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 15:44:39 vmlin0427 /usr/local/ganglia/sbin/gmond[12812]: Module python_module failed to initialize.#012
Mar 28 15:56:23 vmlin0427 /usr/local/ganglia/sbin/gmond[12879]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 15:56:23 vmlin0427 /usr/local/ganglia/sbin/gmond[12879]: Module python_module failed to initialize.#012
Mar 28 15:58:32 vmlin0427 /usr/local/ganglia/sbin/gmond[12922]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 15:58:32 vmlin0427 /usr/local/ganglia/sbin/gmond[12922]: Module python_module failed to initialize.#012
Mar 28 15:58:32 vmlin0427 /usr/local/ganglia/sbin/gmond[12922]: Error creating UDP server on port 8653 bind=10.96.140.112.  Will try again...#012
Mar 28 15:59:01 vmlin0427 /usr/local/ganglia/sbin/gmond[12935]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 15:59:01 vmlin0427 /usr/local/ganglia/sbin/gmond[12935]: Module python_module failed to initialize.#012
Mar 28 15:59:01 vmlin0427 /usr/local/ganglia/sbin/gmond[12935]: Error creating UDP server on port 8653 bind=10.96.140.112.  Will try again...#012
Mar 28 16:30:40 vmlin0427 /usr/local/ganglia/sbin/gmond[13110]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 16:30:40 vmlin0427 /usr/local/ganglia/sbin/gmond[13110]: Module python_module failed to initialize.#012
Mar 28 16:44:38 vmlin0427 /usr/local/ganglia/sbin/gmond[13227]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 16:44:38 vmlin0427 /usr/local/ganglia/sbin/gmond[13227]: Module python_module failed to initialize.#012
Mar 28 16:51:59 vmlin0427 /usr/local/ganglia/sbin/gmond[13265]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 16:51:59 vmlin0427 /usr/local/ganglia/sbin/gmond[13265]: Module python_module failed to initialize.#012
Mar 28 17:12:23 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 17:12:23 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: Module python_module failed to initialize.#012
Mar 28 17:12:23 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: Error 1 sending the modular data for proc_total#012
Mar 28 17:12:23 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: Error 1 sending the modular data for mem_free#012
Mar 28 17:13:03 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: Error 1 sending the modular data for mem_free#012
Mar 28 17:13:43 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: Error 1 sending the modular data for mem_free#012
Mar 28 17:14:23 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: Error 1 sending the modular data for mem_free#012
Mar 28 17:15:03 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: Error 1 sending the modular data for mem_free#012
Mar 28 17:15:43 vmlin0427 /usr/local/ganglia/sbin/gmond[13401]: Error 1 sending the modular data for mem_free#012
Mar 28 17:26:31 vmlin0427 /usr/local/ganglia/sbin/gmond[13462]: [PYTHON] Can't open the python module path /usr/local/ganglia/lib64/ganglia/python_modules.#012
Mar 28 17:26:31 vmlin0427 /usr/local/ganglia/sbin/gmond[13462]: Module python_module failed to initialize.#012[/mw_shl_code]



回复

使用道具 举报

wscl1213 发表于 2016-3-28 18:16:48
rpm -qa

apr是哪个版本的

apr-1.3.3-1
还是apr-1.3.3-2

最好尝试新版本  

回复

使用道具 举报

Joker 发表于 2016-3-29 11:02:53
wscl1213 发表于 2016-3-28 18:16
rpm -qa

apr是哪个版本的

如下:[mw_shl_code=html,true]apr-1.3.9-5.el6_2.x86_64
apr-util-1.3.9-3.el6_0.1.x86_64
apr-util-ldap-1.3.9-3.el6_0.1.x86_64
apr-devel-1.3.9-5.el6_2.x86_64
[/mw_shl_code]
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

关闭

推荐上一条 /2 下一条