What tools are you using to collect system metrics? I am asking about IO throughput, temperature, disk space used, etc. I want to have the pretty graphs too.
Hi Dan,
I use two tools.
First tool Observium is SNMP based. I am in the process of migrating from
TurnKey Observium to its fork LibreNMS which is recently ported to Open and FreeBSD. Naturally since I am primarily OpenBSD user I will try to run LibreNMS on that platform first. The second tool is
collectd uses its own daemon. Observium has a plug-in to display collectd graphs. I will give you temporary read access to the part of my Lab so that see real live demo. Credentials are sent via PM.
Some comments which you can read at your leisure.
Observium started as a simple PHP interface for a SNMP walk through tool to collect metrics from servers, routers, switches, UPSs, and PDUs. At this time I am not monitoring my switches, UPSs, and PDUs so you have not seeing some of the best of what Observium has to offer. SNMP is the Observium biggest asset but also it weakest point. SNMP is pooling protocol. It is very difficult to pool devices behind firewalls/VPNs if you don't have a proxy. There is no proxy for Observium.
When you open Observium and click on one of the servers one of the tabs will be collectd.
Collectd works as a push mechanism which makes it very suitable for monitoring servers behind the firewalls and on VPNs. You install client which is trivial to configure a (takes about a minute). The biggest weakness of collectd is lack of decent front end. Actually IMHO Observium is the best front end for collectd. Most of what you see here
https://collectd.org/wiki/index.php/List_of_front-ends
Are half baked tools Collect-web (Perl based) being the best. I really like collectd for metric.
Observium can display log files. Since I am using turn-key Observium based on Debian it uses rsyslog as a backend. That causes troubles with OpenBSD syslog and even more with FreeBSD native syslog (BTW native OpenBSD syslog is capable of TLS and many other things and it is light years ahead of FreeBSD native syslog, situation is the same with sensorsd OpenBSD native sensoring framework, also OpenBSD native snmp shines vs bsd-snmp (from FreeBSD) vs net-snmp, OpenBSD also has native daemon for monitoring UPSs). However Observium supports syslog-ng. I came to the conclusion that log files should be displayed by a specialize data mining tool. I am in the process of putting syslog-ng + ELK OpenBSD server.
Observium used to have plugin for NfSen but it is luckily unmentioned. I hate those kind kitchen sink tools. Between I really like
NfSen for netflow monitoring.
jrm mentioned MySQL as something he dislikes about Observium. I agree. I add few more things. I dislike that it is difficult to use custom MIBs. PF at least on OpenBSD has magnificent MIBs for SNMP protocol but it is not possible to walk through from Observium. Observium people insist on using Debian or Ubuntu. They don't even officially support Red Hat.
Recently Observium got multimillion dollar grant. They will be using money to add more to that kitchen sink. Namely they want to add early alerting system (e-mail notification) and daemon correction which I already have. I use Monit and its centralized server M/Monit in particular.
I am really happy about LibreNMS fork of Observium and it has been long overdue. The whole fork thing reminds me of Foswiki fork of TWiki.
There is a native OpenBSD monitoring infrastructure symon
http://wpd.home.xs4all.nl/symon/
It is showing little bit age but it is usable.
I am also very familiar with M/Monit, Nagios, Munin, Ganglia, Cacti, and Zabbix. I could probably write a small article about monitoring or login tools for that mater.
Obviously on the above list only Munin and Cacti are strictly speaking metric monitoring.
jrm is gaga about Ganglia which I used to have in the Lab while we had a ROCKS cluster. The cluster has been decommissioned and Ganglia with it. However I might be getting new 42U node cluster and it will be monitored with Ganglia.