Today another Systems Engineer contacted me because strange errors had been detected in the NetBackup application. The errors could be seen under Reports -> Problems (in the GUI).
All of them looked the same and were triggered every 3 mins:
could not process request from pmoinfr01.local.domain
get_string() failed - premature end of file encountered (5)
Here is the actual screenshot:
The name 'pmoinfr01' alone told me that my Nagios monitoring had to be involved. The fact that this happens every 3 minutes is another indicator, as my standard checks are launched every third minute. But which check is causing this?
Besides the standard operating system checks such as CPU, memory, disk, etc., I also check the typical NetBackup services and ports for availability.
The important ports to check on a NetBackup Master server are: 13701, 13720, 13721, 13782. These ports are checked by check_tcp.
After some tcpdumping and manual tests, I figured out that check_tcp on port 13720 is causing these critical errors in the logs (Reports). It seems that check_tcp submits data to this port that the target side can't handle, which then results in errors in the logs. So if you see such errors in your NetBackup application, deactivate the monitoring of port 13720 (or at least don't check it with check_tcp) and the spamming in your logs will stop.
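To narrow down which port triggers the log entries, a manual test along these lines could be used. This is only a minimal sketch and not what check_tcp does internally: it assumes that a bare TCP connect followed by an immediate close is enough to provoke the error, and the names probe_port and probe_all are hypothetical helpers made up for this example.

```python
import socket

# Ports named above as important on a NetBackup master server.
NETBACKUP_PORTS = [13701, 13720, 13721, 13782]


def probe_port(host, port, timeout=3.0):
    """Open a TCP connection and close it right away without sending data.

    Returns True if the connection was established, False otherwise.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution failures.
        return False


def probe_all(host, ports=NETBACKUP_PORTS, timeout=3.0):
    """Probe each port in turn and return a {port: reachable} mapping."""
    return {port: probe_port(host, port, timeout) for port in ports}
```

Calling probe_port("netbackup-master.example", 13720) once (the hostname is a placeholder) and then looking at Reports -> Problems would show whether a single probe of that port produces a new entry. Probing the ports one at a time instead of with probe_all makes it easier to match a port to a log line.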