WEBCAST: Mender 3.1 introduces Monitor - Detect and analyze health issues with the new Mender add-on

Valeria · October 18, 2021, 12:39pm

Mender 3.1 introduces a new add-on offering: Mender Monitor. It contains a set of features to help ensure that all your devices and services are healthy.

Real-time monitoring of your device’s resources and state, combined with an anomaly detection engine gives you instant visibility into potential problems with your device fleet, so you can investigate before your customer would even notice. This ensures higher uptime and eliminates lengthy and costly support incidents.

Unlike classic cloud-monitoring solutions, Mender Monitor is built for IoT. This means that it uses edge processing, making it a fully distributed solution: the device creates and responds to alerts locally. When connectivity to the cloud is available, the alerts are synchronized to maintain full visibility. This makes it possible to instantly act on high severity alerts, such as overheating, without waiting for connectivity to be restored.

Sign up for this webcast to see a demo of the newest features live as well as receive the recording straight to your email inbox.

bender · November 4, 2021, 7:41am

Very interesting (I just got to watch it)…
We may be interested in monitoring custom conditions not handled by dbus/logs/service as demonstrated in the presentation so a custom script solution would be needed.
Is this planned to be implemented and how far away do you think it is?

btw in Pricing and plans | Mender
D-Bus signal monitoring
appears twice

Thanks

peter · November 4, 2021, 9:52am

Hello @bender and thank for your interest in the Mender Monitor add-on.

I have to know more, what is the exact use case? What does the custom script you mentioned return? How do you run it? How do you know that it is in a “ok state” and how that it is in “critical state” (requiring an alert).

best regards,
peter

bender · November 4, 2021, 11:49am

We have a script as part of a service that monitors a serial connection for measurements. There is a specific pattern that can be detected that suggests an external system has failed and we need to take action (call someone local).

peter · November 4, 2021, 11:57am

I understand. Assuming your script monitors the serial persistently, and we are looking for CRITICAL_ERROR pattern in its output, you could try to configure the Mender Monitor add-on check like that:

# mender-monitorctl create log serialconn "CRITICAL_ERROR" "@/usr/local/serial/bin/monitor" 1h
# mender-monitorctl enable log serialconn 
# systemctl restart mender-monitor

where /usr/local/serial/bin/monitor stands for the path to your script, and 1h means that if the CRITICAL_ERROR pattern has not appeared for past hour, the alert will be cancelled (device will appear as healthy).

peter

bender · November 4, 2021, 12:11pm

which output should we send CRITICAL_ERROR to?

peter · November 4, 2021, 9:08pm

standard output will do, i.e.: let the script just print it to stdout.

peter

bender · November 5, 2021, 6:18am

Thank you Peter!
Let us have a go and evaluate the solution

peter · November 5, 2021, 9:01am

do not forget to let me know how it goes! I am super curious, as your use case is very interesting.

pg

drewmoseley · November 16, 2021, 2:49pm

@peter is this form of mender-monitorctl usage documented somewhere? I need to periodically scan for specific log messages is dmesg and this seems like a good usage for this but I can’t find the docs.

Drew

Topic		Replies	Views
Mender-monitor does not work General Discussions	12	853	March 8, 2022
Mender 3.1 release: Monitor using edge processing \| Mender Announcements	1	378	September 29, 2021
Getting started - log monitoring Mender Monitor	0	471	July 5, 2023
How to monitor a crashing service/app/container Tutorials	0	2219	December 20, 2021
Mender-monitor reports disconnected after OTA General Discussions	5	306	May 17, 2022

WEBCAST: Mender 3.1 introduces Monitor - Detect and analyze health issues with the new Mender add-on

Related topics