Feature Request: Logs for successful deployments

fce · April 27, 2026, 1:00pm

Hi,

I would be curious to know about the deployments logs.
Are the only ones to have this need or maybe there are technical limitations that make it unfeasible ?
Thanks to the whole team in advance,

Context

Hosted mender portal. Deployments page.

Current behavior

Logs are visible only on failed deployments

Not on successful ones.

Feature Request

Allow the consultation of logs also on successful deployments ?

Use case

Troubleshooting.

in case of failed deployments, signs and hidden issues might have appeared on previous, otherwise successful deployments.
logs of successful deployments might still contain useful log lines coming from logging from within the user-supplied state scripts performing upgrade migration that might need developer investigation.

Thanks in advance for all information and opinions,

Have a nice day.

TheYoctoJester · April 28, 2026, 7:48pm

Hi @fce,

Thanks a lot for reaching out and your ideas! In fact I thought about along very similar lines a while back, and then came to some conclusions.

I agree that in some cases the logs of a successful deployment installation might hold interesting information
a successful installation (without relevant information) is understood as the default case, for the production case
when we’re talking about real product fleet sizes and deployment frequencies, like 10k devices and 4 deployments/year - then who is going to review 40k logs each year?

This brings us to the assumptions:

either this relevant information is present on all devices, because it is intrinsic to the deployment/software combination: then it would also show up on a single canary unit. Having one of those running in-house, and accessible to obtain such information, is what I would call a good engineering practise, and hence argue for having that.
or the relevant information is only present on a fraction, depending on unknown conditions. This is the tricky part. What is the information? As it ends up in the log, the precondition is that your logic already defines it as worth printing. Now here I would argue for the “fail fast, fail hard” logic. Instead of issuing “soft” warning to logs (which everybody will ignore), make every relevant information an error. This is obviously not a lot of fun, but in the end it forces you to make sure every case is properly covered, and therefore helps the overall quality of the product (and bonus, you get logs on the Mender server).

For corner cases there always is the Troubleshoot Add-On, but - as a preliminary analysis - my take is that a “good log server side storage” does eat storage but not yield noticeable information.

Greets,
Josef

PS: combining this, a possible line of thought might be: “for devices with the Troubleshoot Add-On, all logs are collected”. This reduces the scope to a manageable size - but then, on such a device the logs could inspected be anyways…

fce · May 7, 2026, 12:36pm

Hi Josef,

Thank you for your response,

Yes, I completely agree that the “fail fast, fail hard” (or “no news, good news.”) is the best approach for long term maintenance.

Yes, those 40k logs/year of success logs would be unjustified.

On assumption 1

Related to your first assumption, and just to exaustively explore the simple/pragmatic compromise space:
Would global (all-devices) rolling temporally-self-deleting successful logs be an option ?
(ex. “all successful update logs last 2 days”)
It would allow to investigate, and, in case of expiration, the updates could jet be re-triggered.
At the same time, in the back-end it would help keeping the storage-eating under relative control, while often offering implementation simplicity by having built-in direct support in cloud storage tooling (ex. aws s3 buckets that have x-days-self-delete policies).

Troubleshoot Add-On

I will need to think a bit more about the “Only on Troubleshoot Add-On” scenario, mostly in terms of UX, I am going a bit back and forth.
The deployment list does not show, currently, the installed plugins, so, the fact that successful logs are available only on certain devices and not on others would required the addition of a helper/visual justification, am I correct (same for API) ?

Thank you again in advance for all the information and opinions,

Have a great day,

TheYoctoJester · May 21, 2026, 2:35pm

Hi @fce,

on the Troubleshoot-Addon line of thinking, my considerations were more around using it to access the log. Essentially the log is just in journald, and if this is persisted on the device, then it can be inspected via the remote terminal. This could even be automated through the API.

Stepping back a bit, I think the line of thinking we should rather have is not necessarily the verbatim log, but a form of “deployment summary”. Such could include information like

how long did downloading take
were retries needed due to bad connectivity
did the user delay the installation
…

But this is actually a fully new feature then. Might be interesting to do a PoC here

Greetz,
Josef

Topic		Replies	Views
View logs through Mender web UI of successful deployments? General Discussions	3	1246	August 3, 2021
Reading logs for deployments? General Discussions	3	427	August 22, 2019
Cannot send inventory General Discussions	9	960	May 21, 2025
Get Deployment Log for Device General Discussions	0	332	April 1, 2022
Update Logs General Discussions	10	1528	August 5, 2020