I have a node that, seemingly overnight, started causing hello@storj.io to send me “Your Storage Node on the US1 satellite has been suspended for returning errors in response to audits.” emails…
…then later send “Your Storage Node on the US1 satellite is no longer suspended for returning errors during audits” email… sometimes only 10 minutes later.
Over and over: I’m seeing at least 15 of those loops so far. I did restart it, and the logs show normal upload+download events. But my scores are being hit (however the Audit number is still 100%?)
Replying to myself: I can see early in the logs it seemed to have gotten into a fast-restart loop: with entries like this:
2025-12-24 10:30:56,450 INFO spawnerr: command at ‘/app/storagenode-updater’ is not executable 2025-12-24 10:30:56,450 INFO gave up: storagenode-updater entered FATAL state, too many start retries too quickly
But when I manually stopped it, then restarted it… it seems to have come back up cleanly (on 1.142.7). I guess I’ll just wait and see if the emails stop.
OK… it has been an hour since the last suspend/unsuspend emails now.
I don’t know what caused storagenode-updater to become unhappy and start restart-looping the node… but a manual restart fixed it. Leaving the post up for future-people-with-problems
A suspension score is affected, when your node is online (answering on audit requests), but returns an unknown error instead of a piece hash. Known errors (“file not found”, “i/o error”, 3x response timeouts, a piece corruption) will affect an audit score instead.
So, you need to search for GET_AUDIT and GET_REPAIR requests which were answered with a some error or warning.