"error": "console: trust: satellite \"118UWpMCHzs6CvSgWd9BfFVjw5K9pZbJjkfZJexMtSkmKxvvAW\" is untrusted"
was related to some trust.json file.
Just don’t know what leads to the suspension and trying to make some actions that I’ve made in previous error cases.
When I have started first two nodes in Storj about 3 years ago all the errors caused by the elictricity shutdown or windows updates - and it was the events from my side.
For now I don’t understand why the things goes wrong without obvious reason. The bad events occurs every two month timeframe and it drains alot of time to fix them. But then they are appeared again.
We increased repair threshold recently to 60, so now we have a lot more repairs going on. The GET_REPAIR can affect suspension and audit scores as it does GET_AUDIT, so if your node answers with unknown error instead of piece, the suspension score will be affected. However, if your node is started to answer on GET_AUDIT or GET_REPAIR normally - the suspension score will be quickly recover.
So, please search for those errors not only for GET_AUDIT but also for GET_REPAIR.
Usually such errors indicates that your node become too slow to answer or maybe there is some disk or network issues.
For example, my nodes doesn’t have problems with GET_REPAIR, even if the number of such requests is now have a higher frequency than before.
There is no GET_AUDIT or GET_REPAIR issues, only WARN message about Stefan satellite in a normal state, as I have mentioned above.
It is some roaming random situation with suspension downgrade. I’m inspecting all nodes stasts 1 or 2 times a day - and there is no obvious reason for these events.
Could it be because the high capasity of the disks and max bandwidth capasity of SATA III archetecture?
The max capasity of the oldest node is 9Tb of 12Tb total.
And the disks are sitting in a Supermicro JBOD shelf that links with Dell H200E 6Gb/s SAS PCIe HBA controller with the PC.
There are must be something wrong when your node receives GET_AUDIT or GET_REPAIR, otherwise the suspension score would not be affected to the level where node become suspended.
Maybe you use a docker version without log redirection, then all logs got deleted with the container and failure records are now gone.
If you have logs redirected, then search for GET_AUDIT or GET_REPAIR and ERROR level, i.e.
I’m using Win10 with Docker on WSL2 based engine.
Docker Desktop 4.6.0 (75818); Docker Engine v20.10.13.
Nothing else than WSL2 have not rolled up.
Looking for logs within Docker Storagenode"N" container.
(Where “N” - is the number of specific Storagenode).