On Saturday 2 December 2023 I received a message by email stating that
“your storage node on AP1 satellite has been suspended for being offline for too long”
The node is active 24/7 for at least 1 year except for a 2 hour outage 2 weeks ago.
I have 4 satellites, and only one of them gives me this error.
I have collected the errors with this command
docker logs storagenode 2>&1 | grep -E “GET_AUDIT|GET_REPAIR” | grep failed
the result is this and the node that is in suspension does not appear which is
“121RTSDpyNZVcEU84Ticf2L1ntiuUimbWgfATz21tuvgk3vzoA6”
2023-12-04T15:31:57Z ERROR piecestore download failed {“process”: “storagenode”, “Piece ID”: “OPUHQZTHACY5U6IHBUCOLPGH263NB7FKTMVCP6OZOMKAZCIUYCFA”, “Satellite ID”: “12L9ZFwhzVpuEKMUNUqkaTLGzwY9G24tbiigLiXpmZWKwmcNDDs”, “Action”: “GET_REPAIR”, “Offset”: 0, “Size”: 1218560, “Remote Address”: “157.90.26.38:50830”, “error”: “manager closed: read tcp 10.0.3.2:28967->157.90.26.38:50830: read: connection timed out”, “errorVerbose”: “manager closed: read tcp 10.0.3.2:28967->157.90.26.38:50830: read: connection timed out\n\tstorj.io/drpc/drpcmanager.(*Manager).manageReader:231”}
2023-12-04T16:54:33Z ERROR piecestore download failed {“process”: “storagenode”, “Piece ID”: “HKBYI5JFLHVT35EHBQ4AUWWBFJBBBDBC5EHI2TYOMVBHECGBGLYQ”, “Satellite ID”: “12L9ZFwhzVpuEKMUNUqkaTLGzwY9G24tbiigLiXpmZWKwmcNDDs”, “Action”: “GET_REPAIR”, “Offset”: 0, “Size”: 1218560, “Remote Address”: “195.201.216.62:59692”, “error”: “manager closed: read tcp 10.0.3.2:28967->195.201.216.62:59692: read: connection timed out”, “errorVerbose”: “manager closed: read tcp 10.0.3.2:28967->195.201.216.62:59692: read: connection timed out\n\tstorj.io/drpc/drpcmanager.(*Manager).manageReader:231”}
2023-12-04T17:45:03Z ERROR piecestore download failed {“process”: “storagenode”, “Piece ID”: “6MJV5E7WWHOFB3SFA4R3WZ462CRZJMMKPVDZL5JPLN3TKE6VGX4A”, “Satellite ID”: “12EayRS2V1kEsWESU9QMRseFhdxYxKicsiFmxrsLZHeLUtdps3S”, “Action”: “GET_REPAIR”, “Offset”: 0, “Size”: 1218560, “Remote Address”: “199.102.71.25:54724”, “error”: “manager closed: read tcp 10.0.3.2:28967->199.102.71.25:54724: read: connection timed out”, “errorVerbose”: “manager closed: read tcp 10.0.3.2:28967->199.102.71.25:54724: read: connection timed out\n\tstorj.io/drpc/drpcmanager.(*Manager).manageReader:231”}
2023-12-04T19:09:49Z ERROR piecestore download failed {“process”: “storagenode”, “Piece ID”: “SJ652IA2RACQ5MWQM4NMAFUSAIFKHCJJNQHYXQU7E4Q2PD3COTZA”, “Satellite ID”: “12L9ZFwhzVpuEKMUNUqkaTLGzwY9G24tbiigLiXpmZWKwmcNDDs”, “Action”: “GET_REPAIR”, “Offset”: 0, “Size”: 580096, “Remote Address”: “159.69.36.227:40188”, “error”: “manager closed: read tcp 10.0.3.2:28967->159.69.36.227:40188: read: connection timed out”, “errorVerbose”: “manager closed: read tcp 10.0.3.2:28967->159.69.36.227:40188: read: connection timed out\n\tstorj.io/drpc/drpcmanager.(*Manager).manageReader:231”}
I attach screenshots of what I got yesterday and what I get today, during these two days, the node has not been paid, I just restarted it since I updated the version of Docker in case that was the incidence.
There are two nodes that have lowered the online status, the other two are still at the same percentage".
Yesterday
Today
I have read these entries in the forum and support but it doesn’t clarify anything and the % online keeps going down.
The ports in the firewall are open and working, otherwise the two nodes that do not drop the online status would drop in %.
what else can I check?
Regards