Hello storjlings
I have a raspberry with an 4 TB drive on it.
The node id is 1UfHZAzNqFRS9yjsgSeTeERXDrLo7dUw1f714qTgZXvzrq71Fu
Ive checked it today and it got the update 1.3.3 today but also the message that it got suspended on All but the US sattelite.
Can you check why and if i can fix it?
Thanks in advance
This morning the node has been upgraded and got the suspension. My node was working fine since September when I started. It’s under UPS and no fault in the last week.
I just restarted my pi and it seems that it got some in/egress.
In the logs iam finding also this entry 05-02T06:49:50.505Z ERROR piecestore failed to add bandwidth usage {"error": "bandwidthdb error: database is locked", "errorVerbose": "bandwidthdb error: database is locked\n\tstorj.io/storj/storagenode/storagenodedb.(*bandwidthDB).Add:59\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).saveOrder:728\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).doUpload:448\n\tstorj.io/storj/storagenode/piecestore.(*drpcEndpoint).Upload:216\n\tstorj.io/common/pb.DRPCPiecestoreDescription.Method.func1:987\n\tstorj.io/drpc/drpcmux.(*Mux).HandleRPC:107\n\tstorj.io/common/rpc/rpctracing.(*Handler).HandleRPC:66\n\tstorj.io/drpc/drpcserver.(*Server).handleRPC:111\n\tstorj.io/drpc/drpcserver.(*Server).ServeOne:62\n\tstorj.io/drpc/drpcserver.(*Server).Serve.func2:99\n\tstorj.io/drpc/drpcctx.(*Tracker).track:51"}
Your node has been suspended on 118UWpMCHzs6CvSgWd9BfFVjw5K9pZbJjkfZJexMtSkmKxvvAW1wFTAgs9DP5RSnCqKV1eLf6N9wtk4EAtmN5DpSxcs8EjT69tGE121RTSDpyNZVcEU84Ticf2L1ntiuUimbWgfATz21tuvgk3vzoA612EayRS2V1kEsWESU9QMRseFhdxYxKicsiFmxrsLZHeLUtdps3S12L9ZFwhzVpuEKMUNUqkaTLGzwY9G24tbiigLiXpmZWKwmcNDDs . If you have any questions regarding this please check our Node Operators thread on Storj forum.
Unfortunate no. On the satellite side, we noticed that some nodes are not working fine and we had to implement suspension mode because of that (~4 weeks ago). So your node was not working fine but for different reasons, you didn’t notice it before. → It is a feature and not a bug.
The error message you posted is unrelated. Search especially for any download failed with GET_AUDIT
OK, I noticed a lot of "database is locked" errors, both on GET and on GET_AUDIT.
So, what I have to do to fix this problem?
I already bought a new hardware (8 cores) and a new SATA drive will arrive on Tuesday next week, at this point I don’t know if it’s better to move this node on the new HD or start a new node and perform a graceful exit on the current node.
Please, let me know, I’m here to help you to better support your client, but I need a little help from Storj to fix this situation.
This tends to happen when there is an IO bottleneck. There can be several reasons for this, for example if you use a network protocol or USB2. It also happens with SMR drives that can’t keep up.
Neither would be my suggestion. Since it’s likely an IO bottleneck, you might be best off running a node on each HDD. That will cut the IO per HDD in half because the nodes would be sharing the same traffic.
Suspension is not definitive. You can recover from it if you can fix the issues. So hopefully this post gives you some ideas of what to look for.
No to both. The logs don’t impact the nodes performance. Removing or rotating logs can’t have any effect on this.
You’re right, my HD is running on USB2 interface, that’s why I ordered a new SATA drive which will be connected directly to the SATA port of my new board (Odroid HC2).
Thank you, the node is up and running now, but I’ve understood that this setup cannot cope with the workload required. I’ve invested some money in new hardware, both board and disk, next week I will deploy a more efficient node.
Hi,
The same problem appeared to me “Your node has been suspended on 118U …”. I use raspberry pi 4 with 8TB usb 3.0 HDD. I think this problem is a major one considering that there are many people with RPi, and if the problem is not solved, the Storj network will suffer. How can we solve this problem?
Thanks.