I’ve been measuring I/O on both nodes on this system and I’m at a loss to explain what I’m seeing. Both nodes are seeing a very similar ingress rate. Here is 60 seconds of iostat averages:
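For reference, averages like this can be captured with something along these lines (a sketch; my exact invocation may have differed):

# Extended per-device statistics; the first report covers the time
# since boot, the second covers the 60-second interval.
iostat -dx 60 2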
sdc is the disk for the node that’s having trouble. This is after vacuuming and defragging all databases, and limiting the node on sdc to three concurrent uploads. (Before, sdc was seeing 300+ tps and I think this was causing the lock timeouts – the disk was just too busy.)
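For anyone repeating the vacuum step, it looked roughly like this (a sketch only: it assumes the container is named storagenode and that the databases live under /mnt/storj/storage; adjust both to your setup):

# Stop the node so nothing holds the databases open,
# then VACUUM each SQLite database, then restart.
docker stop -t 300 storagenode
for db in /mnt/storj/storage/*.db; do
  sqlite3 "$db" "VACUUM;"
done
docker start storagenode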
Both HDDs are exactly the same model. Something about the storagenode on sdc is doing a ton of write I/O and I’ve no idea why. It doesn’t make sense.
No, that’s not it as far as I can tell. Note that the read kB/s doesn’t match the written kB/s. Over a 60-second window, if GC were the issue I’d expect those to be approximately equal, or at least much closer than 53 and 14,172. I also suspected it was related to mass trash deletions, so I monitored the output of find trash/ -type f | wc -l and it didn’t change over the course of several hours.
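Concretely, I just re-ran that count on an interval, along the lines of (run from the node’s storage directory):

# Re-count files under trash/ every 10 minutes; a shrinking
# count would indicate mass trash deletions in progress.
while true; do
  date
  find trash/ -type f | wc -l
  sleep 600
done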
2020-07-07T19:28:24.676-0300 ERROR piecestore failed to add order {"error": "ordersdb error: database is locked", "errorVerbose": "ordersdb error: database is locked\n\tstorj.io/storj/storagenode/storagenodedb.(*ordersDB).Enqueue:52\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).saveOrder:665\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).Upload.func5:312\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).Upload:328\n\tstorj.io/common/pb.DRPCPiecestoreDescription.Method.func1:996\n\tstorj.io/drpc/drpcmux.(*Mux).HandleRPC:107\n\tstorj.io/common/rpc/rpctracing.(*Handler).HandleRPC:56\n\tstorj.io/drpc/drpcserver.(*Server).handleRPC:111\n\tstorj.io/drpc/drpcserver.(*Server).ServeOne:62\n\tstorj.io/drpc/drpcserver.(*Server).Serve.func2:99\n\tstorj.io/drpc/drpcctx.(*Tracker).track:51"}
2020-07-07T19:28:26.255-0300 ERROR piecestore failed to add order {"error": "ordersdb error: database is locked", "errorVerbose": "ordersdb error: database is locked\n\tstorj.io/storj/storagenode/storagenodedb.(*ordersDB).Enqueue:52\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).saveOrder:665\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).Upload:409\n\tstorj.io/common/pb.DRPCPiecestoreDescription.Method.func1:996\n\tstorj.io/drpc/drpcmux.(*Mux).HandleRPC:107\n\tstorj.io/common/rpc/rpctracing.(*Handler).HandleRPC:56\n\tstorj.io/drpc/drpcserver.(*Server).handleRPC:111\n\tstorj.io/drpc/drpcserver.(*Server).ServeOne:62\n\tstorj.io/drpc/drpcserver.(*Server).Serve.func2:99\n\tstorj.io/drpc/drpcctx.(*Tracker).track:51"}
2020-07-07T19:28:27.465-0300 ERROR piecestore failed to add order {"error": "ordersdb error: database is locked", "errorVerbose": "ordersdb error: database is locked\n\tstorj.io/storj/storagenode/storagenodedb.(*ordersDB).Enqueue:52\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).saveOrder:665\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).Upload:409\n\tstorj.io/common/pb.DRPCPiecestoreDescription.Method.func1:996\n\tstorj.io/drpc/drpcmux.(*Mux).HandleRPC:107\n\tstorj.io/common/rpc/rpctracing.(*Handler).HandleRPC:56\n\tstorj.io/drpc/drpcserver.(*Server).handleRPC:111\n\tstorj.io/drpc/drpcserver.(*Server).ServeOne:62\n\tstorj.io/drpc/drpcserver.(*Server).Serve.func2:99\n\tstorj.io/drpc/drpcctx.(*Tracker).track:51"}
2020-07-07T19:28:28.823-0300 ERROR piecestore failed to add order {"error": "ordersdb error: database is locked", "errorVerbose": "ordersdb error: database is locked\n\tstorj.io/storj/storagenode/storagenodedb.(*ordersDB).Enqueue:52\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).saveOrder:665\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).Upload:409\n\tstorj.io/common/pb.DRPCPiecestoreDescription.Method.func1:996\n\tstorj.io/drpc/drpcmux.(*Mux).HandleRPC:107\n\tstorj.io/common/rpc/rpctracing.(*Handler).HandleRPC:56\n\tstorj.io/drpc/drpcserver.(*Server).handleRPC:111\n\tstorj.io/drpc/drpcserver.(*Server).ServeOne:62\n\tstorj.io/drpc/drpcserver.(*Server).Serve.func2:99\n\tstorj.io/drpc/drpcctx.(*Tracker).track:51"}
2020-07-07T19:28:31.277-0300 ERROR piecestore failed to add order {"error": "ordersdb error: database is locked", "errorVerbose": "ordersdb error: database is locked\n\tstorj.io/storj/storagenode/storagenodedb.(*ordersDB).Enqueue:52\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).saveOrder:665\n\tstorj.io/storj/storagenode/piecestore.(*Endpoint).Upload:409\n\tstorj.io/common/pb.DRPCPiecestoreDescription.Method.func1:996\n\tstorj.io/drpc/drpcmux.(*Mux).HandleRPC:107\n\tstorj.io/common/rpc/rpctracing.(*Handler).HandleRPC:56\n\tstorj.io/drpc/drpcserver.(*Server).handleRPC:111\n\tstorj.io/drpc/drpcserver.(*Server).ServeOne:62\n\tstorj.io/drpc/drpcserver.(*Server).Serve.func2:99\n\tstorj.io/drpc/drpcctx.(*Tracker).track:51"}
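These errors repeat every second or two. To gauge how often they occur overall, the occurrences can be counted in the container log (assuming a Docker node with the default container name):

# Count "database is locked" occurrences in the container log.
docker logs storagenode 2>&1 | grep -c "database is locked"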
What kind of errors are these? Should I be worried? What should I look at to correct these errors and make sure they don’t happen again?
My HDD is connected via SATA 3
I am very new to Storj and just set up my first node tonight. I am running on Debian 10 (Buster) with Docker and running into the same “database is locked” error. I run sudo docker run ... and have tried it both with and without the --privileged option; the behavior is the same error in both cases.
The “database is locked” error is not related to “failed to start”, so I moved this to the appropriate topic.
You do not need to specify the --privileged option unless you are on QNAP. The QNAP issue is being investigated in a separate topic and has nothing in common with your problem.
The “database is locked” error is related to a slow disk subsystem or to network-attached drives.
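A quick way to check whether the disk subsystem is too slow is a synchronous small-write test on the drive that holds the databases (a rough sketch; adjust the path to your storage location and remove the test file afterwards):

# 1000 x 4 KiB synchronous writes; if this takes much more than
# a few seconds on an otherwise idle disk, the drive is likely
# too slow for the node's database traffic.
dd if=/dev/zero of=/mnt/storj/latency-test bs=4k count=1000 oflag=dsync
rm /mnt/storj/latency-test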
How is the HDD connected to your PC?
My HDD is connected by USB 3 to a Pi 4 4GB and I have the same problem.
Could it be that the “database is locked” error causes the restarts that reset the uptime?
And could an SMR disk be the problem?
I am running in a VM and had the mount via a CIFS share for some other setup work I was doing. I’ve since moved the device to be locally attached, which has solved the issue. Docker starts successfully now.
Thank you for pointing me in the right direction.
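For anyone else making the same change: the storage mounts in the docker run command just need to point at the locally attached paths. An illustrative fragment only (it omits the required ports, environment variables, and other options, and the paths are placeholders):

# Bind-mount identity and storage from the locally attached disk
# rather than a CIFS/NFS path (fragment; not a complete command).
docker run -d --name storagenode \
  --mount type=bind,source=/mnt/local-disk/identity,destination=/app/identity \
  --mount type=bind,source=/mnt/local-disk/storage,destination=/app/config \
  storjlabs/storagenode:latest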
CIFS is not compatible with SQLite, and thus not with storagenode. NFS will not work either.
The only working network protocol is iSCSI.
But even then your node will lose the race for almost all pieces - there are a lot of nodes with directly attached drives.
So, as you figured out - a locally attached drive is the only working solution.
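To verify that the storage location is on a local filesystem and not on a network share, check the mount backing the path (assuming /mnt/storj is the storage location):

# "cifs" or "nfs" in FSTYPE means the node is on a network share.
findmnt -T /mnt/storj -o TARGET,SOURCE,FSTYPE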