Hello all,
Long-time Storj node operator here, and I’m in a bit of a pickle. With the recent deletion of test data, I’m at a point where I have about 6TB on a single node. I’m working on migrating that data to a new disk (16TB), but the migration just can’t keep up.
I’m running the following command on my node:
rclone copy --transfers=24 --checkers=16 --ignore-checksum --progress --order-by modtime,descending /mnt/disk6/storj/Data/storage/blobs/ /mnt/disks/StorjNode1/Data/storage/blobs/ --exclude=/Data/storage/trash/** --exclude=/Data/storage/garbage/** --exclude=/Data/orders/archive/**
I’ve tweaked this command a ton and found that these settings gave me the best performance. I’m using rclone to handle parallelization, which I found works better than rsync with the parallel package.
The overall activity of my node, combined with the migration (rclone), is putting stress on my system, and everything slows down. My iowait is roughly 40-60% at this point. These are the steps I’m following:
- Migrate (rclone) data from original disk to new disk (while node is running).
- Stop the node.
- Run rclone again to ensure all data is copied.
- Update the config to point to the new disk.
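
For reference, the steps above can be sketched as a small script. This is only a rough sketch under a few assumptions: the paths and rclone flags are the ones from my command, `stop_node` is a hypothetical placeholder (how the node gets stopped depends on the setup), and `DRY_RUN`, `run_pass`, and `migrate` are names made up for illustration. The `--exclude` filters are left out because the source is already the blobs/ directory, so, as far as I can tell, those patterns would not match anything under it. By default it only prints the commands.

```shell
#!/bin/sh
# Sketch of the two-pass migration. Prints commands unless DRY_RUN=0.
set -eu

SRC="/mnt/disk6/storj/Data/storage/blobs/"
DST="/mnt/disks/StorjNode1/Data/storage/blobs/"

# Run (or, by default, just print) one rclone copy pass.
# $1 = value for --transfers, $2 = value for --checkers.
run_pass() {
  set -- rclone copy "--transfers=$1" "--checkers=$2" \
    --ignore-checksum --progress --order-by modtime,descending \
    "$SRC" "$DST"
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Hypothetical placeholder: replace the body with your own stop command.
stop_node() {
  echo "Stop the node now (placeholder, use your own stop command)."
  if [ "${DRY_RUN:-1}" != "1" ]; then
    printf 'Press Enter once the node is stopped: '
    read -r _
  fi
}

migrate() {
  run_pass 24 16   # pass 1: bulk copy while the node is still running
  stop_node        # the node must be down before the final pass
  run_pass 24 16   # pass 2: pick up whatever changed during pass 1
  echo "Now update the config to point to the new disk and start the node."
}

migrate
```

Running it as-is prints the plan; running it with `DRY_RUN=0` in the environment would actually execute the two rclone passes. Lowering the two numbers passed to the first `run_pass` call is one knob that might reduce I/O contention while the node is still serving traffic, though I have not measured that.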
At this point, the high I/O is causing my online score to drop (92-93%), but everything else is fine. How much do audits affect “suspension”? Is there a point where I should just say F*** it, stop the rclone, and point the node at the new disk?
I almost wish there was a way to put my node into a maintenance mode (no ingress, but egress still allowed), so that I/O is reduced and no new data is added.