Moving a node with Rsync is terribly slow, idea to improve

I had to move nodes, and using rsync for this is very slow, as it is not well suited for millions of tiny files (among other reasons).
It is not even the transfer time that makes it so terribly slow; it is the time rsync needs to compare which files to send.
Here is data of 2 nodes for the final run:
Node 1: total received 2.85G, took from Thursday to Wednesday = 6 days
Node 2: total received 343G, took from Wednesday to Sunday the week after = 11 days

The worst thing of all is that for this final run the node needs to be offline, because that is the other problem with rsync: it is not aware of changes made while it is running.
So node 2 was offline for 11 days just to get moved. That not only hurts the earnings but also the online reputation.

There is a possible solution:
After the first or second rsync run, while the node is still online, we'd only need a list of the files that still have to be transferred instead of letting rsync do the comparison itself.
This list has to be provided by the node.

So my suggestion is to create a log mode or node mode in which the node writes the paths of all new files in the blobs and trash folders into a list. This would record all the changes while rsync and the node are running. For the final rsync run, instead of letting rsync go through thousands of folders with millions of files to determine what has changed, we could simply feed that list of files into rsync and transfer only those.

If I had had such a file list for my node 1, with only 2.85 GB of files that needed to be transferred, I am sure it would not have been offline for 6 days.

What I did is set the node's configured size lower than the space it actually uses, so it stops receiving new data.
Only two rsync runs are needed then: transfer everything, then take the node offline and run a compare to delete files that no longer exist after the first transfer. The second run took just a few (6?) hours for a node close to 10TB.
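
If I remember the setup correctly, the allocation comes from the STORAGE variable on a Docker node, or from storage.allocated-disk-space in config.yaml; a minimal sketch, where the value and the placeholder options are only examples and must be below the space already used:

# Docker node: recreate the container with STORAGE set below the space already used,
# so the node stops accepting new ingress while you copy it
docker run -d --name storagenode -e STORAGE="1TB" <your other usual options> storjlabs/storagenode:latest

# config.yaml node: lower the allocation, then restart the node
#   storage.allocated-disk-space: 1.00 TB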

4 Likes

I've transferred a few dozen nodes so far (all sizes up to 10TB). rsync is the correct way to do it. You keep the node running. You run it once with -aP and let it finish. Rerun with -aP --delete-during and let it finish. Then stop the node, wait a bit for everything to settle (or issue a sync on Linux), and rerun it a third time with -aP --delete-during; the node is transferred. The maximum offline time that I can remember (for a 7TB node) is about 4 hours.

rsync has been around for close to 30 years now; it handles millions of files as efficiently as possible. With its default settings (i.e. if you use the above), it skips files whose size and modification time are unchanged, so scanning an entire directory takes only about as long as an ls on that directory, literally a couple of seconds.
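
As a sketch, that sequence looks roughly like this; the paths are examples, and the trailing slashes matter so the contents of the folder are copied rather than the folder itself:

# pass 1: node still running, bulk copy
rsync -aP /mnt/old/storagenode/ /mnt/new/storagenode/
# pass 2: node still running, catch up and drop files deleted since pass 1
rsync -aP --delete-during /mnt/old/storagenode/ /mnt/new/storagenode/
# stop the node and let pending writes settle
docker stop -t 300 storagenode    # or however your node is run
sync
# pass 3: final catch-up while the node is offline, then start it at the new path
rsync -aP --delete-during /mnt/old/storagenode/ /mnt/new/storagenode/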

I for one do not support adding complexity to the storagenode software. The less code there is, the easier it is to audit it, and less likely for a bug to show up.

6 Likes

If you’re moving the whole drive (like to evac one going bad) then use dd/ddrescue. Run a fsck on the source drive (to fix what you can), ddrescue it to the destination drive, then fsck it one last time on the new drive for good measure. Except for any damaged parts it should transfer at 150-250MB/s.
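
Something along these lines, with the node stopped and the filesystems unmounted; the device names are examples, so double-check them, since ddrescue overwrites the destination:

# repair what can be repaired on the source filesystem
fsck -f /dev/sdX1
# clone the whole drive; the mapfile lets an interrupted run resume
ddrescue -f /dev/sdX /dev/sdY /root/sdX-to-sdY.map
# check the copy on the new drive for good measure
fsck -f /dev/sdY1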

If you need to use a temp location (or are just moving to a larger drive)… try to use SSD for any image files. Like you could ddrescue the old drive (or just a partition) to an image file on SSD (which is 150-250MB/s), then loopback-mount that image to “see” the normal files… then copy/rsync it to the destination drive (and only eat that slow final copy of millions of files once, not twice). The downside to this is if you’re copying a 10TB HDD not everyone has 10TB of SSD space for a temp file.
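
A sketch of that SSD-image detour (paths and devices are again just examples):

# image the old partition onto fast SSD storage, keeping a mapfile for resuming
ddrescue /dev/sdX1 /mnt/ssd/old-node.img /mnt/ssd/old-node.map
# loopback-mount the image read-only to see the normal files
mkdir -p /mnt/old-node
mount -o loop,ro /mnt/ssd/old-node.img /mnt/old-node
# pay the slow many-small-files copy only once, onto the final drive
rsync -aP /mnt/old-node/ /mnt/new-drive/storagenode/
umount /mnt/old-node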

Basically you’re trying to move partitions as large-continuous-blocks, because it’s way faster. Even if you’re only moving a Storj directory… it can be faster to copy an entire partition… then delete any files you don’t want once the copy completes. (Because you don’t care that you may have copied millions-of-extra-files quickly, because you don’t care how slow they are to delete, as long as the Storj node is happily running in the background)

2 Likes

If you make sure to stop ingress before rsync, then—perhaps surprisingly—there would be no new files in the blobs directory to compare against. As such, it would be enough to rsync databases, orders, etc.

1 Like

Keep running rsync on the live node until a pass completes in just a few minutes, and only then shut down the node for the final run.

The filesystem already does it. After a few rsync runs all the metadata is in RAM, so comparing the differences will be very fast. You can also tell rsync to only compare

What is the filesystem there?

1 Like

LVM is what I will slowly migrate to. That way I can mix Storj data and personal data on the same physical disk but use dd for block-level moves across disks. LVM also offers ways to move data across physical disks.

I explain how I just recently moved from a standard partition to an LVM LV in this thread: Disable file walkers temporarily while migrating

I made the mistake of trying to move the entire storage folder over the same network. I shut the node down, then moved the folder; it would have taken about a day to move 8TB. Unfortunately my breaker tripped at 2AM because my AC kicked on… and well, that was the end of that node… Now I only use rsync to transfer nodes, exactly as @Mitsos described.

I am moving my node on my Synology from /volume1/docker/storj to /volume2/docker-2/storj. Doing it with rsync is indeed very slow. I found a way to speed it up by running rsync in parallel. I’m using a docker container to do the move. This seems to at least somewhat improve the speed.

services:
    rsync:
        image: nixos/nix
        # after the parallel copy finishes, run a single plain rsync with --delete to clean up
        command: >
            nix-shell -p rsync parallel --command
            "find /source -type f | parallel -j8 rsync -vv --archive --progress --stats --mkpath {} {= s/source/target/ =}"
        volumes:
            - /volume1/docker/storj:/source
            - /volume2/docker-2/storj:/target

You can also use rclone; it supports parallel transfers out of the box, without any scripting, and works much better.
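
For example, something like this; the source and destination are the paths from the post above, and the flag values are just a starting point to tune:

rclone sync /volume1/docker/storj /volume2/docker-2/storj \
    --transfers 8 --checkers 16 --progress
# repeat the same command once more with the node stopped for the final catch-up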

1 Like

Just switched to rclone, it's way faster indeed!

1 Like

Two workarounds would be:
- using ZFS and moving a volume, but that may involve putting the node offline; I don't have enough experience with ZFS
- using virtualization, with a hypervisor that can perform hot migration of your data

I ended up taking my node offline and using the built-in functionality from Synology to change the location of the shared folder to another volume. Not sure if it was the quickest, but it got the job done. Using rclone/rsync while the node was online seemed to be IO-bound and rather slow.

1 Like

+1
The best way is to take it offline and clone it.
From a smaller disk to a bigger one it's perfect if you have a hardware HDD cloner.
Just 1-3 hours or so.

There are multiple ways of doing this without any downtime.
If you are on ZFS, just add the larger drive to the pool as a mirror. Let it sync in the background; once it finishes resilvering, remove the old drive and run the zfs command to expand the pool.
If you are on LVM, add the new disk to the VG and use pvmove on the old disk; it will move the data to the new one. Then run the lvm command to expand your LV.

Both of these methods work at the block level, so they run near your drive's maximum read and write speed.
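
Roughly like this, assuming a pool named tank, a volume group vg0 and an LV named storj; the device names and the ext4 resize are examples to adapt:

# ZFS: mirror onto the new, larger disk, then drop the old one
zpool attach tank /dev/old-disk /dev/new-disk
zpool status tank                     # wait until the resilver has finished
zpool set autoexpand=on tank
zpool detach tank /dev/old-disk
zpool online -e tank /dev/new-disk    # expand if the pool has not grown by itself

# LVM: move the extents to the new disk, then grow the LV and the filesystem
pvcreate /dev/new-disk
vgextend vg0 /dev/new-disk
pvmove /dev/old-disk                  # block-level move, the node keeps running
vgreduce vg0 /dev/old-disk
lvextend -l +100%FREE /dev/vg0/storj
resize2fs /dev/vg0/storj              # ext4; use the matching tool for your filesystem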

1 Like

Hi!
Tried for over a month to sync from an 8TB NAS with 1GB of RAM (<-- the problem) to a more powerful machine.

Gave up; graceful exit is running, and the new machine with a fresh node is growing faster than the sync from the old NAS did… :confused:

What could be done, since we basically have a journal of the files that get written while rsync is running, is to parse that information from the logs.

From every successful upload we have the piece ID and the satellite.
That translates into a piece, a folder and a file path.
So we could write a log of every new file and feed that into rsync.
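
Until the node can write such a list itself, here is a rough sketch of the same idea using a timestamp marker instead of the logs; the paths are examples, and --files-from takes paths relative to the source directory:

cd /mnt/old/storagenode
# set a marker right before the last online rsync pass
touch .last-pass
rsync -aP /mnt/old/storagenode/ /mnt/new/storagenode/
# stop the node, then list only what was written since the marker
find blobs trash -type f -newer .last-pass > /tmp/changed-files.txt
# final pass: copy just those files, no scan of millions of unchanged ones
rsync -aP --files-from=/tmp/changed-files.txt /mnt/old/storagenode/ /mnt/new/storagenode/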

I would recommend trying rclone sync; you can also increase the number of parallel transfers if the disk can keep up.

I’ve seen improvement in rsync by using ionice -c2 -n3 rsync ... to give it a higher IO priority.