My fullest node is a nearly full 8TB ext4-formatted drive, and it's probably pretty fragmented.
I’m trying to let the initial used-space filewalker finish and it’s taking… 5 days so far. This may be “normal time” from what I’ve seen others post, but while looking at logs, I noticed…
It seems that the lazy gc-filewalker (garbage collection) and the used-space-filewalker both started at the same time when the node started, and both are still running 5 days later.
Would having both running at the same time be thrashing the disk?
Would it perhaps be optimal to have one finish first (probably GC) and then start the second one?
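For anyone who wants to check the same thing on their own node, here is a rough sketch of how the two filewalkers can be pulled out of the logs. It assumes the container is named storagenode, that the node logs to Docker rather than to a file, and that the names gc-filewalker and used-space-filewalker appear in the log lines; `filter_filewalkers` is just a helper name made up for this example.

```shell
# Keep only log lines that mention either of the two filewalkers.
# Reads log text on stdin and writes the matching lines to stdout.
filter_filewalkers() {
  grep -E 'gc-filewalker|used-space-filewalker'
}

# Usage (assumption: container named "storagenode", logging to Docker):
#   docker logs storagenode 2>&1 | filter_filewalkers
```

Comparing the timestamps of the first and last matching lines for each name should show whether the two runs overlap.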
It seems my disk (10TB of data out of 16TB allocated to the node, on a 20TB disk) is permanently running filewalkers.
Before the tests began it held 5TB and was mostly silent (writes once a minute via PrimoCache).
It has a 512GB NVMe read cache. Back then the filewalker ran for a few hours a day at most.
My collector-interval-duration is unchanged, but on this same node there was a 3hr 20min gap between when the first gc-filewalker finished and when the next one began. (It began at the same time as my node received a “retain” request.)
The GC filewalker should run one by one if the parameter
$ docker exec -it storagenode ./storagenode setup --help | grep concur
--retain.concurrency int how many concurrent retain requests can be processed at the same time. (default 5)