Seeing customer deleting data over a day and still not stop

bebeo92 · April 24, 2024, 12:10am

Like topic title I now seeing my node continue delete/moving data to trash more than a day and still continue doing it, not sure this is a normal behavior

Node is in version 1.101.3

arrogantrabbit · April 24, 2024, 12:42am

Yes, deleting data is normal.

bebeo92 · April 24, 2024, 12:54am

yeah but constantly delete over 2 days now …
I don’t remember seeing this happen before

Ambifacient · April 24, 2024, 1:22am

To remove data from your disk, the node needs to check every piece to determine if it should be deleted or kept. If you have a slow hard drive, lots of capacity, using lazyfilewalker, not enough RAM to cache metadata, etc. it can be a long time to do this.

Alexey · April 24, 2024, 5:06am

Deletion is now performed by an expiration chore (collector) and trash chore, not by customers directly. The deleted data is gathered by a garbage collector (it’s invoked by a Bloom Filter sent from the each satellite) and sent to the trash first, then it will be permanently removed after 7 days. So, yes, removing can take a lot of time on slow disks.

bebeo92 · April 24, 2024, 6:06am

Thanks for your information

bebeo92 · May 4, 2024, 7:14am

Hi Alexey, I see a problem with this approach now
So after 7 days, expired data are now being remove from the node but also seeing another round of data moving to trash.
Combine this 2 things at the same time cause the disk in heavy stress

nyancodex · May 4, 2024, 9:33am

Data deleting is completely normal? you don’t delete your data? However in this case this may come from the saltlake sat, which is test data.

bebeo92 · May 4, 2024, 9:46am

I understand data delete is normal, but the way storj is implement I don’t think good for disks
When they will have to handle both actual data deletion from disk and continue moving data to trash
Never seeing this behavior before

nyancodex · May 4, 2024, 10:31am

It’s like a backup-plan for them. Sometimes they implement things in the wrong way which makes good data to be deleted, so a “recycle bin” is necessary to recover the data in case of wrong deletion.

bebeo92 · May 4, 2024, 10:47am

Yeah but actual delete data from disk and moving data to trash should not happen at same time ?

Alexey · May 4, 2024, 11:29am

They are async processes not dependent on each other. The trash removal is happening every 24h since the node start, the garbage collection - since the last Bloom Filter received from the satellites.
So you may place them in a different time interval by the simple restart.

bebeo92 · May 4, 2024, 11:44am

Is there a log/setting that indicate trash data remove from disk?
I just know this happen when I seeing my free disk storage increase

Alexey · May 4, 2024, 11:56am

Yes, all log entries for each filewalker/chore you can see there:

However, some entries could be outdated (I didn’t update the article after the latest change yet), I would appreciate if your could fix them, I made it a wiki.

bebeo92 · May 4, 2024, 3:17pm

Not seeing any log relate to “used-space-filewalker” but see “trash-cleanup-filewalker” so update your note with that

nyancodex · May 4, 2024, 10:56pm

check your log level in config?

Alexey · May 5, 2024, 2:55am

used-space-filewalker is running only on start, if you didn’t disable it.

bebeo92 · May 5, 2024, 3:48am

make sense, I’m not disable it
also, Is there any public link to sample config.yaml that contain all the setting with explain what setting will do?

Alexey · May 5, 2024, 4:06am

No and unlikely will be, it’s constantly changes, we cannot keep it updated so it will be always outdated.
But you may run this command to get all parameters and their defaults:

docker

docker exec -it storagenode ./storagenode setup --help

Please note, in the docker setup you will pass most of needed parameters directly in your docker run command, not in the config.yaml file, because command line arguments have a precedence above parameters in the config.yaml file.

PowerShell and Windows GUI:

& "$env:ProgramFiles\Storj\Storage Node\storagenode.exe" setup --help

You may also take a look on (but it’s likely outdated already):

snorkel · May 5, 2024, 5:54am

Yep, it’s a wiki, but I can’t update it on every version, only when I start a new node, which is rare, or start a new node just to get the config, and remove it, which I don’t do, or when others contribute.