Trash does not go away in 7 days

Update:

After the watchtower update to 1.102.3 last night, the trash has now been successfully deleted and the free disk space is shown correctly.

I’d say this still applies, although the definition of suitable hardware will shift. I.e. if you have a bunch of Raspberry Pis — forget it. If you have an actual server — sure. And that “sure” may extend to adding more drives to an existing setup.

FWIW I have 4 empty bays. If the 100Mbps traffic becomes reality I’ll add 60TB without batting an eye.

The concern, however, is how many folks have asymmetric connections, i.e. DOCSIS customers: gigabit down, 30 megabit up.

All that data that is being uploaded may one day need to be downloaded. This should be tested too.

You are right about this, but it is also true that for the Storj service to be high quality it must be entrusted to SNOs who have invested in server-farm-level equipment and have connections of over 2 Gbit upload and 10 Gbit download. You’re right that not everyone has a high-level connection, but since I’m also a Storj customer, if I pay I expect files to download quickly and file-search responses to be immediate, or nearly so. Otherwise, if Storj were based only on home users, the quality of the service would not take off. Obviously this is just my opinion, certainly not shared by everyone, but in this jungle of cloud services the winner is the one who offers not only the best price for data storage but also the best quality of service.
Therefore, anyone who does not keep up with the speed and quality of the network, which is developing and expanding more and more, will be disqualified or will lose income because they will not be able to keep up with the other SNOs. Unfortunately that’s the hard truth, but I suppose that’s how it works.

I’d love an excuse to use 20TB+ HDDs! But I think the average node is growing around 400GB/month right now? So I’m not going to fill my existing disks for years…

Good question. I’m not sure how fast storage is used up. I’m getting 1-1.5TB of ingress. I’m not sure how fast stuff gets deleted.

And here is another one:

A node that claims to run 1.102.3 but still has the old hierarchy under trash, without date folders.

/storage/trash/.trash-uses-day-dirs-indicator
/storage/trash/ukfu6bhbboxilvt7jrwlqk7y2tapb5d2r2tsmj2sjxvw5qaaaaaa/2a

Trash folders from all other satellites are empty.
What I don’t know is whether there is an issue with it or whether it is normal, for example if the migration maybe has not yet started.

The gc-filewalker does not seem to have a problem with the current folders:

2024-05-02T20:28:21Z    INFO    lazyfilewalker.gc-filewalker.subprocess gc-filewalker completed {"Process": "storagenode", "satelliteID": "12EayRS2V1kEsWESU9QMRseFhdxYxKicsiFmxrsLZHeLUtdps3S", "piecesSkippedCount": 0, "Process": "storagenode", "piecesCount": 4804204, "trashPiecesCount": 78266, "piecesTrashed": 78266}

So the questions would be:

  • Why are the trash folders from the other satellites empty (no subfolders in there)?
  • Why are there no date folders in the US-1 satellite folders?

The filewalker logs show that it is ready to go by date:

2024-05-02T12:01:46Z    INFO    lazyfilewalker.trash-cleanup-filewalker.subprocess      trash-filewalker started        {"Process": "storagenode", "satelliteID": "12EayRS2V1kEsWESU9QMRseFhdxYxKicsiFmxrsLZHeLUtdps3S", "Process": "storagenode", "dateBefore": "2024-04-25T12:01:46Z"}

Honestly: the number of different issues I am seeing on just a few of my nodes (I do not check them systematically for these kinds of trash errors) is far beyond what I expected and what I believe it should be. This is very concerning.

I see the same issue.

It created a new date folder without moving the old files. This happened earlier, but I thought the newer bloom filter would move the older folders into per-day folders.

Well spotted.
I did not see the date folder in my satellite folder, but it is there.

And interesting enough I have all the subfolders in there:

/storage/trash/ukfu6bhbboxilvt7jrwlqk7y2tapb5d2r2tsmj2sjxvw5qaaaaaa/2024-05-02/2a

So I have:

/storage/trash/.trash-uses-day-dirs-indicator
/storage/trash/ukfu6bhbboxilvt7jrwlqk7y2tapb5d2r2tsmj2sjxvw5qaaaaaa/2a
/storage/trash/ukfu6bhbboxilvt7jrwlqk7y2tapb5d2r2tsmj2sjxvw5qaaaaaa/2024-05-02/2a

OMG. The next mess.

Edit: Both subfolders contain piece files, /trash/ukfu6bhbboxilvt7jrwlqk7y2tapb5d2r2tsmj2sjxvw5qaaaaaa/2a as well as /trash/ukfu6bhbboxilvt7jrwlqk7y2tapb5d2r2tsmj2sjxvw5qaaaaaa/2024-05-02/2a.

Yes, both folders will have files as per their respective bloom filters. It’s the migration to the per-day folders that messed up somewhere.

Latest GC log entry. This hasn’t finished yet.

2024-05-02T22:03:13Z    INFO    lazyfilewalker.gc-filewalker.subprocess gc-filewalker started  {"Process": "storagenode", "satelliteID": "12EayRS2V1kEsWESU9QMRseFhdxYxKicsiFmxrsLZHeLUtdps3S", "Process": "storagenode", "createdBefore": "2024-04-23T17:59:59Z", "bloomFilterSize": 5069692}

The migration was a one-time thing. So maybe at some point it would be safe not to solve the underlying problem, but simply to wipe out the satellite trash folder to get rid of everything that is wrong in there and resume operation with date folders.

No, but now you can bring more hardware to the network. It’s still not a requirement, it’s an opportunity. See the difference?
We need more simple nodes across the world, not necessarily for you to increase your hardware. However, that will be accepted as well (we do not track whether it is economically viable for you or not, sorry about that).
We posted a message; now you need to decide what you would like to do (or do nothing, and that is OK too!). For example, I’ll do nothing. I do not have physical access to my server, so I would not be able to expand (and of course I regret this, but well, that’s life…)

For overcomplicated setups using virtualization or RAID or other layers - for sure. My setup does not suffer. What’s the difference? My nodes follow the recommendations:

  • 1 disk per node, no RAID;
  • no virtualization (well, this is not a recommendation, but for me it was obvious not to waste resources on something that can work normally without any virtualization);

Yes, the OS is the worst choice (Windows), however, they work. And only that is important.
I have 2 Docker for Windows Desktop nodes and the GUI one; they work perfectly and can handle the load. Yes, this is an old 2019 server with an i7 and 32GB of RAM and simple WD (and one Toshiba :face_vomiting:) disks, three in total - everything is working perfectly. The server is in the EU (of some kind…), and is not currently used for anything else (I moved my work to the cloud several years ago…).

Perhaps it’s a nice idea, would you mind making a pull request?

I would disagree; a fleet of Raspberry Pis is very powerful. I had one with 1GB of RAM and it worked perfectly during the past stress tests (which were much more stressful, by the way - my 100Mbps Internet channel was 100% saturated for several days!), and it survived.
It also handled the deletion of that data afterwards, so I would strongly disagree, with all respect.

We do not recommend investing under any circumstances, even now. So it’s your risk and your opportunity.

Could you please prove your claim?

This is perhaps a bug and is discussed separately.

Either you mistyped the quoted part, or that reply wasn’t meant for me. My setups are bare metal, one node per disk.

Flexing aside, that is not what the recent announcement suggested. The announcement suggested that the prospective new client, who can’t be given a free one-month trial to upload/download their data and see if the network can keep up, has to wait for the OK from the Storj team after testing on a dozen petabytes of data, i.e. the 50% (my claim; we are still waiting on confirmation of the exact figure for the past month) of currently “unpaid” free-account data sitting on the network. If I were testing, I’d personally test with 10% of the data. Maybe the client doesn’t have this much data, but the announcement disproves that. The announcement specifically states, and I quote, that “this will be the new normal”: data will be continuously deleted from the production satellites and migrated to the testing satellite for the next year or so. It will stay there for a couple of weeks (to avoid paying out the whole month for it) and then be deleted as well. That means the new client will at best match the current usage, or, based on a logical 10% test, will far exceed the current network size. Or the client is simply overblowing their numbers.

That settles the use-what-you-have / go-by-Storj-requirements part. Either the network expands to match the usage, or the answer to clients is “sorry, we can’t handle the load right now, you can check back later”. As we have already established in another thread: the berry plant eventually runs out of free berries for anyone to pick.

Back to the topic at hand: The golden rule in life that should be taught in every single school class, whether it’s history or physics, is:
once, it happens
twice, it’s a coincidence
thrice, it’s a pattern

Judging by the replies above, I’m not the only one that had the date-named folders mixed with prefix-named folders. My questions to the team are:

  • Why wasn’t this scenario expected, and why wasn’t there a fallback mechanism (i.e. continue with the migration) in case the migration got interrupted?
  • What are the next steps with regards to SNOs and this stuck data? Do you still want us to test something on this data?
  • If the data isn’t needed because the bug has been identified, fixed and tested on staging systems, should we go ahead and delete this data?

I’ll take those, I guess.

While we would like to think that we are smart enough to foresee all possible bugs, it turns out we are still human :frowning:. As to why there wasn’t a fallback mechanism, this is because the migration step is supposed to be extremely simple and quick (it consists only of three renames/moves) and in the unlikely event it was interrupted it could be fixed manually.
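
For readers who want a mental model of those “three renames/moves”, here is a minimal sketch of such a per-satellite migration in Go. The function name, the temporary-directory convention and the permissions are my own assumptions for illustration, not the actual storagenode code.

// Hypothetical sketch only (not the storagenode implementation):
// migrate one satellite's flat trash layout (prefix dirs like "2a")
// into a single per-day directory using a handful of renames.
package sketch

import (
	"os"
	"path/filepath"
	"time"
)

func migrateTrashToDayDirs(satelliteTrashDir string, now time.Time) error {
	tmpDir := satelliteTrashDir + ".migrating" // assumed temp name
	dayDir := filepath.Join(satelliteTrashDir, now.UTC().Format("2006-01-02"))

	// 1. Move the old flat layout out of the way.
	if err := os.Rename(satelliteTrashDir, tmpDir); err != nil {
		return err
	}
	// 2. Recreate the (now empty) satellite trash directory.
	if err := os.MkdirAll(satelliteTrashDir, 0o755); err != nil {
		return err
	}
	// 3. Move the old contents under today's date directory.
	return os.Rename(tmpDir, dayDir)
}

If a second process kicks off the same sequence while step 1 or 2 is in flight, it is easy to see how both old prefix directories and new date directories end up side by side, which matches the reports above.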

But somehow the migration seems to have been kicked off multiple times all at once, and in some cases deletions were already being processed before the migration, leaving behind directories in the old location. This seems to be because of file walker subprocesses opening the blob store at the same time as the main process, which wasn’t caught because unfortunately our test systems don’t spawn the filewalker subprocesses in the same way.

We don’t have a fix yet. This situation is not considered ultra-high priority because the only data at risk of being lost is already in the trash. We definitely do see how it’s a problem for our SNOs, though, and we would like very much to fix it. The plan is to recognize the recursive directory problem and the unmigrated directories problem and correct them automatically on node start.
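
As an illustration of what such a startup check could look like (a guess on my part, not the planned fix), entries directly under a satellite’s trash directory could be classified by name, with two-character prefix directories treated as unmigrated leftovers and YYYY-MM-DD directories as the new per-day folders:

// Guesswork sketch, not the planned fix: classify entries under a
// satellite trash directory into old-style prefix dirs and per-day dirs.
package sketch

import (
	"os"
	"regexp"
	"time"
)

var prefixDir = regexp.MustCompile(`^[0-9a-z]{2}$`) // old layout, e.g. "2a" (assumed pattern)

func classifyTrash(satelliteTrashDir string) (oldPrefixes, dayDirs []string, err error) {
	entries, err := os.ReadDir(satelliteTrashDir)
	if err != nil {
		return nil, nil, err
	}
	for _, e := range entries {
		if !e.IsDir() {
			continue // plain files are left alone
		}
		name := e.Name()
		if _, perr := time.Parse("2006-01-02", name); perr == nil {
			dayDirs = append(dayDirs, name) // new per-day folder
		} else if prefixDir.MatchString(name) {
			oldPrefixes = append(oldPrefixes, name) // unmigrated leftover
		}
	}
	return oldPrefixes, dayDirs, nil
}

The leftovers could then be folded into a dated folder on start, along the lines of the migration sketch above.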

If the amount of data in the trash is causing you problems, you can delete it. Ideally you should keep pieces from the last 7 days. It may be helpful if you keep the data, just so we can test our fixes, but you’re under no obligation to do so.
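
If you do decide to clean up manually along those lines, a rough sketch of the “keep the last 7 days” rule might look like this. This is a hypothetical helper of my own, not official tooling; it only touches date-named folders and leaves everything else alone.

// Hypothetical helper, not official tooling: delete per-day trash
// directories older than 7 days for a single satellite, keep newer ones.
package sketch

import (
	"os"
	"path/filepath"
	"time"
)

func pruneOldTrash(satelliteTrashDir string, now time.Time) error {
	cutoff := now.UTC().AddDate(0, 0, -7)
	entries, err := os.ReadDir(satelliteTrashDir)
	if err != nil {
		return err
	}
	for _, e := range entries {
		day, perr := time.Parse("2006-01-02", e.Name())
		if perr != nil || !e.IsDir() {
			continue // not a date-named folder: leave it alone
		}
		if day.Before(cutoff) {
			if err := os.RemoveAll(filepath.Join(satelliteTrashDir, e.Name())); err != nil {
				return err
			}
		}
	}
	return nil
}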

Saw a video a couple of days back comparing the lack of finger-pinch sensors on the Cybertruck vs other car manufacturers. That verifies that the Cybertruck is indeed designed by humans, while Kia/Honda/Nissan/etc. are designed by AI, and Tesla will never compete on the AI front.

At some point in life you realize that Murphy’s Law is one of the three fundamental laws of the universe: What can go wrong, will.

The second fundamental law of the universe is Sod’s Law, which is based on Murphy’s Law: what can go wrong, will, at the worst possible time.

The third, I already mentioned in my previous reply.

There will be a point where the storagenode software decides to
rm -rf /$directory
I pray to $deity that that variable doesn’t end up being empty. Or at least that a human can use an AI to predict that the variable will be empty at some point and only rm -rf if the variable isn’t empty.
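
In Go terms, the kind of guard being wished for here is just a sanity check before the recursive delete. A sketch of my own, purely for illustration and not code from the storagenode:

// Illustrative guard only: refuse to recursively delete when the target
// path is empty or resolves to the filesystem root.
package sketch

import (
	"fmt"
	"os"
	"path/filepath"
)

func safeRemoveAll(dir string) error {
	cleaned := filepath.Clean(dir)
	if dir == "" || cleaned == "/" || cleaned == "." {
		return fmt.Errorf("refusing to remove suspicious path %q", dir)
	}
	return os.RemoveAll(cleaned)
}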

The amount of trash isn’t causing problems. It’s only at 19TB for now. What’s a brand new 20TB Ultrastar in the grand scheme of things? I’ll keep the data for the next 3 months (since changing logos is a higher priority right now) until a proper fix is implemented.

Wasn’t the recent test specifically tailored to an anticipated customer with a specific range of segment sizes?

A single disk has a ~200 IOPS limit; since your 100Mbps connection was saturated and the node handled it well, each file sent must have been quite large, 125kB or larger.

The median file size nodes hold today is 16kB. If that wasn’t that specifically tailored test but just scaled-up existing customer traffic, your node would choke at about 10Mbps of ingress.

It’s simple math: median file size times IOPS gives max bandwidth.
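
A back-of-the-envelope version of that math, using the figures from this post; the helper name and the one-IO-per-file assumption are mine, and real nodes spend several IOs per piece:

// Rough sanity check of the claim above; assumes one IO operation per file.
package main

import "fmt"

func maxIngressMbps(medianFileBytes, iops float64) float64 {
	return medianFileBytes * iops * 8 / 1e6 // bytes/s -> Mbps
}

func main() {
	fmt.Println(maxIngressMbps(125_000, 200)) // ~200 Mbps: large pieces, so 100Mbps fits easily
	fmt.Println(maxIngressMbps(16_000, 200))  // ~25.6 Mbps raw; with several IOs per piece this
	                                          // drops toward the ~10Mbps figure quoted above
}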

If we delete the old trash manually, does it require running the used-space filewalker to update the databases?
Or is the trash folder space updated by other walkers?
I keep the USFW off, including lazy mode, so I need to know if I must enable it if I decide to clean the trash.