Badger cache filewalker test results

What’s the needed version for this badger again?
What size would the database be for 21 TB of data?

I think no one knows yet; I only have data for 0.9 TB and the total cache folder size for that.

What if the node is restarted during the initial pass? What will happen to caching? Will the progress made be reset or saved? The next time the node starts, will caching continue from where it stopped or will it start from 0?

I think you’re conflating the used-space filewalker with the cache.

Basically, if the node needs the metadata of a piece, it will check the cache first. If it is there, it uses it; otherwise it reads the metadata from the filesystem/disk and then stores it in the cache.

So if the used-space filewalker is 50% done and the node restarts, it should be able to get through the first 50% quickly by using the cache, and then read metadata from the disk for the remaining 50% to fill the cache.
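
Roughly, that lookup path could be sketched like this (a minimal sketch assuming a badger-backed metadata cache; the key layout, struct, and paths are illustrative, not the storagenode’s actual code):

```go
package main

import (
	"encoding/json"
	"errors"
	"log"
	"os"
	"time"

	badger "github.com/dgraph-io/badger/v4"
)

// pieceStat is the metadata we want to avoid re-reading from disk.
type pieceStat struct {
	Size    int64     `json:"size"`
	ModTime time.Time `json:"modTime"`
}

// statWithCache checks the cache first and only falls back to the
// filesystem (and then fills the cache) on a miss.
func statWithCache(db *badger.DB, piecePath string) (pieceStat, error) {
	key := []byte(piecePath)
	var stat pieceStat

	// 1. Try the cache.
	err := db.View(func(txn *badger.Txn) error {
		item, err := txn.Get(key)
		if err != nil {
			return err // badger.ErrKeyNotFound on a miss
		}
		val, err := item.ValueCopy(nil)
		if err != nil {
			return err
		}
		return json.Unmarshal(val, &stat)
	})
	if err == nil {
		return stat, nil // cache hit: no disk metadata read needed
	}
	if !errors.Is(err, badger.ErrKeyNotFound) {
		return pieceStat{}, err
	}

	// 2. Cache miss: read the metadata from the filesystem.
	fi, err := os.Stat(piecePath)
	if err != nil {
		return pieceStat{}, err
	}
	stat = pieceStat{Size: fi.Size(), ModTime: fi.ModTime()}

	// 3. Store it so the next lookup (or a walk after a restart) is fast.
	val, err := json.Marshal(stat)
	if err != nil {
		return pieceStat{}, err
	}
	err = db.Update(func(txn *badger.Txn) error {
		return txn.Set(key, val)
	})
	return stat, err
}

func main() {
	db, err := badger.Open(badger.DefaultOptions("/tmp/filestatcache"))
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()

	if s, err := statWithCache(db, "/tmp/example-piece.sj1"); err == nil {
		log.Printf("size=%d modtime=%s", s.Size, s.ModTime)
	}
}
```

This is also why a restart mid-walk should be cheap for the already-walked half: those keys are already in the cache, so only step 1 runs for them.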

I am seeing ~3 GB for 12 TB of data, so I’d estimate around 6 GB for 21 TB, though it could be more. 10 GB is probably safe.
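
For reference, the arithmetic behind that estimate (a rough back-of-the-envelope linear scaling, assuming the cache really does grow proportionally with stored data):

```go
package main

import "fmt"

func main() {
	// Observed: ~3 GB of cache for 12 TB of pieces.
	cacheGBPerTB := 3.0 / 12.0 // ≈ 0.25 GB per TB
	fmt.Printf("estimated cache for 21 TB: %.2f GB\n", cacheGBPerTB*21) // ≈ 5.25 GB
}
```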

But this is just an assumption. Has anyone checked how this will work in practice?

Try running the badger cache without running the used-space filewalker; you’ll see that over time the cache grows even without a filewalker.

I will move one of my 16TB nodes tomorrow and will give it a try.

What is the name of the file in which this cache is stored?

It looks like this; it’s not just one file.

Wait, what?? There is no recalculation, no deletes, updates, or shrinking of the badger DB? It just grows and grows?

The cache is populated without a filewalker running (i.e. during normal usage), hence “pre-warmed”.

So it also accounts for deletes? Meaning it removes the DB records of a piece that was deleted?

It should. I’m thinking of using badger if we get a custom directory to save it in (i.e. on an SSD), leaving the node running for a couple of weeks to get the cache filled up by normal usage + GC, and then running the used-space filewalker. That’s the ideal usage for me.
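
A minimal sketch of that setup, assuming the cache directory could be pointed at an SSD and that a deleted piece also gets its cache entry dropped (the paths and the delete call here are illustrative, not the storagenode’s actual behavior):

```go
package main

import (
	"log"

	badger "github.com/dgraph-io/badger/v4"
)

func main() {
	// Hypothetical custom location on an SSD instead of the data directory.
	db, err := badger.Open(badger.DefaultOptions("/mnt/ssd/filestatcache"))
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()

	// When a piece is deleted (e.g. trashed by GC), removing its cached
	// metadata keeps the DB from growing forever.
	deletedPiece := []byte("/data/blobs/xx/deleted-piece.sj1")
	if err := db.Update(func(txn *badger.Txn) error {
		return txn.Delete(deletedPiece)
	}); err != nil {
		log.Fatal(err)
	}
}
```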

The path should be configurable. It would be stupid if it were hardcoded to the data dir. The main reason we use custom paths for databases is to avoid lockups and malformed databases, and I expect this would be no different.
But, thinking about it, I wonder what the IOPS requirements of the badger DB are. Could a USB 3.0 stick cope with them?

Yeah, but the databases were in the storage directory (an inconvenient choice), so you couldn’t bind another path there. As far as I understood from other topics, there is a dedicated directory for this cache, meaning you can just add a specific bind/mount on that path.

See up here: Badger cache filewalker test results - #4 by DisaSoft

Don’t worry. We can slap on a BadgerDB index to cache the BadgerDB index files.

Yeah, before long we’re going to call it a butcher cache.

@elek, this information is mostly for you.
OK, I have been able to kill the badger cache: one abrupt restart, and that was all it took.
The node would not start; the log contained only two rows saying that the wallet and email were read from the config, that’s all.
I renamed the cache folder and the node started, generating a new filestatcache folder.

Here are some results from today’s SLC bloom filters. All of the following nodes have had the badger cache warmed up. Some of these nodes were also clearing trash during the GC process. All of these are on XFS, no special filesystem metadata sauce.

Node 1: Walked 14M pieces trashing 250k pieces in 2 hours.

Node 2: Walked 18M pieces trashing 250k pieces in 5.25 hours.

Node 3: Walked 15M pieces trashing 320k pieces in 6 hours.

Node 4: Walked 17M pieces trashing 280k pieces in 2.5 hours.

Node 5: Walked 12M pieces trashing 8k pieces in 5.5 hours. Low trashing as I have stopped ingress for this node.

Overall quite pleased, even though the GC walker doesn’t query the metadata for every piece.
