US1 started acting strange for me a couple of days ago. It claims a couple of nodes are offline (while AP1/EU1/Saltlake say those nodes are fine). Those nodes show they're online in a couple of tools (including that QUIC is working) AND those nodes are still uploading/downloading data managed by the US1 satellite!
Like… how can I still be serving data for US1-related customers, and making the other three satellites happy…but the US1 satellite thinks the nodes are offline? I think they’re making other upgrades now (bloom filter code?) so I’ll wait a few days to see if it clears up…
To be honest, it is not like it was correct before.
For me this has been going on for more than a month now, and it is getting worse every day.
If nothing changes by the end of this month, all of my nodes will go offline.
As an example, my smallest node has constantly had over 3.2 TB of used space, but the average is reported and paid as 2.5 TB.
So if you cannot calculate correctly, or do not want to pay us according to actual usage, it will be a quick decision. In the end it is still a business for all of us, but one party is trying to mess around with the other…
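Note that a gap like 3.2 TB actual vs. 2.5 TB paid doesn't require the satellite to undercount the disk at all: if payout is based on the average of daily usage tallies and some tally days are simply missing, the average is dragged down while the disk stays full. A minimal sketch of that effect (the simple-average model and the number of missed days are my assumptions for illustration, not Storj's actual billing code):

```python
# Illustration (assumed model): payout is based on the average of daily
# disk-usage tallies. A node that truly holds 3.2 TB all month, but whose
# satellite records some tally days as 0 (missing reports), averages lower.

ACTUAL_TB = 3.2
DAYS = 30
MISSED_DAYS = 7  # hypothetical number of days the tally never ran

tallies = [ACTUAL_TB] * (DAYS - MISSED_DAYS) + [0.0] * MISSED_DAYS
reported_avg = sum(tallies) / DAYS

print(f"actual: {ACTUAL_TB} TB, reported average: {reported_avg:.2f} TB")
```

With seven missing days out of thirty, the average lands near 2.45 TB, roughly the kind of discrepancy described above, even though the node never held less than 3.2 TB.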
Hi and thanks for the welcome.
I have looked through the logs, and all filewalker processes finished successfully with 0 pieces skipped.
So looking fine there.
@Alexey
Why has the topic on the strange behavior of the US1 satellite been merged into this one? By that logic, everything might as well be merged into one big STORJ topic.
In my opinion a non-reporting satellite (a cause that is known up front) is quite different from having more disk usage than a given satellite reports (a cause that is unknown up front).
Besides, the flow of the topic feels a bit strange now, as if people are discussing different things alongside each other, randomly ignoring intervening posts on a different subject.
Nevertheless, any updates on why the US1 satellite acted weird?
I do not see any weird behavior from that satellite on my nodes. What should I be looking for (apart from a disk usage discrepancy, which affects all satellites)?
I think you're not taking us seriously on this one, because literally seven of my nodes had the same problem. I was actually about to report it when I saw the problem had already been acknowledged by others.
It wasn't a disk usage discrepancy, but a late reporting issue. The same charts of the same node I posted above now look like this.
All satellites (of course still with the strange, never-solved 'today overreporting' glitch):
As I already suggested in my first post on the issue: it corrected itself within 24 hours, but it had never happened to me before, and apparently not to many other members either.
Usually a discrepancy is caused by old remnants on the nodes from decommissioned satellites, or by unfinished filewalkers, a situation that needs some research. In this case it was very clear up front that it was a glitch of the US1 satellite. And I'm curious to know whether there's a reason why only this satellite apparently gave some people reason to worry.
Again: it was not a usual disk usage discrepancy, it was a reporting issue that (for me at least) had never occurred before, attributable only to the US1 satellite not reporting the used space.
I see. This is a usual thing when the tally doesn't finish in time (takes longer than usual). It's not a glitch, just likely more segments to scan. We do some autoscaling to speed up the tally when that happens.
A glitch would be if it didn't report for several days, but I didn't see such a thing on my graphs.
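The dip-and-rebound shape described earlier is consistent with a tally run that overruns the day boundary: the usage gets attributed to the day the run completes, so one day on the graph looks near zero and the next looks roughly doubled, while the total over the two days stays correct. A rough sketch under that assumption (this is my illustrative model, not the satellite's actual accounting code):

```python
# Assumed model: the satellite attributes usage to the day its tally run
# completes. If a run takes too long and finishes the next day, that day's
# graph shows a gap and the following day shows roughly double.

ACTUAL_TB = 3.2  # node's real, unchanging used space

def daily_report(tally_finished_on_time):
    """Usage attributed to two consecutive days on the dashboard graph."""
    if tally_finished_on_time:
        return [ACTUAL_TB, ACTUAL_TB]   # normal: one tally per day
    return [0.0, 2 * ACTUAL_TB]         # late: dip, then double

print("on time:", daily_report(True))
print("late:   ", daily_report(False))
```

Under this model nothing is lost or undercounted over the month; the same usage just shows up a day late, which matches a graph that "corrects itself within 24 hours."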