Node is hurting other nodes?

Hello.

I recently worked on one of my bigger servers.

A disk went bad - I am replacing it. This made the audit score fall for the node - fine I guess.

But, on another node that did not have these issues, disk is fine- I am seeing this:


Now it’s not only these two nodes affected, many nodes have fallen in audit score.

Node A = bad disk, and bad audit.
Node B = suspended but fine disk.

Node C,D,E,F = fallen audit score.

Node G,H,I = is fine.

Maybe someone here knows better than me? :slight_smile:

sorry I can’t see the screenshot for some reason.

Are these all on the same system? maybe something more general happens (system crash, motherboard or controller problems) that caused corruption across the board?

2 Likes

It is possible that bad HDD throttled sata controller, so it affected other response time.

3 Likes

So HDD made windows go into a black screen for like 6 hours. Not sure if this messed it all up.


I hope you can see these image?

The nodes a super new. Is it best to just do the following:
Make new nodes for the ones suspended.
Keep the ones not suspended and hope audit goes up when problem is fixed?

Yes it’s possible.

Faulty disk had literally 120000 ms delay.

no no imeges, give it time to upload

Sorry- here is the images:

No idea why images wont upload - i have tried 2 different machines now.

I dont know, may be format of pictures not supported, but it not loading

Sorry - i added an imgur link.

20 chars chars chars

Can a node like this recover from suspension?

Or is it lost?
Thanks in advance.

Suspension is not DQ, it just not get ingress till it will recover to some point

1 Like

Okay thank you Vadim - i will keep an eye on this.

reinstatement after suspension takes, what, 30 days? So if the node really was extremely new then starting a new one wouldn’t be too bad.

1 Like

Minutes or hours up to weeks - it works a little bit different than online checks.

1 Like