Following up on this thread about a problem on my node that led to a disqualification, I would kindly ask the development team to add a feature that puts the node “on hold” when audits fail, instead of disqualifying it. The SNO would then have a certain number of days to fix the problem before being disqualified.
Summarising the thread linked above:
- I got disqualified on 4 satellites in a matter of hours
- I had no clue what was going on: the log was clean, the hardware was responsive, the storage was fine, the node software was up, the Internet connection was up and my monitoring software was able to connect to the node port
- Perhaps it was a Linux kernel bug that somehow prevented communication, but neither Storj nor I was able to figure out the root cause
- On the satellite side, they could only see that the node was unable to fulfill requests
The problems started at 12:41 and I was disqualified at 16:39, less than 4 hours later.
The point is that the satellite knew something was wrong, but that information was never passed on to the SNO, who would have fixed the problem if they had only known the node was struggling.
So the request is to immediately suspend the node in cases like this instead of disqualifying it, to preserve Storj network integrity and at the same time allow the operator to sort out the issue.
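To make the request a bit more concrete, here is a rough sketch of the behaviour I have in mind. It is not how the satellite works today and not a proposal for the actual code: `Node`, `NodeState`, `record_audit` and the 7-day `GRACE_PERIOD` are all names and values I made up for illustration, and the rule that a single successful audit lifts the suspension is a simplification.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta
from enum import Enum
from typing import List, Optional

# Hypothetical grace period; the real value would be up to the Storj team.
GRACE_PERIOD = timedelta(days=7)


class NodeState(Enum):
    ACTIVE = "active"
    SUSPENDED = "suspended"        # "on hold": operator has time to fix things
    DISQUALIFIED = "disqualified"  # final, no way back


@dataclass
class Node:
    state: NodeState = NodeState.ACTIVE
    suspended_at: Optional[datetime] = None
    # Stand-in for whatever channel would alert the operator (email, dashboard).
    notifications: List[str] = field(default_factory=list)


def record_audit(node: Node, success: bool, now: datetime) -> NodeState:
    """Update the node state after one audit result (illustrative only)."""
    if node.state is NodeState.DISQUALIFIED:
        return node.state  # disqualification stays permanent

    if success:
        # Simplification: one good audit is enough to leave suspension.
        node.state = NodeState.ACTIVE
        node.suspended_at = None
        return node.state

    if node.state is NodeState.ACTIVE:
        # First failure: suspend and tell the operator, do not disqualify.
        node.state = NodeState.SUSPENDED
        node.suspended_at = now
        node.notifications.append(f"Node suspended at {now.isoformat()}")
    elif now - node.suspended_at > GRACE_PERIOD:
        # Still failing after the grace period: disqualify.
        node.state = NodeState.DISQUALIFIED

    return node.state
```

With something like this, a node that starts failing audits at 12:41 would still only be suspended at 16:39, and the operator would already have been notified at the first failure.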