Alexey
February 26, 2022, 4:53am
6
I don't mean that the devices are undesirable; I mean that the network protocols for attaching storage are undesirable, including iSCSI in your case, because your setup is clearly not reliable.
Corrected my response to be more clear.
There is no specific timeout; it depends on the underlying OS.
This is exactly the issue described here:
opened 05:10AM - 26 Feb 22 UTC
closed 02:22PM - 14 Mar 23 UTC
Bug
**Description**
If the HDD or the underlying OS has issues, the dir verification can hang forever waiting for a write or read to finish; as a result, the node will be disqualified very quickly.
See https://forum.storj.io/t/tuning-audit-scoring/14084/32
**Steps to reproduce the issue:**
1. Take or emulate a freezing HDD (one that is present in the system, but where any request hangs forever)
2. Run storagenode
3. Check the state: it will freeze forever on the write or read dir verification, resulting in an audit timeout on any read or write to blobs.
**Describe the results you expected:**
The dir verification methods should have a timeout and crash the node if the check does not succeed, to prevent audit failures.
**Describe the results you received:**
The dir verification methods hang forever, and the node fails audits because of timeouts and is quickly disqualified.
**Additional information you deem important (e.g. issue happens only occasionally):**
Tests: https://github.com/storj/storj/pull/4183