Hi all. Today I checked my storagenode logs and noticed a very strange behavior: almost all download operations are failing (2 days ago everything was fine).
Audit requests is OK, upload (PUT) operations are fine too, but download requests are failing less than a second after start.
Here is some logs:
Example:
2019-08-26T23:28:42.518Z INFO piecestore download started {“Piece ID”: “YBWI5NEPK7TUYJIAKXESYFRXDDMJLVY3FHP6GPLOUTWJ3ABIGX3Q”, “SatelliteID”: “12L9ZFwhzVpuEKMUNUqkaTLGzwY9G24tbiigLiXpmZWKwmcNDDs”, “Action”: “GET_AUDIT”}
2019-08-26T23:28:42.870Z INFO piecestore downloaded {“Piece ID”: “YBWI5NEPK7TUYJIAKXESYFRXDDMJLVY3FHP6GPLOUTWJ3ABIGX3Q”, “SatelliteID”: “12L9ZFwhzVpuEKMUNUqkaTLGzwY9G24tbiigLiXpmZWKwmcNDDs”, “Action”: “GET_AUDIT”}
(Audit is OK)
2019-08-26T23:28:52.817Z INFO piecestore download started {“Piece ID”: “DG225BAD6DXM4OB2GRTXD6Z3HHRD42PIX7JCCHZVLZ5RWZJB3MNA”, “SatelliteID”: “12EayRS2V1kEsWESU9QMRseFhdxYxKicsiFmxrsLZHeLUtdps3S”, “Action”: “GET”}
2019-08-26T23:28:52.902Z INFO piecestore download failed {“Piece ID”: “DG225BAD6DXM4OB2GRTXD6Z3HHRD42PIX7JCCHZVLZ5RWZJB3MNA”, “SatelliteID”: “12EayRS2V1kEsWESU9QMRseFhdxYxKicsiFmxrsLZHeLUtdps3S”, “Action”: “GET”, “error”: … (error is “context canceled”, stack trace is omitted)
(download failed just after start)
As you can see, the difference between “download started” and “download failed” events is less than 100 milliseconds, and I have many examples like this in my log - is this OK?
UPD: (maybe this helps): failed GET requests happens for the Sattelites 12EayRS2V1kEsWESU9QMRseFhdxYxKicsiFmxrsLZHeLUtdps3S and 12L9ZFwhzVpuEKMUNUqkaTLGzwY9G24tbiigLiXpmZWKwmcNDDs.
GET Requests from satellite 118UWpMCHzs6CvSgWd9BfFVjw5K9pZbJjkfZJexMtSkmKxvvAW processed without errors.