so there was no indication of a crash. But if your service is stopped, then it was.
When the node crash it’s usually post a reason to the log. There are should be Unrecoverable and/or FATAL errors.
Please search for these errors in your logs and post the recent one.
Hmmm… drive is acting reasonably well, I presume those were IOPs you posted. Regardless of node running or not, that’s ok, 2 or 3 yr old WD or Seagate 8-10TB, kinda yanky on the read I/O, so it’s not some new enterprise hdd, or whatever.
Check your Windows system logs, for ‘service stopped unexpectedly’ errors, it has done this N times., etc. With regard to your Powershell not running correctly, and lack of FATAL in your logs, seems your storagenode.exe is being killed off n’ that also points to overtaxed memory management.
You gotta suspect a system issue, if you can’t even run a simple Powershell command… Obviously. And you really should regurgitate your system specs, and provide full details in this thread. Anyone coming across this thread has nothing to reference.
If you would still receive a >> prompt (which usually indicates the missing quote), then I would suggest to try to update a PowerShell to the latest version.
Nothing relative search found for those evenids. My powershell is not running incorrectly. There are (perhaps were) FATAL errors in the logs. Only every see the memory useage at about 1/2 use. Is there a specific process to check the memory amangement for STORJ?
I don’t, as the increased issues are only arising after diredtions from the forum.
CPUS : Intel Xeon (E5540 @ 2.53 (x2)
MEMORY: 72 GB
OS: Windows Server 2019 (64-bit)
Nothing indicates disks are of those states. I am also not able to find specific performance requirements(HDD wise) but I have noted that these results(above) are more than sufficient.
Writability check will time out if your disk subsystem is overwhelmed with IO. HDD can only sustain around 200 iops.
You need to offload as much IO as possible. Review old threads on storage system optimizations. For example, you would need to disable sync writes, disable atime updates, any indexers and scanners, move databases to an SSD. That’s pretty much all you can do on windows, it’s quite a limited platform.
You need to increase the writable check timeout on 30s,
This one is weird:
I would suggest to check the system disk for errors and fix them. If that’s SSD, it should not have context canceled errors (in this case this is mean that storagenode was unable to open this database). Could you please exclude the folder with databases from the antivirus scans?
Please also check databases for errors:
No specific process, just in the details, the storagenode.exe may spawn it’s own child processes two or three times, the aggregate memory consumption will show in Taskman.
Other than the above noted replies…so far, I’d only additionally suggest you defrag that disk, run a disk check on your db drive. There is maybe one more parameter I can think of in the config.yaml which may help. Server versions of windows are easily blown away by high ingress - ie: if you’ve got a 1 Gb or + internet connection and being rifled test data. It will overwhealm your IOPs on a regular drive like that, plugging up ram and killing itself, as mentioned earlier. Look in the config.yaml for ____.write-buffer-size… note what it was, if not hash ignored, and set it to 128 KB, or 256 KB.
As I have said to you, with the exception of the SDD, these have been done. Also note that the issue only has come up at the time of the problem, not prior.
You need to stop creating new account. WTF? If you can’t post, maybe there is a reason for that? Just think about it.
I don’t understand you question.
This is also quite meaningless redundant statement.
Esther stop trolling and wasting everyone’s time, or if you actually need a solution — do a little bit of homework. Don’t expect to be spoonfed on every step of the way.
Things like this are clearly explained in the previous discussion that the quote is from. I think you are impatient and want things done for you, but running a node requires a level of technical understanding that may be outside of your current skill set. Either that or you need to slow down, read what is being clearly asked of you to do and work with the people helping you instead of against them. It would help if you communicated in more detail and didn’t respond like you are sending text messages.
We are happy to help you get your nodes working properly (or at least as well as everyone else’s currently are) but we need to work together and reduce the friction.