I had an abrubt power off of my storage system, and now when storj restarts I get a message in the log “INFO hashstore unclean shutdown detected: reconciling logs”
Every several minutes I will get a message like “WARN hashstore mismatched log tail”.
It seems to be taking a VERY long time. like… almost two days so far.
How long should I expect it to take? I think the node had around 6TB of data. And is there something else I should be doing to fix it?
Mmm. You mean node is not running on the same host as the storage? Why? And if that’s the case – why do you use memtable, that is guaranteed to go out of sync?
But yeah two other nodes were not using memtbl and they did not suffer the same fate after the unclean storage shutdown. At least on this anecdote of one occurrence.
But the rebuild, or whatever it is, is STILL running…
… it’s STILL running. IT’s been like 4 days. Every 7 or so minutes I get a “mismatched log tail” WARN message, but the node has not yet come up.
I feel like I’m definitely getting a suspension or disqualification.
Is there a way to know how long this… whatever this is… will take or make it run faster? I’ve seen mention of a dedicated write-hashtbl program, but a) I don’t understand it and b) dunno if it will fix the particular problem I’m having.
It will be slower than the filewalker on piecestore. Because it’s literary doing the same thing with extra steps and without the benefit of metadata space locality.
You cannot make it faster because it’s limited by seek latency.
You are experiencing one of the tradeoffs of hashstore. It’s a worthwhile tradeoff, because abrupt power loss or kernel panic on a stable system is not a thing in modern times.