My nodes just faced a similar issue, but on Linux.
My ISP Box lost Internet connectivity for about 20 minutes and restarted for unknown reasons, leaving my nodes without Internet.
I thought they would reconnect automatically, but something abnormal happened apparently as most of my nodes shut down instead of resuming operation (I noticed this but couldn’t restart them before some hours as I was on the road).
Here is an excerpt of one of my nodes’ logs at the moment when it stopped (@ 2021-01-15T11:00:43Z), and when I restarted it:
2021-01-15T11:08:54.769Z INFO piecestore download started {"Piece ID": "6RSLYXDV4BOAQB2AG6JIMLJKGFEOC35MJESWI4F3FJQU6ODLISMQ", "Satellite ID": "1wFTAgs9DP5RSnCqKV1eLf6N9wtk4EAtmN5DpSxcs8EjT69tGE", "Action": "GET"}
2021-01-15T11:08:55.715Z INFO piecestore downloaded {"Piece ID": "ZVU2TPYPZF6323HQIP7S2THOLQ5WRKCNMF3TZLOTBUWTUJEJUW7A", "Satellite ID": "12rfG3sh9NCWiX3ivPjq2HtdLmbqCrvHVEzJubnzFzosMuawymB", "Action": "GET"}
2021-01-15T11:08:56.664Z INFO piecestore download started {"Piece ID": "2LTR33IJ5C7J5EOU5UJIYOJIYOII7MKCRMAU6TB2Y3CY3PYPJZDA", "Satellite ID": "1wFTAgs9DP5RSnCqKV1eLf6N9wtk4EAtmN5DpSxcs8EjT69tGE", "Action": "GET"}
2021-01-15T11:09:03.324Z INFO piecestore downloaded {"Piece ID": "6RSLYXDV4BOAQB2AG6JIMLJKGFEOC35MJESWI4F3FJQU6ODLISMQ", "Satellite ID": "1wFTAgs9DP5RSnCqKV1eLf6N9wtk4EAtmN5DpSxcs8EjT69tGE", "Action": "GET"}
2021-01-15T11:00:23.342Z INFO Configuration loaded {"Location": "/app/config/config.yaml"}
2021-01-15T11:00:23.367Z INFO Operator email {"Address": "[my@email.address]"}
2021-01-15T11:00:23.367Z INFO Operator wallet {"Address": "0x45BB1F8F88B372AD67B37B32639A86870Fa7c3eB"}
2021-01-15T11:00:23.585Z INFO Got a signal from the OS: "terminated"
2021-01-15T11:00:43.388Z ERROR Invalid configuration. {"error": "invalid contact.external-address: lookup \"[my.public.hostname]\" failed: lookup [my.public.hostname] on 192.168.1.1:53: read udp 172.17.0.8:43315->192.168.1.1:53: i/o timeout", "errorVerbose": "invalid contact.external-address: lookup \"[my.public.hostname]\" failed: lookup [my.public.hostname] on 192.168.1.1:53: read udp 172.17.0.8:43315->192.168.1.1:53: i/o timeout\n\tstorj.io/storj/storagenode.(*Config).Verify:154\n\tmain.cmdRun:155\n\tstorj.io/private/process.cleanup.func1.4:363\n\tstorj.io/private/process.cleanup.func1:381\n\tgithub.com/spf13/cobra.(*Command).execute:842\n\tgithub.com/spf13/cobra.(*Command).ExecuteC:950\n\tgithub.com/spf13/cobra.(*Command).Execute:887\n\tstorj.io/private/process.ExecWithCustomConfig:88\n\tstorj.io/private/process.ExecCustomDebug:70\n\tmain.main:385\n\truntime.main:204"}
2021-01-15T11:00:43.388Z FATAL Unrecoverable error {"error": "invalid contact.external-address: lookup \"[my.public.hostname]\" failed: lookup [my.public.hostname] on 192.168.1.1:53: read udp 172.17.0.8:43315->192.168.1.1:53: i/o timeout", "errorVerbose": "invalid contact.external-address: lookup \"[my.public.hostname]\" failed: lookup [my.public.hostname] on 192.168.1.1:53: read udp 172.17.0.8:43315->192.168.1.1:53: i/o timeout\n\tstorj.io/storj/storagenode.(*Config).Verify:154\n\tmain.cmdRun:155\n\tstorj.io/private/process.cleanup.func1.4:363\n\tstorj.io/private/process.cleanup.func1:381\n\tgithub.com/spf13/cobra.(*Command).execute:842\n\tgithub.com/spf13/cobra.(*Command).ExecuteC:950\n\tgithub.com/spf13/cobra.(*Command).Execute:887\n\tstorj.io/private/process.ExecWithCustomConfig:88\n\tstorj.io/private/process.ExecCustomDebug:70\n\tmain.main:385\n\truntime.main:204"}
2021-01-15T14:41:31.057Z INFO Configuration loaded {"Location": "/app/config/config.yaml"}
2021-01-15T14:41:31.061Z INFO Operator email {"Address": "[my@email.address]"}
2021-01-15T14:41:31.061Z INFO Operator wallet {"Address": "0x45BB1F8F88B372AD67B37B32639A86870Fa7c3eB"}
2021-01-15T14:41:37.287Z INFO Telemetry enabled {"instance ID": "12VHJRHeqGmnHsD3bcV2bkpfbv8jwER8NtX7h2wpgED8qDD6oHS"}
2021-01-15T14:41:38.150Z INFO db.migration Database Version {"version": 46}
(EDIT: Discourse seems to remove invalid UTF8 characters, but there is a series of ?
characters before 2021-01-15T11:00:23.342Z
)
I don’t get why invalid contact.external-address: lookup
could cause the Node to fail? Is it not supposed to wait until it is reachable again?
This said, it seems the host machine restarted or something before Internet got lost… it’s so weird, I don’t get what happened…