My combined ingress from all nodes was over 1 TB yesterday while the router reported only 600 GB. And there is a lot of other traffic. So I guess the new fake ingress is 2x to 3x real ingress.
You still donât get a benefit of running multiple nodes on the same subnet but it will scale up and down the traffic per node not per subnet. So lets say we have 2 nodes sharing the same subnet and one has a better success rate. It will get selected more often by the power of 2 node selection but it would still only reach halve the request rate it could get without the second node in the subnet.
I donât know what you are talking about. You are still on the wrong party. My node has a 99% success rate. There is no fake traffic if you win all the races. Also how do you expect netdata to show any fake traffic? There is no code change we could do that would let netdata show wrong numbers.
What is the timeframe to recover?
I was talking to @ACarneiro. Could it be you are on the wrong party?
I am a mind reader you know. We run a test and all the feedback that we got within minutes must be from tools that track this in minute intervals. So I know exactly what party we all signed up for and I see no room for fake traffic on this party.
Iâm mostly seeing spikes in Ingress, though. Not much change in egress at all.
Are you just testing uploads your end or am I just not being selected?
The part you quote was about the stomping herd effect. The power of 2 node selection shouldnât overload the node in the first place so no need for a recovery. As long as you still have a reasonable success rate the recovery should take just minutes.
If you manage to get a 0% success rate you might end up with no uploads at all. That makes recovery more difficult. In that case this timer will forget the 0% success rate after some time and your node would get selected one time at random and if you make it you will get selected more frequent. If you donât make it you will stay at 0% until the timer is over again for a new chance.
Edit: I think there are 4 generations in total. So it should retry once every 30 minutes? I mean this only applies to the nodes with the lowest success rate. I have a hard time to imagine how such a node would look like.
Iâm mostly thinking of a case where due to an unrelated I/O the node started performing badly. So, let say, I run a scrub on a partition where node is hosted, making my node lose 90% races for that time. How long after the scrub finishes my node will start getting the regular traffic again?
Also, would it be possible to publish these statistics, so that
- node operators can verify how their nodes compare to othersâ?
- for purposes of comparing performance of network in various geographical regions (an idea I recall being suggested somewhere on the forum as well)?
BTW, happy to see @BrightSilenceâs suggestion implemented!
Edit: and one more thing: would this approach penalize more nodes in geographical regions with less customers?
Also very compute efficient. Nice solution. And since you only have to beat a single node, itâs not like nodes wonât be selected at all. Except for I guess the absolute worst one. But that one probably deserves it. And I see youâve already taken care of the fact that they will be tried again at some point.
Though I feel like it also leaves some performance on the table. Every time you compare 2 slow nodes, you end up selecting a slow node anyway. Though I donât know how to solve that efficiently and without causing the stomping herd effect. So Iâll shut up for now and give that some thought first. (Until I think of something more inspired than power of 3! node selection
)
Whatâs the logic of this? Does it look at a specific timeframe? Last x uploads? Or something similar to how scores are updated?
Sometimes I have decent ideas, I guess. I have suggested changing the RS settings many times as well, so Iâm a happy camper right now, seeing all these tests happening.
Node selection only deals with the whole node population on upload. Downloads are already limited to the nodes that have the pieces. Currently I believe 39 downloads are initiated for 29 successful pieces. (@littleskunk correct me if Iâm wrong or this has changed) Since the success threshold is now 65, they couldnât select double 39 to use power of 2 node selection. Given that and since it seems uploads are a priority atm, Iâm guessing itâs only uploads for now.
Ps. I noticed this line still lists 80 as default success threshold.
10 minutes per generation * 4 generations = 30 minutes history
Somewhere out there is a SNO with several external 2.5" SMR USB drives⌠plugged into their grandmaâs Gateway computer⌠passed through VMWare Workstation to a Window 2012 VM, and combined with Storage Spaces RAID5 before being presented to their node.
âŚand they just read about the new power-of-2 node selection⌠and are thinking:
âCrap.â
Power of 2 choices has an explicit name because itâs a spectacular jump from just picking one item at random. But going from 2 to 3, the difference turns out to be miniscule. This is one of those counterintuitive results from probabilistic algorithms field. See e.g. this PDF for some exposition.
It makes a lot of sense here, itâs great that Storj applied it with success.
Yeah, that was clearly a joke. You quoted it by skipping the first part of the sentence and omitting the emoji, both of which made it pretty clear that was not a serious suggestion.
Isnât it in fact âcrapâ for the lower 50% of nodes?
It seems that it will be more percentile-ish. The fastest node out of the bunch will be selected 100% of the time, a median node will be selected 50% of the time, etc.
Ok. But all this testing and the optimizations have been announced being necessary for customers in the sales pipeline. I believe you have announced that, I canât remember. So it would be great to learn if with all the tweaking we are gearing towards what those customers expect because at the end this is what everybody wants I guess, to have the network ready to take on the requirements of them. And if there are signs that the progress brings us closer to the deal and the potential customers are pleased with what we are able to offer, then it would be great to hear about it. Even more of course if a contract has been won.
This sounds very interesting. And if it helps to not to overload the nodes with upload requests they cannot fulfill then it is even better. Are we going to see longer tests with that like over the course of several days?
But does this kind of selection work even more in favor of nodes that are close to the uploader? It is just because for the use case where upload and download location are the same or close, this is perfect.
But Storj advertises kind of worldwide CDN-like file access speeds. And when the use-case is to move files fast, so that they get uploaded in place Toronto but downloaded in Singapore would that new node selection not hurt that experience?
My idea would be to give a customer an option to select which matches their use case best:
- Upload in A download in A
- Upload in A download in B
- Upload in A download everywhere
And while the node selection for upload could always be the ones that are fastest to the uploader which gives him a good uploading experience. In the background the repair workers could distribute the files over time according to the customers use case.
And of course also the S3 gateways could take that profile selection into account as well when uploading and select nodes according to the customers profile.
Maybe at some time in the future maybe even AI could be used to find the best node selection and distribution for each file of a customer based on the upload and download behavior. An AI powered node selection and file distribution based on individual customer behavior, that would be awesome.
Oh yes that is an interesting thought.
We have these silly limited lines here all over.
And even fibre is being sold not as symmetric but with limited upload bandwidth.
So a node with good download bandwidth can accumulate, with that change now maybe even more than ever. But on a crappy asymmetric line with limited upload bandwidth he canât deliver â Downloading customers not happy
49 posts were split to a new topic: âUnlimitedâ ISP plans