Paradigm shift: from a fixed amount of data to a percentage of free space

Hi Guys,

I am going to throw an idea out there: why doesn’t the storage node software allow setting a maximum percentage of free disk space, instead of the current fixed amount of data allocated to Storj?

For example, if I set up my node by saying “my maximum Storj usage is allowed up to 10% of free disk space”, I would know that Storj will stop adding new data if the free space falls under that level. Even better: Storj could also remove some data to bring the free space back up to that level!
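To make the "release space back to that level" part concrete, here is a rough sketch of the arithmetic, assuming the setting is read as "keep at least N% of the disk free". The function name `bytes_to_release` and its parameters are made up for illustration and are not part of the storage node software.

```python
def bytes_to_release(total_bytes: int, free_bytes: int, min_free_percent: int) -> int:
    """Return how many bytes would have to be moved off the disk so that
    free space is back at the configured percentage (0 if already there)."""
    if not 0 <= min_free_percent <= 100:
        raise ValueError("min_free_percent must be between 0 and 100")
    # Integer arithmetic avoids floating-point rounding on large byte counts.
    target_free = total_bytes * min_free_percent // 100
    return max(0, target_free - free_bytes)


GB = 10**9
# Hypothetical 1000 GB disk with 80 GB free and a 10% free-space target.
print(bytes_to_release(1000 * GB, 80 * GB, 10) // GB)  # 20
```

With 150 GB free on the same disk the result would be 0, meaning the node could keep accepting data.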

It would be a better metric for managing disk usage for everybody! Some people might want to keep 50% of their disk space free at all times, others 30% (or whatever).

Of course, you would need to add a new egress category called “Redundancy traffic” (my suggestion), for which the node operator would NOT get paid, to cover this transfer of data to another node.

This can be coded without the need to audit each node’s percentage of free space: just run a cron job every minute or every 5 minutes on the storage node, so that if the percentage of free disk space < max($free_disk_space_node_operator_setting, 10%), the node shifts data elsewhere. Decentralization is kept: the node sends its own data elsewhere, and there is no need for a satellite to be involved.
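As a minimal sketch of that periodic check (not how the storage node software actually works), something like the following could be run from cron. All names here (`MIN_FLOOR_PERCENT`, `effective_threshold`, `should_shift_data`) are hypothetical, and the actual "shift data elsewhere" step is left out because it would depend on the node's real transfer mechanism.

```python
import shutil

# Hard floor from the proposal: the operator setting can never go below 10%.
MIN_FLOOR_PERCENT = 10.0


def effective_threshold(operator_setting_percent: float) -> float:
    """max($free_disk_space_node_operator_setting, 10%) from the post."""
    return max(operator_setting_percent, MIN_FLOOR_PERCENT)


def free_percent(path: str) -> float:
    """Percentage of the filesystem containing `path` that is currently free."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.free / usage.total


def should_shift_data(current_free_percent: float, operator_setting_percent: float) -> bool:
    """True when free space has dropped below the effective threshold."""
    return current_free_percent < effective_threshold(operator_setting_percent)


if __name__ == "__main__":
    # Example: the operator asked to keep 30% of the disk free.
    pct = free_percent(".")
    print(f"free: {pct:.1f}%, shift needed: {should_shift_data(pct, 30.0)}")
```

Because the check works on a percentage of whatever the disk size currently is, expanding the disk would not require touching the setting.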

Such a feature would be great, as it completely removes the need for the operator to monitor the storage node! (SET & FORGET!)

Additionally:

  • This setting cannot be under 10% (that leaves time to move data if the disk starts to fill up), so that nobody can set Storj to use 99% or 100% of the disk.
  • I do not know how the software copes with full disks. But with such a feature, you fold that 10% overhead rule into it and make it disappear, because the overhead data usage is captured by the percentage of free space, whatever the overhead is!
  • Maybe this also helps the graceful exit routine, since it is quite similar in that it shifts data elsewhere?
  • Disks can be expanded (for example on a VPS or in the cloud), so there is no need to modify the node’s parameter after such an expansion.
  • The new “Redundancy traffic” category would allow the data’s durability to be increased or decreased, opening the door (as with Amazon S3) to different levels of durability on the customer-facing side of the offering (storage classes for Tardigrade), and would have the effect of increasing what storage node operators earn for storing data.

This should be under Ideas & Suggestions

Post moved to that category! Thanks!

While I generally like the idea of a self-maintaining network with as little satellite involvement as possible, I wonder how the satellite can keep track of the pieces of a file if it is not involved in such a data transfer operation?

Sorry for the confusion: to be more precise, I meant that the transfer of data does not transit through the satellite but goes directly to the other node (peer-to-peer). There is probably some kind of index update to be done back in a registry somewhere (on a satellite)?

I think you can enhance your suggestion by reading the whitepaper.

Overall a good idea, but I think this would be too fiddly for Storj’s purposes and prone to frequent, otherwise needless transactions for the people who would use it.

The data never transits through a satellite. All data traffic flows directly between the customer’s application (uplink) and the storagenode(s).
The satellite is an address book, repair, audit and payment service.

OK so that part is already done! :sweat_smile:

That was a great read, thank you!

I still think that my idea makes things simpler for an SNO, over and above the suggested unpaid-traffic feature.