Very high CPU usage during upload

I started the S3 gateway for Storj and started mirroring my MinIO server (50 GB of files ranging in size from 100 KB to 3 MB). The S3 gateway could not sustain my max upload speed of 500 Mbps (a separate topic documented in other threads), but whenever it was pushing 300 to 500 Mbps, it was using 400% CPU (4 cores loaded fully). I am running an Intel i5-8400, which has the AES-NI and AVX extensions, so I would not expect encryption and upload to take that much CPU. In comparison, when I upload backups to MinIO (using restic, which encrypts the backup and breaks it into smaller blobs), I can max out my 500 Mbps connection and neither restic nor the MinIO server (using HTTPS) uses much of my CPU.

I was planning to use Storj in a cloud setup where I would be uploading lots of data directly from cloud servers to the Storj network, but such a high CPU requirement from the Storj uplink would slow down my servers a lot (CPU is very expensive in the cloud). Is there a way to minimize CPU usage?

Yes, only upload 1 file at a time instead of many, because they are being split and encrypted.

Please stop trolling. Telling me that I can minimize CPU usage by not using the network is not really an answer, is it? The whole point is to use the network and upload files to it. I know it won't use CPU if I am not uploading files to it.

Excuse me? Who said not to upload a file? I said 1 file at a time.

The cost per file is still the same, so that does not change anything. Uploading 1 file at a time does not work at scale.

I can hit 900 Mbit easily with 1 file; I don't know what you're talking about.

I'm talking about files ranging in size from 100 KB to 3 MB (it is spelled out in the first post).

I'm not disagreeing with you, I'm just saying why you're seeing high CPU usage.

How much data total are you trying to mirror?

It was close to 60 GB total. It seemed that the faster the upload speed, the more CPU was consumed.

Can you give us more info about how you are mirroring the data? It would also be nice to know your costs in terms of cloud CPU. From my perspective it's super cheap nowadays, but maybe you are using a special setup. We have uploaded petabytes of data via cloud servers for load testing.

The erasure encoding is taking up most of the CPU. Hypothetically you could change some config in your uplink to not upload so many files at the same time, which should reduce CPU load to the levels you want.
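To give a feel for the kind of work involved (this is not Storj's actual code, and the shard counts below are placeholders rather than Storj's real Reed-Solomon parameters), here is a rough sketch of erasure-encoding a single segment with the klauspost/reedsolomon library; the Encode call is pure CPU and scales with the segment size and the number of parity shards:

    package main

    import (
        "crypto/rand"
        "fmt"
        "log"
        "time"

        "github.com/klauspost/reedsolomon"
    )

    func main() {
        // Placeholder shard counts -- NOT Storj's actual RS parameters.
        const dataShards, parityShards = 29, 51

        enc, err := reedsolomon.New(dataShards, parityShards)
        if err != nil {
            log.Fatal(err)
        }

        // One 64 MiB "segment" of random data to encode.
        segment := make([]byte, 64<<20)
        if _, err := rand.Read(segment); err != nil {
            log.Fatal(err)
        }

        // Split the segment into data shards, then compute the parity shards.
        shards, err := enc.Split(segment)
        if err != nil {
            log.Fatal(err)
        }

        start := time.Now()
        if err := enc.Encode(shards); err != nil { // the CPU-heavy step
            log.Fatal(err)
        }
        elapsed := time.Since(start)

        fmt.Printf("encoded 64 MiB into %d shards in %s (%.1f MiB/s)\n",
            dataShards+parityShards, elapsed, 64.0/elapsed.Seconds())
    }

Actual numbers depend heavily on the implementation and parameters, and encryption adds to this; multiply it by the number of segments being encoded concurrently and it is easy to see where the cores go.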

Have you considered adding acceleration through OpenCL or a GPU?

I am using restic to create hourly backups of my server. Currently all the files are stored in MinIO. I ran the Storj S3 gateway and used mc mirror to copy the data to Storj. While mirroring the data, I observed the CPU and network load. As for pricing, you can use DigitalOcean's prices: 4 vCPUs is around $40. However, my CPU is a dedicated 6-core i5-8400 (4 GHz max turbo boost), which is probably double the compute power of DigitalOcean's shared vCPUs.
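For context, the mirroring is essentially doing the following for every object (a minimal Go sketch using the minio-go client, not mc mirror's actual implementation; the endpoints, credentials, and bucket names are placeholders):

    package main

    import (
        "context"
        "log"

        "github.com/minio/minio-go/v7"
        "github.com/minio/minio-go/v7/pkg/credentials"
    )

    func newClient(endpoint, key, secret string) *minio.Client {
        c, err := minio.New(endpoint, &minio.Options{
            Creds:  credentials.NewStaticV4(key, secret, ""),
            Secure: false, // local services over plain HTTP in this sketch
        })
        if err != nil {
            log.Fatal(err)
        }
        return c
    }

    func main() {
        ctx := context.Background()

        // Placeholder endpoints/credentials: the source MinIO and the local Storj S3 gateway.
        src := newClient("minio.local:9000", "SRC_KEY", "SRC_SECRET")
        dst := newClient("127.0.0.1:7777", "DST_KEY", "DST_SECRET")

        const bucket = "restic-backups"

        // Copy every object from the source bucket through the gateway.
        for obj := range src.ListObjects(ctx, bucket, minio.ListObjectsOptions{Recursive: true}) {
            if obj.Err != nil {
                log.Fatal(obj.Err)
            }
            r, err := src.GetObject(ctx, bucket, obj.Key, minio.GetObjectOptions{})
            if err != nil {
                log.Fatal(err)
            }
            if _, err := dst.PutObject(ctx, bucket, obj.Key, r, obj.Size, minio.PutObjectOptions{}); err != nil {
                log.Fatal(err)
            }
            r.Close()
        }
    }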

@570RJ Are these complete backups or incremental backups?

Yes, restic makes incremental backups with dedup. Currently I have 12 hourly, 7 daily, 4 weekly, and 12 monthly backups. Restic manages all of it itself.

The server is running mail, Nextcloud, Grafana + Prometheus, and other services. Right now I only had a huge load because of the initial sync. However, I need to run a monthly cleanup job so stale blobs are removed from the repo, and it requires some data download/upload and repository management from restic (index rebuilding). That will probably take 30 minutes to a couple of hours, and I expect CPU load from the network IO of the Storj S3 gateway. I will try to set the lowest priority on the S3 gateway so it does not affect the other services; the process will take longer than it would otherwise, but that's not really a problem. I just wanted to bring it up for the team since, in the cloud, my speed will be limited by my CPU and not the network, which is unexpected from a data service perspective (usually network and HDD are the bottlenecks).

In other words: if you were to run a benchmark on the S3 gateway where HDD and network are stubbed out (instant read of data and instant upload), what would be the maximum theoretical throughput of the S3 gateway? According to my results it is around 10 MB/s per dedicated CPU core.
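Something like the following is what I mean by stubbing out the disk (just a sketch with the minio-go client; the endpoint, credentials, and bucket are placeholders, the bucket is assumed to already exist, and the gateway's own outbound leg to the Storj network is of course not stubbed here): upload a buffer of zeros straight from memory to the local gateway and time it, then compare the MiB/s against the gateway's CPU usage during the run.

    package main

    import (
        "bytes"
        "context"
        "fmt"
        "log"
        "time"

        "github.com/minio/minio-go/v7"
        "github.com/minio/minio-go/v7/pkg/credentials"
    )

    func main() {
        ctx := context.Background()

        // Placeholder endpoint/credentials for a locally running S3 gateway.
        gw, err := minio.New("127.0.0.1:7777", &minio.Options{
            Creds:  credentials.NewStaticV4("ACCESS_KEY", "SECRET_KEY", ""),
            Secure: false,
        })
        if err != nil {
            log.Fatal(err)
        }

        // 256 MiB of zeros served straight from memory, so disk I/O is out of the picture.
        const size = 256 << 20
        payload := bytes.NewReader(make([]byte, size))

        start := time.Now()
        if _, err := gw.PutObject(ctx, "bench", "zeros.bin", payload, size, minio.PutObjectOptions{}); err != nil {
            log.Fatal(err)
        }
        elapsed := time.Since(start)

        fmt.Printf("uploaded %d MiB in %s (%.1f MiB/s)\n",
            size>>20, elapsed, float64(size>>20)/elapsed.Seconds())
    }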

Which server has OpenCL or GPU acceleration? This would only be helpful on desktops.

it was using 400% CPU (4 cores loaded fully)

If an application maxes out 4 of the 6 cores of your CPU, that translates to roughly 66% CPU usage, not 400%.

@twl top by default reports CPU time relative to 1 CPU core, so a process fully loading 4 cores shows up as 400%. You can read more about how it works here: https://unix.stackexchange.com/questions/145247/understanding-cpu-while-running-top-command
