Table of Contents

Design a syncing file server (Dropbox)

Back of the Envelope

(S3 standard costs 1c per GB/month). (S3 Glacier costs about 1/2c for millisecond access)

Data transfer in to S3 is free, but it costs about 10c/gb out (painful)

Also it costs $4 per 10M CRUD requests.

At 300B requests per month, the requests would cost about $120k/month.

Assuming we have to save all data, this is 1.5 EB of data (1500 petabytes).

It costs about $30M a month to store all this data on S3. (it would be about $6M if you used glacier instant access)

Technically S3 takes care of this, but if we were to roll our own, we’d have 3 copies of the data (ideally), so this would be 4.5 EB of data.

Questions

Architecture