Open this lesson in your favourite AI. It'll walk you through the why, explain the demo, and quiz you on the try-it list.
Object storage is the workhorse of the cloud — it is where logs, backups, ML datasets, static assets, and user uploads land. The concept is identical across providers (buckets of immutable blobs addressed by key), but the defaults around encryption, versioning, public access, and consistency differ in ways that bite you on day two. Learning the four side by side is a once-and-done investment.
Create a bucket, upload a file, and list it on each cloud with the same handful of CLI commands.
Use these three in order. Each builds on the one before.
In one paragraph, explain object storage like I'm new to it, with a concrete S3 example of buckets, keys, and objects.
Walk me through how an S3 multipart upload actually works step by step — initiate, upload parts, complete, and how the ETag is computed differently than a single-PUT.
Given a 2 PB ML training dataset that is read once a week and hot for 48 hours during each run, design the storage class + lifecycle policy on S3 vs GCS and justify the cost difference.
BUCKET=capstok-obj-demo-$RANDOM
aws s3api create-bucket --bucket $BUCKET --region us-east-1
echo "hello from S3" > hello.txt
aws s3 cp hello.txt s3://$BUCKET/hello.txt
aws s3 ls s3://$BUCKET/
aws s3api head-object --bucket $BUCKET --key hello.txt \
--query "{size:ContentLength,etag:ETag,sse:ServerSideEncryption}"
# Cleanup
aws s3 rb s3://$BUCKET --force