Introduction to S3

Video Activity
Join over 3 million cybersecurity professionals advancing their career
Sign up with
Required fields are marked with an *
or

Already have an account? Sign In »

Time
19 hours 19 minutes
Difficulty
Intermediate
CEU/CPE
20
Video Transcription
00:00
>> Welcome back. In this lesson,
00:00
we're going to dive into our intro on S3.
00:00
The learning objectives are going to be to
00:00
describe S3 to you.
00:00
We want to make sure we understand what the service is
00:00
and then we're going to talk about
00:00
different characteristics of S3,
00:00
some things you need to know about when you're
00:00
working with this particular storage service.
00:00
Amazon S3, it stands for Simple Storage Service.
00:00
This is one of the first services
00:00
that Amazon ever created,
00:00
and it's something that's very handy.
00:00
You're going to see this commonly used
00:00
in most AWS Cloud environments.
00:00
It's a Blob storage solution, if you will,
00:00
or object-based storage and
00:00
it allows you to do a lot of stuff.
00:00
You could use this in application environments,
00:00
you can use this in IT
00:00
and infrastructure environments that are,
00:00
you're looking for some type of
00:00
scalable file storage solution this is
00:00
a great option and it is cheaper than EFS,
00:00
just something to keep in mind.
00:00
It allows users to store objects, different files,
00:00
you can store folders,
00:00
you can store files,
00:00
videos, music, MP3s, all different
00:00
types of objects that you would want to put up there.
00:00
It is not hierarchical,
00:00
it's a flat base file storage.
00:00
It uses buckets and these are like
00:00
directories but you're not going to
00:00
have that hierarchy which causes
00:00
overhead when you're processing
00:00
through file storage solutions.
00:00
This is a lot easier and without getting too technical,
00:00
it doesn't allow for
00:00
that overhead that would overload systems when trying
00:00
to search through file solutions or storage solutions.
00:00
Buckets are going to be globally
00:00
unique so whatever you title your bucket,
00:00
it has to be unique and you have to have
00:00
a naming convention that works
00:00
good for your organization.
00:00
Once the bucket is created and it has that unique name,
00:00
no one else in the world is going to be able to do the
00:00
same or use the same name as you. That's what that means.
00:00
Buckets are going to be defined at the regional level.
00:00
You got to select which region you want to put
00:00
your bucket in and that's where
00:00
your bucket is going to be based out of
00:00
but you can access
00:00
your bucket from anywhere in the world.
00:00
Naming convention. When you are naming your buckets,
00:00
this is important to know because you're going
00:00
to have questions on this in your exam,
00:00
you cannot have uppercase letters,
00:00
there is no underscore that's allowed,
00:00
your bucket name has to be between
00:00
3 and 63 characters long.
00:00
This is not an IP address
00:00
and it must start with a letter or a number.
00:00
Remember, this is globally unique so
00:00
come up with a naming convention that works
00:00
for your organization because
00:00
you're going to want this to be something you
00:00
can repeat over and over
00:00
for other future buckets that you create.
00:00
S3 objects are the things that go inside your bucket.
00:00
These are your files, your movies,
00:00
your log files, your documents,
00:00
your music, if you have that, whatever it is.
00:00
For example, Netflix they
00:00
leverage S3 buckets to store their video files.
00:00
Now they have a lot of S3 buckets.
00:00
They all store a lot of different videos and they're all
00:00
secured in great detail
00:00
and I'm not going to get into a lot
00:00
of the technicalities of that,
00:00
but they do leverage S3 for that storage solution.
00:00
There's a lot of other organizations that do that.
00:00
They have uploaded their movies
00:00
to Netflix and then the applications are
00:00
calling those buckets directly and
00:00
streaming the video file from that bucket out to you.
00:00
The way they have it architected is
00:00
very advanced and it's highly available
00:00
and highly redundant and it makes sure that the service
00:00
is always going to be ready for you whenever you're
00:00
jumping on the couch and turning on
00:00
Netflix, ready to watch a video.
00:00
The architecture is well-designed,
00:00
but at its simplest form,
00:00
it goes back to these basic principles using things like
00:00
Amazon S3 to host their video files.
00:00
This is an example of the path of the S3 bucket.
00:00
we have s3://my-bucket/my_file.txt.
00:00
Your bucket name, remember,
00:00
your bucket doesn't have the underscore,
00:00
but your files can have underscores.
00:00
I just want to make that important call-out in
00:00
case somebody asks any questions about it.
00:00
But yes,
00:00
you can upload your files and that's the file path.
00:00
This is what it would look like if
00:00
you were in a console and you were
00:00
searching your bucket through some type of
00:00
terminal or through an application SDK,
00:00
that's how you would be able to see that.
00:00
There are no directories.
00:00
It looks like that but it's not like I said,
00:00
this is flat storage so you don't have to
00:00
think of parent folders and child folders in that sense.
00:00
When you are uploading objects to your bucket,
00:00
your max object size is going to be five terabytes.
00:00
First of all, let me spin off here for a second.
00:00
Your max size, your max object size is five terabytes.
00:00
You can store more than five terabytes in a bucket.
00:00
But the max object,
00:00
one file, if you have
00:00
one file that's over five terabytes,
00:00
you've got a problem.
00:00
But [LAUGHTER] if you happen to have that 4.9 terabytes,
00:00
you can upload that to your S3 bucket.
00:00
If you do upload a file that's 4.9 terabytes big,
00:00
you're going to do something
00:00
that's called a multipart upload.
00:00
As a matter of fact, if you
00:00
upload anything that's larger than five gigabytes,
00:00
doesn't have to be in the terabyte range,
00:00
just five gigs they're going to do a multi-part upload,
00:00
which means that they're going to break apart
00:00
the object and they're going to send different pieces
00:00
through the Internet and then when it is
00:00
received on Amazon's data warehouse,
00:00
they're going to put the pieces back
00:00
together and then you'll have your file there.
00:00
That's what multipart means,
00:00
different parts being uploaded at different speeds and
00:00
eventually it all comes back together. Pretty cool stuff.
00:00
The way I like to envision
00:00
that if you're a Star Trek fan,
00:00
which I'm not a huge Star Trek fan,
00:00
but they have this thing,
00:00
this beam, beam me up Scotty,
00:00
like when they're on a planet and
00:00
Scotty is over there trying to bring
00:00
the people back to the spaceship,
00:00
I forgot the name of the spaceship
00:00
but you know what I'm saying.
00:00
They bring him back to the spaceship.
00:00
They use this beam and this beam breaks apart
00:00
the carbon molecules of
00:00
the person and then pieces it back
00:00
together inside of
00:00
the spaceship where they are retrieved.
00:00
For all of you Star Trek fans, please don't hate me,
00:00
I know that probably didn't do a lot of justice,
00:00
but you get the concept.
00:00
It's breaking apart and putting it back
00:00
together when you arrive at your destination.
00:00
Continuing on, Amazon S3 uses metadata.
00:00
Metadata is common,
00:00
we use it everywhere,
00:00
operating systems use it
00:00
too but this is going to allow us
00:00
to create some characteristics to organize our data by.
00:00
We can list key-value pairs,
00:00
very customizable you're going to get to do
00:00
that in this module,
00:00
where you'll see me do it,
00:00
and then you'll be able to do it yourself as well.
00:00
Metadata is the data about the data.
00:00
It allows us to organize and search
00:00
and secure and do all that good stuff.
00:00
Tagging. We can use
00:00
a Unicode key-value pair up to 10 for tagging.
00:00
This is also going to be useful for
00:00
security and development life cycles.
00:00
If you're an app dev,
00:00
if you're a developer then you may
00:00
want to use tagging, it's going to be beneficial.
00:00
Then versioning. Versioning is a unique thing.
00:00
It's not actually that unique,
00:00
it's not unique to AWS,
00:00
but it's something that is very beneficial here with S3.
00:00
When you upload a file, let's say it's a word document,
00:00
and you have your original version,
00:00
let's say this Version 0,
00:00
you upload it up there and then you're like,
00:00
"Oh dang, I need to go and
00:00
make a modification to that file."
00:00
You can go back to
00:00
your local desktop, make a modification,
00:00
maybe want to add a picture to the Word document
00:00
or you want to add a table or something like that.
00:00
You add a table and
00:00
you save it on your local workstation,
00:00
then you submit it back to your S3 bucket.
00:00
Now you have your original version,
00:00
which was Version 0, which did not have a table,
00:00
then you upload a new version.
00:00
The title is still the same
00:00
but when you upload the new version,
00:00
it's now going to override it with Version 1,
00:00
which is the newest version.
00:00
Version 1 is going to have everything that
00:00
Version 0 had,
00:00
but it's also going to have
00:00
the table included in it as well.
00:00
That's versioning is just
00:00
keeping up with updates to the documents.
00:00
S3 will allow you to retain older versions.
00:00
Let's say you add
00:00
a table and then your peer looks at the table and says,
00:00
"I don't like the table, let's remove it."
00:00
You guys can go back to Version 0
00:00
and scrap the Version 1 that you uploaded.
00:00
That's a really handy thing for collaboration
00:00
and for any accidents that happen.
00:00
That about summarizes this lecture on
00:00
the characteristics and just general introduction of S3.
00:00
To wrap it up, S3 as a storage service,
00:00
stores objects in the Cloud
00:00
and there's a lot of customizability,
00:00
there's a lot of things you can do with it,
00:00
It's very, very flexible.
00:00
It's always up. Amazon guarantees 9 times
00:00
9 is like 9.99999 percent.
00:00
I'm sorry, it's 99.99999.
00:00
[LAUGHTER] It's almost a 100 percent.
00:00
It's super close.
00:00
But I think it was like nine nines
00:00
of availability and redundancy and all that stuff.
00:00
So you know that your data
00:00
is staying available, it's staying persistent,
00:00
it's not going to be modified,
00:00
it's going to be redundant, which is a beautiful thing.
00:00
S3 is a excellent storage.
00:00
Definitely recommend that you learn a lot
00:00
about it because it's going to be
00:00
one of those topics
00:00
that is going to be embedded in a lot of questions,
00:00
not only on this exam, but
00:00
you're going to see it everywhere you go.
00:00
I've seen it in the federal government
00:00
here in the United States,
00:00
I've seen it for HIPAA compliant environments,
00:00
I've seen it in GDPR compliant environments,
00:00
I've seen in PCI DSS compliant environments.
00:00
Regardless of the rules and regulations,
00:00
pretty much everyone likes the idea of
00:00
S3 and they do recommend using it.
00:00
If you're going to be using it in a Cloud space,
00:00
you can be rest assured that it is
00:00
safe and that it is secure and it is compliant.
00:00
Anyways, that is it for this lecture,
00:00
go ahead and take a break.
00:00
I know that we went a little over here.
00:00
I'm going to grab a cup of coffee and then
00:00
jump into the next lecture. I'll see you there.
Up Next
S3 Consistency Model
33m
Securing S3 Bucket Access
49m
Module 6 Conclusion
5m