Amazon Web Services (AWS) are becoming an integral part of everyday life in many companies and governments around the world. While everyone has heard of AWS and knows a little about it, there is a lot more to AWS than most of us are aware of. So, we are going to look deeper and explore parts of AWS that you may not be aware of. By the time you finish this article, you will have a good idea of exactly what AWS is all about and how it can help your IT department enhance and enable your systems along with expanding the company’s operations.
AWS is so much more than cloud storage, or even a cloud compute engine. AWS is comprised of over 90 different types of services in 20 different categories. AWS is growing in size, breadth, and complexity every month. Let’s start with data organization. At a high level, AWS is organized into Regions, Availability Zones, Edge Locations, and Instances. Regions are the highest level and are physical geographic places in the world, whereas Availability Zones are virtual places located within the Regions. Edge Locations are closer to populated areas and basically serve as the front end to AWS, plus cache data to make access faster. An instance is the lowest level container for data.
"Amazon has eight primary Analytics offerings that can help you work with large datasets for analysis and processing"
Each AWS user can create one or many instances within an Availability Zone. Each Availability Zone can be located at one or more physical data centers to provide redundancy. As an example, in the US there are two Regions: US East and US West. The US East Region contains 9 Availability Zones, and US West contains 6. Also, there are 2 Government Regions in US West (for a total of 8 Regions). There are 20 Edge locations and 3 Regional Edge Caches.
Across the world, there are 16 different Regions and 44 Availability Zones. AWS has built Regions everywhere around the world except for two continents (Africa and Antarctica). Looking ahead to the 2018 time frame there are six more Regions and 17 more Availability Zones planned. AWS created Regions to reduce local latency as well as help companies and governments comply with local rules and data compliance requirements in each country.
Let’s explore some popular services on AWS so that you can speak the language of AWS. First, let’s look at their databases offerings. AWS offers Relational Database Services (RDS) and allows the choice of six different databases (Aurora, MySQL, MariaDB, Oracle, SQL Server, PostgreSQL). AWS also offers several NoSQL database options. Amazon SimpleDB is a straightforward yet powerful database for most quick and dirty needs like logging or monitoring. It is very flexible and forgiving of data structures. Amazon DynamoDB is a very robust NoSQL for most database services. Once you pick the database (SQL or NoSQL) that you want, RDS will automatically install the database, configure it, and perform routine maintenance like installing patches and making backups.
Amazon’s compute engines are well known. The most popular one is Amazon Elastic Compute Cloud (Amazon EC2). This service provides scalable virtual servers for your applications to handle spikes up or down based on your needs (Auto Scaling). There are different computer performance options available labeled C4, M4, T2, I2, and R3. For very extremely intense compute needs, F1 instances are available with Field Programmable Gate Arrays (FPGA’s). Amazon also offers Elastic Load Balancing to help keep each of your EC2 instances from becoming overloaded. For most of us, the easiest and lowest cost way to get started with cloud computing is to use the new Amazon Light sail offering which has most of the options for compute, storage, and networking already pre-configured. This ease of use allows almost anyone (even someone without cloud experience) to launch virtual private servers (VPS) with only a few clicks. It is effort less to do and low cost.
Probably one of the most popular services at Amazon is for storage. Their storage services are Amazon Simple Storage Service (S3), Amazon Glacier (for longer-term storage) and AWS Storage Gateway which allows you to connect your on-site storage systems to Amazon’s storage systems. As an example of how this works, you could connect Amazon Glacier to your server via the Storage Gateway instead of to a tape backup system. Amazon Elastic Block Storage (EBS) and Amazon Elastic File System (Amazon EFS) provide storage services and are both custom tailored for EC2 instances.
Big Data is something that Amazon does exceptionally well. Amazon has eight primary Analytics offerings that can help you work with large datasets for analysis and processing. These services are Athena, Elastic Map Reduce (EMR), Elasticsearch Service, Kinesis, QuickSight, Redshift, Glue, and Data Pipeline. Athena is a high-speed interactive parallel SQL query tool that can be used to get at vast amounts of S3 data in seconds. Amazon EMR is a managed Hadoop framework that is dynamically scalable. Elastic search is a service that you do not know you need until you realize that there must be a better way to search large files for text, monitor applications or analyzing logs. Kinesis is an exciting tool that allows you to collect, analyze and process large amounts of streaming data. With this tool, you can now analyze logs, click streams and telemetry data in near real-time. QuickSight is an easy to use and low-cost business analytics tool for most companies. Redshift is your standard data warehouse but on steroids. It can handle Petabytes of data without an issue. Glue is a full-service ETL offering that allows you to prepare and load data for analytics. Lastly, Data Pipeline lets you move data as if it was in a pipeline all around Amazon and to/from your premise location.
Amazon has many impressive services that are becoming even easier to use, more robust and connected with each other. If you have not started using AWS, or experimenting with it, there hasn’t been a better time to do it--than right now.