This Quick Start Walk-Through Guide is intended to walk you through a Data Lake reference architecture and AWS services used within. To demonstrate the flow in your AWS account, we are going to use a sample dataset from a fictional company, EcommCo. EcommCo sells products in multiple categories via its e-commerce website EcommCo.com. Its business users would like to consume key insights via business intelligence reports and dashboards to help answer key questions like, “What are our top selling products by state?” and “What is the customer lifetime value?”
EcommCo’s Data Lake in AWS provides data ingestion, real-time analytics over a continuous stream of data, batch analytics using all data available in a Data Lake, ad-hoc analytics for ease of exploring unknown data and visualization, so analytics can be easily understood by key stakeholders.
The diagram below provides a high-level overview of EcommCo’s Data Lake in AWS. All of the AWS resources illustrated below were deployed to your account when you launched the Quick Start. As we step through this guide, sample data from EcommCo will be ingested into your account. After the demonstration, you can delete the data and begin using the Quick Start architecture with your own data.