Querying data within an S3 bucket using AWS Glue and Amazon Athena
In this hands-on lab, we will use sample flight data stored in a public S3 bucket to create an AWS Glue table. We will then run queries against that table to find out some flight information. Let's get started:
- Log in to the AWS Console and navigate to AWS Glue.
- Click on Crawlers from the main left-hand menu and then click Add crawler.
- Enter DBCertCrawler for Crawler name and click Next.
- Leave all the defaults on the Specify crawler source type page as-is.
- On the next page, leave the data source as S3 and click Add connection. Complete the popup by using the following details:
  - Name: DBCertFlight
  - Include path: s3://athena-examples/flight/
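For readers who prefer scripting to the console, the crawler settings entered so far can be sketched with boto3, the AWS SDK for Python. This is only a minimal sketch under stated assumptions: the IAM role ARN, the database name, and the region are hypothetical placeholders that the steps above have not yet specified, and the connection name from the popup has no counterpart here because the sketch points the crawler directly at the S3 path.

```python
def build_crawler_request(role_arn, database_name):
    """Build keyword arguments for the Glue create_crawler call.

    role_arn and database_name are hypothetical inputs; the lab has not
    named an IAM role or a Glue database at this point.
    """
    return {
        "Name": "DBCertCrawler",  # crawler name used in the lab
        "Role": role_arn,  # assumption: a role Glue can assume to read S3
        "DatabaseName": database_name,  # assumption: target Glue database
        "Targets": {
            # include path entered in the popup
            "S3Targets": [{"Path": "s3://athena-examples/flight/"}]
        },
    }


def create_crawler(role_arn, database_name, region="us-east-1"):
    """Create the crawler; needs boto3 and AWS credentials.

    The default region is an assumption, not something the lab specifies.
    """
    import boto3  # imported lazily so the builder above runs without the SDK

    glue = boto3.client("glue", region_name=region)
    glue.create_crawler(**build_crawler_request(role_arn, database_name))
```

Calling `create_crawler` with a role ARN and database name from your own account would register the crawler without running it. Keeping the request in a separate function makes it easy to inspect the settings before anything is sent to AWS.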
The following screenshot shows how the form should be completed: