Learning AWS

You're reading from Learning AWS: Design, build, and deploy responsive applications using AWS Cloud components

Product type Paperback
Published in Feb 2018
Publisher Packt
ISBN-13 9781787281066
Length 412 pages
Edition 2nd Edition
Authors (2):
Amit Shah
Aurobindo Sarkar
Table of Contents (12 Chapters)

Preface
1. Cloud 101 – Understanding the Basics
2. Designing Cloud Applications
3. Introducing AWS Components
4. Designing for and Implementing Scalability
5. Designing for and Implementing High Availability
6. Designing for and Implementing Security
7. Deploying to Production and Going Live
8. Designing a Big Data Application
9. Implementing a Big Data Application
10. Deploying a Big Data System
11. Other Books You May Enjoy

Using AWS Glue and Amazon Athena

In this section, we will use AWS Glue to create a crawler, an ETL job, and a job that runs the KMeans clustering algorithm on the input data.

We use a publicly available dataset describing students' knowledge status on a subject. The dataset and its field descriptions can be downloaded from the UCI site: https://archive.ics.uci.edu/ml/datasets/User+Knowledge+Modeling

  1. Log in to the AWS Management Console and go to the Glue console. Click the Add crawler button.
  2. Specify the Crawler name as User Modeling Data Crawler, then click the Next button.
  3. In the Add a data store screen, select S3 as the Data store and select the Specified path in my account option. Specify the path of the S3 bucket containing the input data, then click the Next button.
  4. Select No for Add another data store and click the Next button.
  5. On the...
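
The console steps shown so far can also be sketched in code. The following is a minimal sketch, not the book's own method: it assumes the boto3 SDK and its Glue client's create_crawler call, and the bucket path, IAM role name, and database name are hypothetical placeholders that do not appear in the steps above. The request is built by a plain function so that it can be inspected without touching AWS.

```python
from typing import Any, Dict


def build_crawler_request(name: str, s3_path: str, role: str, database: str) -> Dict[str, Any]:
    """Assemble the keyword arguments for the Glue CreateCrawler API call."""
    if not s3_path.startswith("s3://"):
        raise ValueError("s3_path must be an S3 URI such as s3://bucket/prefix/")
    return {
        "Name": name,                # crawler name entered in step 2
        "Role": role,                # IAM role the crawler assumes (assumption)
        "DatabaseName": database,    # Data Catalog database for results (assumption)
        "Targets": {"S3Targets": [{"Path": s3_path}]},  # S3 data store from step 3
    }


def create_crawler(request: Dict[str, Any]) -> None:
    """Send the request to AWS Glue. Requires boto3 and valid AWS credentials."""
    import boto3  # imported lazily so building the request needs no AWS SDK

    glue = boto3.client("glue")
    glue.create_crawler(**request)


# Build the request only; nothing is sent to AWS at this point.
request = build_crawler_request(
    name="User Modeling Data Crawler",
    s3_path="s3://example-bucket/user-modeling/",  # hypothetical bucket path
    role="ExampleGlueServiceRole",                 # hypothetical IAM role
    database="user_modeling_db",                   # hypothetical database name
)
```

Calling create_crawler(request) would then register the crawler in the account that the local AWS credentials point to. Only one S3 target is listed, which mirrors selecting No for Add another data store.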