Additional analytics services
In this section, we will take a very quick look at some other AWS analytics services that you need to be aware of. Specifically, we will look at the Elastic Map Reduce (EMR) service, CloudSearch, and Data Pipeline:
- AWS EMR: This provides a managed Hadoop framework to enable you to process vast amounts of big data. You can use open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto. Amazon EMR comes with an integrated development environment (IDE) called EMR Studio to help you develop, visualize, and debug data engineering and data science applications written in R, Python, Scala, and PySpark. You can run your EMR workloads on EC2 Instances, Amazon Elastic Kubernetes Service (EKS) clusters, and on-premises using the AWS Outpost service. In terms of pricing, you are charged at a per-instance rate for every second used, with a 1-minute minimum charge.
- AWS Data Pipeline: This is a web service that...