This book is a quick-start guide for learning Apache Hadoop version 3. It is targeted at readers with no prior knowledge of Apache Hadoop, and covers key big data concepts, such as data manipulation using MapReduce, flexible model utilization with YARN, and storing different datasets with Hadoop Distributed File System (HDFS). This book will teach you about different configurations of Hadoop version 3 clusters, from a lightweight developer edition to an enterprise-ready deployment. Throughout your journey, this guide will demonstrate how parallel programming paradigms such as MapReduce can be used to solve many complex data processing problems, using case studies and code to do so. Along with development, the book will also cover the important aspects of the big data software development life cycle, such as quality assurance and control, performance, administration, and monitoring. This book serves as a starting point for those who wish to master the Apache Hadoop ecosystem.
United States
United Kingdom
India
Germany
France
Canada
Russia
Spain
Brazil
Australia
Argentina
Austria
Belgium
Bulgaria
Chile
Colombia
Cyprus
Czechia
Denmark
Ecuador
Egypt
Estonia
Finland
Greece
Hungary
Indonesia
Ireland
Italy
Japan
Latvia
Lithuania
Luxembourg
Malaysia
Malta
Mexico
Netherlands
New Zealand
Norway
Philippines
Poland
Portugal
Romania
Singapore
Slovakia
Slovenia
South Africa
South Korea
Sweden
Switzerland
Taiwan
Thailand
Turkey
Ukraine