0

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Free Learning

Hadoop 3.2.0 released with support for node attributes in YARN, Hadoop submarine and more

Amrata Joshi

3 min read
24 Jan 2019

0 Likes
0 Comments
6716 Views

article-image

The team at Apache Hadoop released Apache Hadoop 3.2.0, an open source software platform for distributed storage and for processing of large data sets. This version is the first in the 3.2 release line and is not generally available or production ready, yet.

What’s new in Hadoop 3.2.0?

Node attributes support in YARN

This release features Node Attributes that help in tagging multiple labels on the nodes based on their attributes. It further helps in placing the containers based on the expression of these labels. It is not associated with any queue and hence there is no need to queue resource planning and authorization for attributes.

Hadoop submarine on YARN

This release comes with Hadoop Submarine that enables data engineers for developing, training and deploying deep learning models in TensorFlow on the same Hadoop YARN cluster where data resides. It also allows jobs for accessing data/models in HDFS (Hadoop Distributed File System) and other storages. It supports user-specified Docker images and customized DNS name for roles such as tensorboard.$user.$domain:6006.

Storage policy satisfier

Storage policy satisfier supports HDFS applications to move the blocks between storage types as they set the storage policies on files/directories. It is also a solution for decoupling storage capacity from compute capacity.

Enhanced S3A connector

This release comes with support for an enhanced S3A connector, including better resilience to throttled AWS S3 and DynamoDB IO.

ABFS filesystem connector

It supports the latest Azure Datalake Gen2 Storage.

Major improvements

jdk1.7 profile has been removed from hadoop-annotations module.

Redundant logging related to tags have been removed from configuration.

ADLS connector has been updated to use the current SDK version (2.2.7).

This release includes LocalizedResource size information in the NM download log for localization.

This version of Apache Hadoop comes with ability to configure auxiliary services from HDFS-based JAR files.

This release comes with the ability to specify user environment variables, individually.

The debug messages in MetricsConfig.java have been improved.

Capacity scheduler performance metrics have been added.

This release comes with added support for node labels in opportunistic scheduling.

Major bug fixes

The issue with logging for split-dns multihome has been resolved.

The snapshotted encryption zone information in this release is immutable.

A shutdown routine has been added in HadoopExecutor for ensuring clean shutdown.

Registry entries have been deleted from ZK on ServiceClient.

The javadoc of package-info.java has been improved.

NPE in AbstractSchedulerPlanFollower has been fixed.

To know more about this release, check out the release notes on Hadoop’s official website.

Why did Uber created Hudi, an open source incremental processing framework on Apache Hadoop?

Uber’s Marmaray, an Open Source Data Ingestion and Dispersal Framework for Apache Hadoop

Setting up Apache Druid in Hadoop for Data visualizations [Tutorial]

Like
Save for later
Comment

0 Likes
0 Comments
6716 Views

Recommendations for you

Fundamentals of Object-Oriented Programming - C++

Fundamentals of Object-Oriented Programming - C++

Feb 2023 7hrs 6mins

Video

€8.99 ~~€22.99~~

Hands-On Reinforcement Learning with Python

Hands-On Reinforcement Learning with Python

Jun 2018 318 pages

eBook

€8.99 ~~€23.99~~

Managing Kubernetes Resources Using Helm

Managing Kubernetes Resources Using Helm

Sep 2022 310 pages

eBook

€8.99 ~~€31.99~~

Implementing Event-Driven Microservices Architecture in .NET 7

Implementing Event-Driven Microservices Architecture in .NET 7

Mar 2023 326 pages

eBook

€8.99 ~~€26.99~~

OpenGL 4 Shading Language Cookbook

OpenGL 4 Shading Language Cookbook

Sep 2018 472 pages

eBook

€8.99 ~~€29.99~~

Raspberry Pi and MQTT Essentials

Raspberry Pi and MQTT Essentials

Sep 2022 272 pages

eBook

€8.99 ~~€22.99~~

The Software Developer's Guide to Linux

The Software Developer's Guide to Linux

Jan 2024 300 pages

eBook

€8.99 ~~€23.99~~

Building Python Microservices with FastAPI

Building Python Microservices with FastAPI

Aug 2022 420 pages

eBook

€8.99 ~~€28.99~~

Hands-On Design Patterns with C++

Hands-On Design Patterns with C++

Jul 2023 626 pages

eBook

€8.99 ~~€29.99~~

LLM Engineer's Handbook

LLM Engineer's Handbook

Oct 2024 522 pages

eBook

€8.99 ~~€43.99~~

article-image-revolutionising-work-and-everyday-life-with-chatgpt

M.T. White

16 Dec 2024

Revolutionising Work and Everyday Life with ChatGPT

M.T. White

16 Dec 2024

10 min read

article-image-building-trust-in-ai-the-role-of-rag-in-data-security-and-transparency

Keith Bourne

13 Dec 2024

Building Trust in AI: The Role of RAG in Data Security and Transparency

Keith Bourne

13 Dec 2024

15 min read

article-image-enhancing-data-quality-with-cleanlab

Prakhar Mishra

11 Dec 2024

Enhancing Data Quality with Cleanlab

Prakhar Mishra

11 Dec 2024

10 min read

article-image-revolutionize-power-bi-queries-with-openai

Gus Frazer

11 Dec 2024

Revolutionize Power BI Queries with OpenAI

Gus Frazer

11 Dec 2024

10 min read

Comments (0)

No comments for this article yet!