Summary
In this chapter, we explored how to execute data-intensive jobs on a cluster of machines to achieve parallel processing, which is essential for large-scale (big) data. We began by evaluating the cluster options available for data processing, providing a comparative analysis of Hadoop MapReduce and Apache Spark, the two main competing cluster computing platforms. The analysis showed that Apache Spark offers more flexibility in terms of supported languages and cluster management systems, and that its in-memory data processing model allows it to outperform Hadoop MapReduce for real-time data processing.
Once we had established that Apache Spark is the most appropriate choice for a variety of data processing applications, we looked into its fundamental data structure, the resilient distributed dataset (RDD). We discussed how to create RDDs from different data sources and introduced the two types of RDD operations: transformations and actions...