Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python 264559

Код товару: 264559Паперова книга
  • ISBN
    978-1801070492
  • Бренд
  • Автор
  • Рік
    2023
  • Мова
    Англійська
  • Ілюстрації
    Чорно-білі
Build scalable and reliable data ecosystems using Data Mesh, Databricks Spark, and Kafka
Key Features
  • Develop modern data skills used in emerging technologies
  • Learn pragmatic design methodologies such as Data Mesh and data lakehouses
  • Gain a deeper understanding of data governance
Book Description
Modern Data Architectures with Python will teach you how to seamlessly incorporate your machine learning and data science work streams into your open data platforms. You’ll learn how to take your data and create open lakehouses that work with any technology using tried-and-true techniques, including the medallion architecture and Delta Lake.
Starting with the fundamentals, this book will help you build pipelines on Databricks, an open data platform, using SQL and Python. You’ll gain an understanding of notebooks and applications written in Python using standard software engineering tools such as git, pre-commit, Jenkins, and Github. Next, you’ll delve into streaming and batch-based data processing using Apache Spark and Confluent Kafka. As you advance, you’ll learn how to deploy your resources using infrastructure as code and how to automate your workflows and code development. Since any data platform's ability to handle and work with AI and ML is a vital component, you’ll also explore the basics of ML and how to work with modern MLOps tooling. Finally, you’ll get hands-on experience with Apache Spark, one of the key data technologies in today’s market.
By the end of this book, you’ll have amassed a wealth of practical and theoretical knowledge to build, manage, orchestrate, and architect your data ecosystems.
What you will learn
  • Understand data patterns including delta architecture
  • Discover how to increase performance with Spark internals
  • Find out how to design critical data diagrams
  • Explore MLOps with tools such as AutoML and MLflow
  • Get to grips with building data products in a data mesh
  • Discover data governance and build confidence in your data
  • Introduce data visualizations and dashboards into your data practice
Who this book is for
This book is for developers, analytics engineers, and managers looking to further develop a data ecosystem within their organization. While they’re not prerequisites, basic knowledge of Python and prior experience with data will help you to read and follow along with the examples.
About the Author
Brian Lipp is a Technology Polyglot, Engineer, and Solution Architect with a wide skillset in many technology domains. His programming background has ranged from R, Python, and Scala, to Go and Rust development. He has worked on Big Data systems, Data Lakes, data warehouses, and backend software engineering. Brian earned a Master of Science, CSIS from Pace University in 2009. He is currently a Sr. Data Engineer working with large Tech firms to build Data Ecosystems.
1'400 ₴
Купити
Monobank
до 10 платежей
от 157 ₴ / міс.
  • Нова Пошта
    Безкоштовно від 3'000,00 ₴
  • Укрпошта
    Безкоштовно від 1'000,00 ₴
  • Meest Пошта
    Безкоштовно від 3'000,00 ₴
Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python - фото 1
Інші книги Packt Publishing

Характеристики

  • Бренд
  • Автор
  • Категорія
    Програмування
  • Рік
    2023
  • Сторінок
    318
  • Формат
    190х235 мм
  • Обкладинка
    М'яка
  • Тип паперу
    Офсетний
  • Мова
    Англійська
  • Ілюстрації
    Чорно-білі

Від видавця

Build scalable and reliable data ecosystems using Data Mesh, Databricks Spark, and Kafka
Key Features
  • Develop modern data skills used in emerging technologies
  • Learn pragmatic design methodologies such as Data Mesh and data lakehouses
  • Gain a deeper understanding of data governance
Book Description
Modern Data Architectures with Python will teach you how to seamlessly incorporate your machine learning and data science work streams into your open data platforms. You’ll learn how to take your data and create open lakehouses that work with any technology using tried-and-true techniques, including the medallion architecture and Delta Lake.
Starting with the fundamentals, this book will help you build pipelines on Databricks, an open data platform, using SQL and Python. You’ll gain an understanding of notebooks and applications written in Python using standard software engineering tools such as git, pre-commit, Jenkins, and Github. Next, you’ll delve into streaming and batch-based data processing using Apache Spark and Confluent Kafka. As you advance, you’ll learn how to deploy your resources using infrastructure as code and how to automate your workflows and code development. Since any data platform's ability to handle and work with AI and ML is a vital component, you’ll also explore the basics of ML and how to work with modern MLOps tooling. Finally, you’ll get hands-on experience with Apache Spark, one of the key data technologies in today’s market.
By the end of this book, you’ll have amassed a wealth of practical and theoretical knowledge to build, manage, orchestrate, and architect your data ecosystems.
What you will learn
  • Understand data patterns including delta architecture
  • Discover how to increase performance with Spark internals
  • Find out how to design critical data diagrams
  • Explore MLOps with tools such as AutoML and MLflow
  • Get to grips with building data products in a data mesh
  • Discover data governance and build confidence in your data
  • Introduce data visualizations and dashboards into your data practice
Who this book is for
This book is for developers, analytics engineers, and managers looking to further develop a data ecosystem within their organization. While they’re not prerequisites, basic knowledge of Python and prior experience with data will help you to read and follow along with the examples.
About the Author
Brian Lipp is a Technology Polyglot, Engineer, and Solution Architect with a wide skillset in many technology domains. His programming background has ranged from R, Python, and Scala, to Go and Rust development. He has worked on Big Data systems, Data Lakes, data warehouses, and backend software engineering. Brian earned a Master of Science, CSIS from Pace University in 2009. He is currently a Sr. Data Engineer working with large Tech firms to build Data Ecosystems.

Зміст

Table of Contents
  1. Modern Data Processing Architectures
  2. Basics of Data Analytics Engineering
  3. Cloud Storage and Processing Concepts
  4. Python Batch and Stream Processing with Spark
  5. Streaming Data with Kafka
  6. Python MLOps
  7. Python and SQL based Visualizations
  8. Integrating CI into your workflow
  9. Data Orchestration
  10. Data Governance
  11. Introduction to Saturn Insurance, Deploying CI and ELT
  12. Data Governance and Dashboards

Відгуки про Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python

Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python
Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python
1'400 ₴
Купити
Персонально для вас
Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications (Undergraduate Topics in Computer Science) 2nd ed. 2024 Edition
275543
Laura IgualSanti SeguiJordi VitriaEloi PuertasPetia RadevaOriol PujolSergio EscaleraFrancesc Danti
1'200 ₴
Python Programming for Mathematics
295070
Guillod Julien
1'300 ₴
Digital Signal Processing: Illustration Using Python 1st ed. 2024 Edition
277689
S EsakkirajanT VeerakumarBadri N Subudhi
1'400 ₴
Django in Action
282314
Christopher Trudeau
1'400 ₴
OSGi in Action: Creating Modular Applications in Java
14426
Richard Hall, Karl Pauls, Stuart McCulloch, David Savage
590 ₴
Fluent Python. Clear, Concise, and Effective Programming. 2nd Edition
197750
Luciano Ramalho
1'900 ₴1'672 ₴
Data Ingestion with Python Cookbook: A practical guide to ingesting, monitoring, and identifying errors in the data ingestion process
246252
Glaucia Esppenchutz
1'200 ₴
TCP/IP Sockets in C#: Practical Guide for Programmers
39500
David Makofske, Michael J. Donahoo, Kenneth L. Calvert
1'800 ₴
Software Development, Design, and Coding: With Patterns, Debugging, Unit Testing, and Refactoring Third Edition
282447
John F. DooleyVera A. Kazakova
1'800 ₴
Architecting Vue.js 3 Enterprise-Ready Web Applications: Build and deliver scalable and high-performance, enterprise-ready applications with Vue and JavaScript
302522
Solomon Eseme
1'600 ₴
Computational Mathematics: An introduction to Numerical Analysis and Scientific Computing with Python
246257
Dimitrios Mitsotakis
1'700 ₴
Inside Microsoft® SQL Server® 2008: T-SQL Programming (Developer Reference) 1st Edition
190238
Dejan SarkaGreg LowRoger WolterEd KatibahIsaac KunenItzik Ben-Gan
650 ₴
Machine Learning with Python Cookbook
255743
Kyle GallatinChris Albon
1'900 ₴