Learning Spark 2nd Edition 114663

Паперова книга
114663
Learning Spark 2nd Edition - фото 1
830
Купити

Все про “Learning Spark 2nd Edition”

Від видавця

Data is getting bigger, arriving faster, and coming in varied formats-and it all needs to be processed at scale for analytics or machine learning. How can you process such varied data workloads efficiently? Enter Apache Spark. Updated to emphasize new features in Spark 2.4., this second edition shows data engineers and scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine-learning algorithms. Through discourse, code snippets, and notebooks, you'll be able to: Learn Python, SQL, Scala, or Java high-level APIs: DataFrames and Datasets Peek under the hood of the Spark SQL engine to understand Spark transformations and performance Inspect, tune, and debug your Spark operations with Spark configurations and Spark UI Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka Perform analytics on batch and streaming data using Structured Streaming Build reliable data pipelines with open source Delta Lake and Spark Develop machine learning pipelines with MLlib and productionize models using MLflow Use open source Pandas framework Koalas and Spark for data transformation and feature engineering
About the Author
Jules S. Damji is an Apache Spark Community and Developer Advocate at Databricks. He is a hands-on developer with over 20 years of experience and has worked at leading companies, such as Sun Microsystems, Netscape, @Home, LoudCloud/Opsware, VeriSign, ProQuest, and Hortonworks, building large-scale distributed systems. He holds a B.Sc and M.Sc in Computer Science and MA in Political Advocacy and Communication from Oregon State University, Cal State, and Johns Hopkins University respectively. Denny Lee is a Technical Product Manager at Databricks. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premise and cloud environments. He also has a Masters of Biomedical Informatics from Oregon Health and Sciences University and has architected and implemented powerful data solutions for enterprise Healthcare customers. His current technical focuses include Distributed Systems, Apache Spark, Deep Learning, Machine Learning, and Genomics. Brooke Wenig is the Machine Learning Practice Lead at Databricks. She guides and assists customers in implementing machine learning pipelines, as well as teaching Distributed Machine Learning & Deep Learning courses. She received an MS in Computer Science from UCLA with a focus on distributed machine learning. She speaks Mandarin Chinese fluently and enjoys cycling. Tathagata Das is an Apache Spark committer and a member of the PMC. He's the lead developer behind Spark Streaming and currently develops Structured Streaming. Previously, he was a grad student in the UC Berkeley at AMPLab, where he conducted research about data-center frameworks and networks with Scott Shenker and Ion Stoica.

Анотація

Learning Spark 2nd Edition

Всі характеристики

  • Бренд
  • Автор
  • Категорія
    Комп'ютерна література
  • Номер видання
    2-ге вид.
  • Рік
    2020
  • Сторінок
    300
  • Формат
    170х240 мм
  • Обкладинка
    М'яка
  • Тип паперу
    Офсетний
  • Мова
    Англійська
  • Ілюстрації
    Чорно-білі

Товар входить до категорії

  • Безкоштовна доставка Новою Поштою від 1'500,00 ₴
  • Безкоштовна доставка Укрпоштою від 200,00 ₴
  • Безкоштовна доставка Meest Поштою від 1'500,00 ₴
Персонально для вас
PostgreSQL 15 изнутри
238928
Егор Рогов
740 ₴
Elasticsearch, Kibana, Logstash и поисковые системы нового поколения
99755
Пранав ШуклаШарат Кумар
750 ₴
Apache Kafka. Потоковая обработка и анализ данных. 2-е издание
246812
Раджини СиварамКрит ПеттиГвен ШапираТодд Палино
750 ₴
Grokking Relational Database Design
302636
Qiang HaoMichail Tsikerdekis
750 ₴
Принципы организации распределенных баз данных
128631
М. Тамер ЁсуПатрик Вальдуриес
759 ₴799 ₴
Mongo DB. Повне керівництво
119793
Шеннон БрэдшоуЙон БрэзилКристина Ходоров
800 ₴
Ефективний Spark. Масштабування і оптимізація
78770
Холден КарауРейчел Уоррен
806 ₴
DAX для професіоналів Теорія та практика
239563
Розема М.Х. Влотман
798 ₴840 ₴
PostGIS в действии
244063
Регина ОбеЛео Хсу
850 ₴
Релевантний пошук з використанням Elasticsearch і Solr
66607
5/1
Джон БеррименТарнбулл Д.
774 ₴880 ₴
Graph Algorithms: Practical Examples in Apache Spark and Neo4j 1st Edition
114590
Mark NeedhamAmy E. Hodler
950 ₴
Потоковая обработка данных с Apache Flink
130811
5/1
Фабиан УэскеВасилики Калаври
980 ₴
Інші книги O'Reilly
Learning MySQL: Get a Handle on Your Data. 2nd Ed.
244765
Vinicius M. Grippa, Sergey Kuzmichev
2'100 ₴
Mastering Ethereum: Smart Building Contracts and Dapps 1st Edition
67017
Andreas M. Antonopoulos
3'291 ₴
Clean Code Cookbook: Recipes to Improve the Design and Quality of your Code 1st Edition
264156
Maximiliano Contieri
1'700 ₴
Practical Process Automation. Orchestration and Integration in Microservices and Cloud Native Architectures
153396
Bernd Ruecker
2'200 ₴
Hands-On Smart Contract Development with Hyperledger Fabric V2. Building Enterprise Blockchain Applications. 1st Ed.
244754
Matt Zand, Xun Wu
2'300 ₴
Site Reliability Engineering: How Google Runs Production Systems
38540
Niall Richard Murphy, Chris Jones, Jennifer Petoff, Betsy Beyer
800 ₴
Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems 1st Edition
48003
Martin Kleppmann
616 ₴700 ₴
Data Science from Scratch: First Principles with Python 2nd Edition
114588
Joel Grus
720 ₴900 ₴