Spark: The Definitive Guide

Download or Read eBook Spark: The Definitive Guide PDF written by Bill Chambers and published by "O'Reilly Media, Inc.". This book was released on 2018-02-08 with total page 594 pages. Available in PDF, EPUB and Kindle.
Spark: The Definitive Guide
Author :
Publisher : "O'Reilly Media, Inc."
Total Pages : 594
Release :
ISBN-10 : 9781491912294
ISBN-13 : 1491912294
Rating : 4/5 (94 Downloads)

Book Synopsis Spark: The Definitive Guide by : Bill Chambers

Book excerpt: Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals. Youâ??ll explore the basic operations and common functions of Sparkâ??s structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications. Developers and system administrators will learn the fundamentals of monitoring, tuning, and debugging Spark, and explore machine learning techniques and scenarios for employing MLlib, Sparkâ??s scalable machine-learning library. Get a gentle overview of big data and Spark Learn about DataFrames, SQL, and Datasetsâ??Sparkâ??s core APIsâ??through worked examples Dive into Sparkâ??s low-level APIs, RDDs, and execution of SQL and DataFrames Understand how Spark runs on a cluster Debug, monitor, and tune Spark clusters and applications Learn the power of Structured Streaming, Sparkâ??s stream-processing engine Learn how you can apply MLlib to a variety of problems, including classification or recommendation


Spark: The Definitive Guide Related Books

Spark: The Definitive Guide
Language: en
Pages: 594
Authors: Bill Chambers
Categories: Computers
Type: BOOK - Published: 2018-02-08 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With
Learning Spark
Language: en
Pages: 289
Authors: Holden Karau
Categories: Computers
Type: BOOK - Published: 2015-01-28 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source
Learning Spark
Language: en
Pages: 400
Authors: Jules S. Damji
Categories: Computers
Type: BOOK - Published: 2020-07-16 - Publisher: O'Reilly Media

DOWNLOAD EBOOK

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you
Hadoop: The Definitive Guide
Language: en
Pages: 687
Authors: Tom White
Categories: Computers
Type: BOOK - Published: 2012-05-10 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

Ready to unlock the power of your data? With this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apa
Kafka: The Definitive Guide
Language: en
Pages: 315
Authors: Neha Narkhede
Categories: Computers
Type: BOOK - Published: 2017-08-31 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

Every enterprise application creates data, whether it’s log messages, metrics, user activity, outgoing messages, or something else. And how to move all of thi