VU Boekhandel

Fast Data Processing with Spark 2 - Third Edition

Hoofdkenmerken

Auteur:	Krishna Sankar
Titel:	Fast Data Processing with Spark 2 - Third Edition
Uitgever:	Packt Publishing
ISBN:	9781785882968
ISBN boekversie:	9781785889271
Editie:	3
Prijs:	€ 31.16
Verschijningsdatum:	24-10-2016

Inhoudelijke kenmerken

Categorie:	Data Processing
Taal:	English
Imprint:	Packt Publishing

Technische kenmerken

Verschijningsvorm:

E-book

Inhoudsopgave:

\u003cp\u003e\u003cb\u003eLearn how to use Spark to process big data at speed and scale for sharper analytics. Put the principles into practice for faster, slicker big data projects.\u003c/b\u003e\u003c/p\u003e\u003ch2\u003eAbout This Book\u003c/h2\u003e\u003cul\u003e\u003cli\u003eA quick way to get started with Spark â and reap the rewards\u003c/li\u003e\u003cli\u003eFrom analytics to engineering your big data architecture, weâve got it covered\u003c/li\u003e\u003cli\u003eBring your Scala and Java knowledge â and put it to work on new and exciting problems\u003c/li\u003e\u003c/ul\u003e\u003ch2\u003eWho This Book Is For\u003c/h2\u003e\u003cp\u003eThis book is for developers with little to no knowledge of Spark, but with a background in Scala/Java programming. Itâs recommended that you have experience in dealing and working with big data and a strong interest in data science.\u003c/p\u003e\u003ch2\u003eWhat You Will Learn\u003c/h2\u003e\u003cul\u003e\u003cli\u003eInstall and set up Spark in your cluster\u003c/li\u003e\u003cli\u003ePrototype distributed applications with Spark's interactive shell\u003c/li\u003e\u003cli\u003ePerform data wrangling using the new DataFrame APIs\u003c/li\u003e\u003cli\u003eGet to know the different ways to interact with Spark's distributed representation of data (RDDs)\u003c/li\u003e\u003cli\u003eQuery Spark with a SQL-like query syntax\u003c/li\u003e\u003cli\u003eSee how Spark works with big data\u003c/li\u003e\u003cli\u003eImplement machine learning systems with highly scalable algorithms\u003c/li\u003e\u003cli\u003eUse R, the popular statistical language, to work with Spark\u003c/li\u003e\u003cli\u003eApply interesting graph algorithms and graph processing with GraphX\u003c/li\u003e\u003c/ul\u003e\u003ch2\u003eIn Detail\u003c/h2\u003e\u003cp\u003eWhen people want a way to process big data at speed, Spark is invariably the solution. With its ease of development (in comparison to the relative complexity of Hadoop), itâs unsurprising that itâs becoming popular with data analysts and engineers everywhere.\u003c/p\u003e\u003cp\u003eBeginning with the fundamentals, weâll show you how to get set up with Spark with minimum fuss. Youâll then get to grips with some simple APIs before investigating machine learning and graph processing â throughout weâll make sure you know exactly how to apply your knowledge.\u003c/p\u003e\u003cp\u003eYou will also learn how to use the Spark shell, how to load data before finding out how to build and run your own Spark applications. Discover how to manipulate your RDD and get stuck into a range of DataFrame APIs. As if thatâs not enough, youâll also learn some useful Machine Learning algorithms with the help of Spark MLlib and integrating Spark with R. Weâll also make sure youâre confident and prepared for graph processing, as you learn more about the GraphX API.\u003c/p\u003e\u003ch2\u003eStyle and approach\u003c/h2\u003e\u003cp\u003eThis book is a basic, step-by-step tutorial that will help you take advantage of all that Spark has to offer.\u003c/p\u003e