Ebook sparkadvanced data analytics, sparkanalytics
This book is a practical example of Spark for large-scale data analysis, written by data scientists at Cloudera, a big data company. The four authors first explained Spark based on the broad background of Data Science and big data analysis, then introduced basic knowledge about Data Processing Using Spark and Scala, and then discussed how to use Spark for machine learning, it also introduces several common algorithms in common applications. In addition, it also collects some more novel applications, such as querying Wikipedia or analyzing genetic data through text implicit semantic relationships.
Author Profile
Sandy Ryza is a data scientist at Cloudera and an active code contributor to the Apache Spark project. Led Spark development for Cloudera. He is also a member of the Hadoop Project Management Committee.
Uri Laserson is a data scientist at Cloudera and focuses on the Python part of the Hadoop ecosystem.
Sean Owen is the data science director of Cloudera's EMEA Region and a code contributor to the Apache Spark project. He created the Hadoop real-time large-scale Learning Project Oryx (formerly called Myrrix) based on Spark, Spark Streaming, and Kafka ).
Josh Wills is the senior director of Data Science at Cloudera, the initiator and Vice President of the Apache Crunch project.
Personal Learning is restricted and cannot be used for commercial purposes. Please delete it within 24 hours after the download.
Note: The resource is from the network. If there is any reason, you can trust me and delete it in seconds.
Ebook sparkadvanced data analytics Free Download
Https://page55.ctfile.com/fs/14299555-204462273