Ebook sparkadvanced data analytics, sparkanalytics

Source: Internet
Author: User
Tags hadoop ecosystem

Ebook sparkadvanced data analytics, sparkanalytics

This book is a practical example of Spark for large-scale data analysis, written by data scientists at Cloudera, a big data company. The four authors first explained Spark based on the broad background of Data Science and big data analysis, then introduced basic knowledge about Data Processing Using Spark and Scala, and then discussed how to use Spark for machine learning, it also introduces several common algorithms in common applications. In addition, it also collects some more novel applications, such as querying Wikipedia or analyzing genetic data through text implicit semantic relationships.

Author Profile
Sandy Ryza is a data scientist at Cloudera and an active code contributor to the Apache Spark project. Led Spark development for Cloudera. He is also a member of the Hadoop Project Management Committee.

Uri Laserson is a data scientist at Cloudera and focuses on the Python part of the Hadoop ecosystem.

Sean Owen is the data science director of Cloudera's EMEA Region and a code contributor to the Apache Spark project. He created the Hadoop real-time large-scale Learning Project Oryx (formerly called Myrrix) based on Spark, Spark Streaming, and Kafka ).

Josh Wills is the senior director of Data Science at Cloudera, the initiator and Vice President of the Apache Crunch project.

 
 
Personal Learning is restricted and cannot be used for commercial purposes. Please delete it within 24 hours after the download.
Note: The resource is from the network. If there is any reason, you can trust me and delete it in seconds.
Ebook sparkadvanced data analytics Free Download
Https://page55.ctfile.com/fs/14299555-204462273

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.