Big Data-spark-based machine learning-smart Customer Systems Project Combat

Source: Internet
Author: User
Tags zookeeper idf

Course Outline:
Section 1th introduction of the project and what can be learned in this course, how to apply it to the actual project 00:09:43 min .
2nd. Installation and use of Scala and IDE and installation of MAVEN plugin 00:07:04 minutes
3rd CentOS Environment Preparation (Java environment, hosts configuration, firewall off) 00:06:24 min
4th Scala Basics-1 00:08:51 min
5th Scala Basics Tutorial-functions and Closures-2 00:30:07 min
6th Scala Basics Tutorial-Arrays and collections-3.1 00:48:33 min
7th Scala Basics Tutorial-Arrays and collections-3.2 00:14:16 min
8th Scala Basics Tutorial-Classes and objects -400:23:06 minutes
9th Scala Basics-Feature and pattern matching -500:13:46 minutes
10th Scala Basics Tutorial-Regular expressions and exception handling -600:12:41 minutes
11th Scala Basics-Knowledge review 00:15:58 min
12th NoSQL Database MongoDB installation 00:04:57 min
section 13th Spring data for mongodb-simple connection mongodb00:07:52 min
section 14th Spring data for mongodb-spring configuration +crud operation (repo not implemented, default action) 00:36:20 min
section 15th Spring Data for mongodb-implementation Repo Interface +mongotemplate+crud operation 00:36:17 min
16th Spring data for mongodb-paged query 00:13:32 min
17th Section Zookeeper cluster installation 00:13:41 min
18th Section Zookeeper Basic introduction -100:22:36 minutes
19th Section Zookeeper working principle-election process (Basic Paxos algorithm) -200:24:27 min
20th Section Zookeeper working principle-election process (Fast Paxos algorithm) -300:31:16 min
21st kafka-Background and architecture introduction 00:12:28 min
22nd Section Kafka cluster installation and testing 00:14:29 minutes
23rd Kafka Data Sending and receiving implementation-java00:31:28 minutes
24th HDFs Stand-alone installation deployment 00:18:51 min
25th section connecting HDFs query store-java00:35:45 minutes
section 26th machine learning Basic Linear algebra introduction 00:05:08 min
27th Section Ikanalyzer Chinese word breaker 00:17:54 min
28th Section Ikanalyzer Chinese Word breaker tool with Java application 00:16:29 minutes
29th Spark and eco-circle introduction 00:11:45 min
Section 30th Introduction and principle of spark running architecture job,stage,task00:26:19 minutes
31st Spark programming model RDD design and operation principle 00:15:48 min
32nd Pure Handwriting First Spark application: wordcount00:23:57 minutes
section 33rd Rdd common function Introduction 00:29:22 minutes
section 34th Spark SQL Introduction, DataFrame creation and use, RDD DataFrame DataSet Mutual conversions 00:12:54 min
section 35th Spark Streaming introduction 00:12:56 min
36th Spark Streaming+kafka Integration Operation 00:18:44 min
37th section Avro combined with Maven for serialization and deserialization 00:21:07 min
Section 38th Spark ML (machine learning) Introduction (supervised learning, semi-supervised learning, unsupervised learning) 00:13:59 min
section 39th feature extraction: Introduction to TF-IDF principle 00:17:49 min
40th section feature extraction: TF-IDF code implementation 00:26:37 minutes
41st Section Clustering algorithm: Introduction to Kmeans principle 00:20:55 min
42nd Section Clustering algorithm: Kmeans code implementation 00:20:03 minutes
43rd section Other spark ml algorithms simple introduction 00:03:48 min
section 44th Spark Connection MongoDB code implementation 00:13:08 minutes
45th Section Mesos Overview of the overall architecture 00:08:25 min
46th Section Mesos installation deployment 00:12:04 minutes
47th Spark on Mesos installation deployment 00:11:12 min
48th. System Architecture Re-introduction + Technology Tandem Introduction (all the learning techniques are integrated into the project) 00:03:57 min
section 49th Project code: Parent project, managing versions of each jar 00:03:47 min
Section 50th Project code: AVRO serialization jar for client and machine learning to implement serialization and deserialization 00:04:46 min
section 51st Project code: Kafka send data jar, call to app and implement word cut and send data to kafka00:06:23 minutes
section 52nd Project Code: Tool class jar for operation of HDFs, word-cutting and operation mongodb00:03:28 minutes
section 53rd Project code: Manipulating the class jar, invoking the tool class to specifically cut words and data for cleaning and storing to hdfs00:05:34 minutes
section 54th Project code: Machine Learning Collection jar, mainly used for storing record00:02:56 minutes
The 55th section of the Project code: Machine learning algorithm jar, mainly for TF-IDF and Kmeans calculation, mainly to achieve upstream and downstream enterprises, supply and demand upstream and downstream model calculation 00:07:11 min
section 56th Project code: Streaming compute jar, mainly accepts the data load model that the client sends to Kafka to calculate 00:04:35 minutes
Section 57th Project code: Test simulation jar, main simulation implementation user load Avro serialized jar write data to kafka00:01:51 minutes
58th Spark on MESOS Deployment submission parameter description 00:08:17 min
section 59th Spark code submitted to Mesos run (spark-submit) 00:07:13 min
60th. Overall flow of the project running through, results show 00:06:54 minutes
61st Spark Tuning introduction 00:08:01 minutes
62nd Spark-based machine learning project-Intelligent Customer System Combat Course summary 00:04:12 min

63rd. Practical work and interview attention 00:03:45 minutes


: Baidu Network disk download

Original address: http://linyunbbs.com/thread-2142-1-1.html

Big Data-spark-based machine learning-smart Customer Systems Project Combat

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.