Step one: Install Anaconda
Anaconda comes bundled with a set of scientific computing packages.
Download link: http://pan.baidu.com/s/1b4jWlg (password: fqq3)
Then configure the environment variables. The paths in this article assume Anaconda is installed on the D drive (D:\anaconda).
Start IPython from the command line to make sure that the IPython being launched is the one from Anaconda.
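One way to make this check explicit is to ask the running interpreter where it lives. The snippet below is only a minimal sketch: `runs_from` is a hypothetical helper written for this article, and D:\anaconda is the install location assumed above, so adjust it to your own path.

```python
import os
import sys


def runs_from(install_root, executable=sys.executable):
    """Return True if `executable` is located somewhere under `install_root`."""
    root = os.path.normcase(os.path.abspath(install_root))
    exe = os.path.normcase(os.path.abspath(executable))
    try:
        return os.path.commonpath([root, exe]) == root
    except ValueError:
        # Raised on Windows when the two paths are on different drives.
        return False


# Path of the interpreter that is actually running this session.
print(sys.executable)
# Assumed install location from this article; change it to match your machine.
print(runs_from(r"D:\anaconda"))
```

If the second line prints False, the command line is probably picking up a different Python installation first, which usually points to the order of entries in PATH.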
Step two: Install Spark (the JDK must be installed first)
Download the Spark installation package from the official website:
http://spark.apache.org/downloads.html
This is the version I used:
Do not forget to add the environment variables after extracting the archive.
Configure the following system variables:
This step is important: copy the E:\spark\python\pyspark folder into the D:\anaconda\Lib\site-packages directory.
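The copy can be done by hand in Explorer, but it can also be scripted. What follows is a minimal sketch, not part of the original procedure: `copy_pyspark` is a hypothetical helper, and it assumes Python 3.8 or later because of the `dirs_exist_ok` argument.

```python
import shutil
from pathlib import Path


def copy_pyspark(spark_home, site_packages):
    """Copy <spark_home>/python/pyspark into <site_packages>/pyspark."""
    src = Path(spark_home) / "python" / "pyspark"
    dst = Path(site_packages) / "pyspark"
    if not src.is_dir():
        raise FileNotFoundError(f"pyspark package not found at {src}")
    # dirs_exist_ok=True (Python 3.8+) lets the copy overwrite an earlier attempt.
    shutil.copytree(src, dst, dirs_exist_ok=True)
    return dst
```

With the paths used in this article, the call would be copy_pyspark(r"E:\spark", r"D:\anaconda\Lib\site-packages"). Afterwards, `import pyspark` from the Anaconda interpreter should at least find the package.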
Step three: Modify the spark-env file under spark\conf (I am not sure whether this step is necessary)
Add the following three lines:
export PYSPARK_PYTHON=/d:/anaconda
export PYSPARK_DRIVER_PYTHON=/d:/anaconda
export PYSPARK_SUBMIT_ARGS='--master local[*]'
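Since it is unclear whether this file is actually read, it can help to look at what the running Python session sees. The sketch below simply prints the three variables from this step, written in their usual upper-case form; `report_spark_env` is a hypothetical helper, and a value showing as not set only means the variable did not reach this particular process.

```python
import os

# The three variables from this step, in their usual upper-case spelling.
SPARK_ENV_NAMES = (
    "PYSPARK_PYTHON",
    "PYSPARK_DRIVER_PYTHON",
    "PYSPARK_SUBMIT_ARGS",
)


def report_spark_env(environ=None):
    """Return a dict mapping each variable name to its value, or None if unset."""
    env = os.environ if environ is None else environ
    return {name: env.get(name) for name in SPARK_ENV_NAMES}


for name, value in report_spark_env().items():
    print(f"{name} = {value if value is not None else '(not set)'}")
```

Running this in a notebook cell after the next step shows whether the settings were picked up by the session that PySpark started.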
Step four: Start PySpark
Once it has started successfully, the notebook page usually opens automatically in the browser at http://localhost:8888/tree#
Step five: Test sc
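A quick way to test it is to run a very small job. This is only a sketch: it assumes the notebook was started through PySpark so that a SparkContext named `sc` already exists in the session, and `smoke_test` is a hypothetical helper name.

```python
def smoke_test(sc):
    """Run a tiny job on an existing SparkContext and return the result."""
    # Distribute the integers 1..100 and add them up on the workers.
    rdd = sc.parallelize(range(1, 101))
    return rdd.sum()
```

In a notebook cell, evaluating `sc` should display the context object, and `smoke_test(sc)` should return 5050. If `sc` is not defined, the notebook was most likely not launched through PySpark.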
Success!
How to install and configure the PySpark Notebook under Windows