Zeppelin is an Apache incubation project.
A web-based notebook that supports interactive data analysis. You can use SQL, Scala, and so on to make data-driven, interactive, and collaborative documents. (similar to Ipython notebook, you can write code, notes and share directly in the browser)
Multi-purpose notebooks
To achieve what you need:
- Data Acquisition
- Data Discovery
- Data Analysis
- Visualization and collaboration of data
Supports multiple languages, the default is Scala (behind the Spark shell), sparksql, Markdown and Shell.
You can even add your own language support. How to write a Zeppelin interpreter
Apache Spark Integration
Zeppelin provides built-in Apache Spark integration. You don't need to build a module, plug-in, or library separately.
The Zeppelin Spark Integration provides:
- Automatic introduction of Sparkcontext and SqlContext
- Load the jar packages that are dependent on the runtime from the local file system or from the MAVEN library. more about dependent loader
- can cancel job and show job progress
Visualization of data
Some basic charts are already included in the Zeppelin. Visualization is not limited to sparksql queries, and the output of any language in the backend can be identified and visualized.
Dynamic tables
Zeppelin can create some input formats dynamically in your notebook.
More about dynamic Tables
Collaboration
Notebook URLs can be shared between collaborators. Zeppelin can then broadcast any changes in real time, just like in Google Docs.
Release
Zeppelin provides a URL to show only the results, and that page does not include Zeppelin menus and buttons. This way, you can easily integrate it into your site as an IFRAME.
100% Open Source
Apache Zeppelin (hatching) is the Apache2 licensed software. Check out the source library and how to contribute.
Zeppelin has a very active development community. Join the mailing list and report issues on our issue tracker has a very active development community. Join the mailing list and report issues on our Issue tracker.
The above translations are from the official website of Apache Zeppelin (incubating).
Because binary installation packages are not currently available, you need to compile them yourself.
Compile
-Pspark-1.3-Dhadoop.version=2.3.0-Phadoop-2.3-DskipTests
[INFO]------------------------------------------------------------------------[INFO] Reactor Summary:[info] [INFO] Zeppelin ...... ..... ....... .................. SUCCESS [8.289S][info] zeppelin:interpreter ......... ............. SUCCESS [33.977S][info] zeppelin:zengine ......... ................ SUCCESS [9.707S][info] Zeppelin:spark ......... .................. SUCCESS [2:14.821S][info] Zeppelin:markdown interpreter ......... ..... SUCCESS [4.486S][info] Zeppelin:shell interpreter ......... ........ SUCCESS [4.506S][info] zeppelin:web application ......... ......... SUCCESS [2:09.064S][info] zeppelin:server ......... ................. SUCCESS [31.863S][info] zeppelin:packaging distribution ........ ..... SUCCESS [0.517S][info]------------------------------------------------------------------------[INFO] BUILD Success[info]------------------------------------------------------------------------[INFO] Total Time:5:58.114S[info] Finished at: Tue Mar to -: -: toPdt -[INFO] Final Memory: them/307M[info]------------------------------------------------------------------------
Compile successfully (the network is not good, it takes a long time).
Start:
zhf@ubuntu:~/Downloads/incubator-zeppelin$ ./bin/zeppelin-daemon.sh startPid dir doesn‘t exist, create /home/zhf/Downloads/incubator-zeppelin/runZeppelin start [ OK ]
Open in Browser: http://localhost:8080 can enter Zeppelin home page.
Start creating a notebook and you can use it.
About Apache Zeppelin