Apache Zeppelin provides a web-based notebook, similar to the IPython notebook, for data analysis and visualization. Its back end can connect to different data processing engines, including Spark, Hive, and Tajo, and it natively supports Scala, Java, Shell, Markdown, and so on. Its overall look and usage are the same as Databricks Cloud, judging from the demo at the time.
Zeppelin is an Apache incubating project.
It is a web-based notebook that supports interactive data analysis. You can use SQL, Scala, and more to create data-driven, interactive, collaborative documents. (As with the IPython notebook, you can write code, take notes, and share them directly in the browser.)
Multi-purpose notebooks
The notebook covers everything you need:
- Data Ingestion
- Data Discovery
- Data Analysis
- Data Visualization and Collaboration
Multiple language back ends are supported. By default these are Scala (backed by the Spark shell), Spark SQL, Markdown, and Shell.
You can even add your own language support.
Apache Spark Integration
Zeppelin provides built-in Apache Spark integration. You don't need to build a module, plug-in, or library separately.
The Zeppelin Spark Integration provides:
- Automatic injection of SparkContext and SQLContext
- Loading of runtime jar dependencies from the local file system or from a Maven repository
- Job cancellation and display of job progress
Visualization of data
Some basic charts are already included in Zeppelin. Visualization is not limited to Spark SQL queries: output from any language back end can be recognized and visualized.
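As a rough illustration of how output from any back end can be recognized: Zeppelin's display system treats output that begins with `%table` as tabular data, with columns separated by tabs and rows separated by newlines, the first row being the header. The sketch below builds such a string in plain Python. The helper name `to_zeppelin_table` and the sample data are hypothetical and used only for illustration.

```python
def to_zeppelin_table(header, rows):
    """Format a header and rows as text for Zeppelin's %table display system.

    Columns are joined with tabs and rows with newlines; the first row is
    the header. Cells that themselves contain tabs or newlines are not
    escaped in this sketch.
    """
    lines = ["\t".join(str(col) for col in header)]
    for row in rows:
        lines.append("\t".join(str(cell) for cell in row))
    return "%table " + "\n".join(lines)


# Hypothetical sample data, for illustration only.
output = to_zeppelin_table(["language", "paragraphs"], [("scala", 12), ("sql", 7)])
print(output)
```

Printing a string like this from a notebook paragraph should let Zeppelin offer its built-in table and chart views for the result, regardless of which language produced it.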
Dynamic forms
Zeppelin can dynamically create input forms in your notebook.
Learn more about dynamic forms.
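To give a feel for the idea: in Zeppelin, a paragraph can contain a placeholder such as `${maxAge=30}`, which is rendered as a text input named `maxAge` with a default value of `30`, and the entered value is substituted into the paragraph before it runs. The sketch below only imitates that substitution step in plain Python. It is not Zeppelin's implementation, the function name `fill_form` and the sample query are hypothetical, and select forms with option lists are not handled.

```python
import re

# Matches ${name} or ${name=default}. Select forms, which add a
# comma-separated option list, are outside the scope of this sketch.
FORM_PATTERN = re.compile(r"\$\{(\w+)(?:=([^}]*))?\}")


def fill_form(template, values):
    """Replace each form placeholder with a supplied value or its default."""
    def replace(match):
        name, default = match.group(1), match.group(2)
        fallback = default if default is not None else ""
        return str(values.get(name, fallback))
    return FORM_PATTERN.sub(replace, template)


# Hypothetical query, for illustration only.
query = "SELECT * FROM bank WHERE age < ${maxAge=30}"
print(fill_form(query, {}))              # uses the default value
print(fill_form(query, {"maxAge": 45}))  # uses the value typed into the form
```

In a real notebook there is nothing to call: Zeppelin draws the input box above the paragraph and reruns the paragraph with the value the user enters.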
Collaboration
Notebook URLs can be shared between collaborators. Zeppelin can then broadcast any changes in real time, just like in Google Docs.
Publish
Zeppelin provides a URL that shows only the results; that page does not include Zeppelin's menus and buttons. This way, you can easily embed it in your site as an iframe.
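A minimal sketch of the embedding step, assuming you have already copied the result-only URL from Zeppelin. The helper name `iframe_for`, the default dimensions, and the placeholder URL are all hypothetical; the real URL comes from your own Zeppelin instance.

```python
from html import escape


def iframe_for(result_url, width=600, height=400):
    """Wrap a result-only URL in an iframe tag for embedding in a web page."""
    return '<iframe src="{}" width="{}" height="{}"></iframe>'.format(
        escape(result_url, quote=True), int(width), int(height)
    )


# Placeholder URL, for illustration only; it is not a real Zeppelin link.
snippet = iframe_for("http://localhost:8080/result-only-url")
print(snippet)
```

Escaping the URL keeps characters such as `&` or `"` from breaking the attribute when the snippet is pasted into a page.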
100% Open Source
Apache Zeppelin (incubating) is Apache 2.0 licensed software. Check out the source repository and how to contribute.
Zeppelin has a very active development community. Join the mailing list and report issues on our issue tracker.
The text above is translated from the official website of Apache Zeppelin (incubating).
Because binary installation packages are not currently available, you need to build Zeppelin from source yourself.
If there were a tool that let you write Shell, Python, and Scala code on the same web page, would you want it?
What if it could also run PySpark and Spark code? Is your heart beating faster yet?
What if you could write Hive SQL and Spark SQL in it?
What if you could save the code as a document that supports Markdown syntax?
What if you could visualize SQL results right in the web interface?
What if it also supported the R language, Kylin, Angular, and PostgreSQL? I don't believe you are not tempted.
What is Apache Zeppelin?