Hue for Apache Hadoop

Source: Internet
Author: User
Tags gmp

    • What is Hue
    • Hue Architecture
    • Install and configure Hue on Hadoop
    • Tips for some common issues

1. What is Hue

Hue is one of Hadoop's most important projects, as it significantly increases a user's ease of access to the power of the Hadoop platform. While Hive and YARN provide a processing backbone for data analysts familiar with SQL to use Hadoop, Hue provides the inte Rface of choice for data analysts to quickly get connected with big data and Hadoop ' s powerful tools

2. Hue Architecture

Hue applications run in a Web browser and require no client installation. The following figure illustrates how Hue works. Hue Server is a "container" Web application this sits in between Hadoop installation and the browser. The It hosts all of the Hue Web applications and communicates with various servers, which interface with Hadoop components.

3. Install and configure Hue on Hadoop

Hue consists of a Web service that runs on a special node in the Hadoop cluster. At here, I'll use existing master node as the Hue Server.

3.1 Technical Details
    • Distribution:apache Hadoop HDFS
    • Cluster Manager:resource Manager
    • Environment:ali Cloud Server
    • Operating System:ubuntu 14.04 LTS
3.2 Features confirmed to work in partial or complete fashion
    • File Browser (HDFS access through Webhdfs or HTTPFS)
    • Hive/beeswax (beeswax uses the Hive client libraries)
    • HBase Cluster Browser (Requires Thrift 1 service)
    • Job Browser (Job information access through Hue-plugins)
3.3 Hue Dependencies

Hue employs some Python modules which use native code and requires certain development libraries is installed on the Syste M. To install from the Tarball, following components must is installed:

    • sudo apt-get install-y ant
    • sudo apt-get install-y gcc g++
    • sudo apt-get install-y libkrb5-dev Libmysqlclient-dev
    • sudo apt-get install-y libssl-dev libsasl2-dev libsasl2-modules-gssapi-mit
    • sudo apt-get install-y Libsqlite3-dev
    • sudo apt-get install-y libtidy-0.99-0 libxml2-dev Libxslt-dev
    • sudo apt-get install-y maven
    • sudo apt-get install-y Libldap2-dev
    • sudo apt-get install-y python-dev python-simplejson python-setuptools
3.4 Install and configure Hue (1) Download Hue 3.9 release tarball from below link.

http://gethue.com/hue-3-9-with-all-its-improvements-is-out/

(2) Unpackage the tarball to the Ubuntu opt directory.

(3) Make sure all the dependencies has been installed and then start the build process.

By default, Hue installs to '/usr/local/hue ' in the master node ' s local filesystem.

(4) The Hue configuration file can be found at '/usr/local/hue/desktop/conf/hue.ini '. Bellow is the changes.

Desktop

Hadoop

Hbase

Hive

3.5 Start Hue and browser the data.   (1) Start the Hue server using the ' Supervisor ' command. (2) Also need to start Hive2 server for using Hive (3) Start HBase Server and thrift server for using HBase.

3.6 Browser the data with Hue. (1) Using hsql to select demo data from hive database

(2) Create demo table by using Hbase browser

(3) Check jobs information

4.Tips for some common issues

(1) Sometimes the build process for the dependencies raises errors. So I suggest to manually install Ant and maven.for These II tools, you can directly download corresponding release Tarbal LS and then manually configure environment variable ant_home and Maven_home and PATH.

(2) as installed, the Hue installation folders and file ownership would be set to the ' root ' user. We ' d better to fix, so Hue can run correctly without root user permissions.

(3) For error message "creating BUILD/TEMP.LINUX-X86_64-2.7/SRC Gcc-pthread-fno-strict-aliasing-fwrapv-wall-wstr ict-prototypes-fpic-std=c99-o3-fomit-frame-pointer-isrc/-i/usr/include/-i/home/huser/miniconda/include/ Python2.7-c src/_fastmath.c-o build/temp.linux-x86_64-2.7/src/_fastmath.o src/_fastmath.c:36:18:fatal error:gmp.h: No such file or directory # include <gmp.h> ^ compilation terminated. Error:command ' gcc ' failed with exit status 1... "

is because the GCC had not found "gmp.h". You need to assure the installed "Libgmp3-dev" package and has gmp.h in path. Try to execute below command to install Libgmp3-dev.

#sudo Apt-get Install Libgmp3-dev

Hue for Apache Hadoop

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.