Configure oozie4.10+hadoop2.5.2

Source: Internet
Author: User

Finally, this mysterious looking person Oozie installation configuration succeeded, this troubled me for several days, when see the following screen, I think value!

Talk less, see how I compile and install the process:

(The HADOOP2.5.2HA environment has been set up, Hive,habase, Flume,stom have

Linux environment: centos6.5 64bit

jdk:1.7

MySQL has been installed

Apache Maven 3.1.1

Download Oozie installation package: oozie-4.1.0.tar.gz http://mirror.bit.edu.cn/apache/oozie/

Download Ext-2.2.zip http://oozie.apache.org/docs/4.0.1/DG_QuickStart.html This path has a ExtJS link

1. Compiling

Go to http://mirrors.cnnic.cn/apache/oozie/4.2.0/

The download is oozie-4.2.0.tar.gz

, then unzip:

TAR-ZXVF oozie-4.2.0.tar.gz

CD Oozie-4.2.0/bin

./mkdistro.sh-dskiptests-phadoop-2-dhadoop.auth.version=2.5.2-ddistcp.version=2.5.2-dsqoop.version=1.4.4- dhive.version=0.13.1

-dtomcat.version=7.0.52

A long wait, the network problem has been bothering me, I repeat the above command, found that, finally came here,

That's the wrong thing to compile here.
[INFO]------------------------------------------------------------------------[INFO] Reactor Summary:[info][info ] Apache Oozie Main ..... ....... ............... SUCCESS [6.824s][info] Apache Oozie Hadoop Utils hadoop-2-4.2.0 .... SUCCESS [9.525s][info] Apache Oozie Hadoop Distcp hadoop-2-4.2.0 ... SUCCESS [0.444s][info] Apache Oozie Hadoop Auth hadoop-2-4.2.0 Test ... SUCCESS [1.027s][info] Apache Oozie Hadoop Libs ......... ......... SUCCESS [0.101s][info] Apache Oozie Client ......... .............. SUCCESS [5:08.683s][info] Apache Oozie Share Lib Oozie ............. SUCCESS [9.351s][info] Apache Oozie Share Lib hcatalog ........ ..... SUCCESS [11.656s][info] Apache Oozie Share Lib Distcp ......... ..... SUCCESS [3.151s][info] Apache Oozie Core ..... ....... .............. SUCCESS [3:53.804s][info] Apache Oozie Share Lib streaming ....... ..... SUCCESS [13.230s][info] Apache Oozie Share Lib Pig ......... .......SUCCESS [15.454s][info] Apache Oozie Share Lib Hive ......... ....... SUCCESS [13.747s][info] Apache Oozie Share Lib Hive 2 ....... ....... SUCCESS [14.417s][info] Apache Oozie Share Lib Sqoop ............. SUCCESS [5.546s][info] Apache Oozie Examples ......... ............ SUCCESS [10.178s][info] Apache Oozie Share Lib Spark ........ ....... SUCCESS [15.450s][info] Apache Oozie Share Lib ......... ........... SUCCESS [52.422s][info] Apache Oozie Docs ...... ..... ............... FAILURE [9.477s][info] Apache Oozie WebApp ......... .............. Skipped[info] Apache Oozie Tools ..... ....... .............. Skipped[info] Apache Oozie minioozie ......... ........... Skipped[info] Apache Oozie distro ......... .............. Skipped[info] Apache Oozie ZooKeeper Security Tests .... ..... Skipped[info]------------------------------------------------------------------------[INFO] BUILD Failure[info]- -----------------------------------------------------------------------[INFO] Total time:12:21.113s[info] finished at:wed Oct 26 05:39:28 CST 2016[info] Final Memory:174m/482m[info]--------------------------------------------------------------                                                                                        ----------[ERROR] Failed to execute goal org.apache.maven.plugins:maven-site-plugin:2.0-be Ta-6:site (default) on Project Oozie-docs:the site descriptor can Not being resolve D from the Repositor                                                                                        Y:could Not transfer artifact Org.apache:apache:xml:site_en                                                                                        : From/to Codehaus Repository (http://repository.codehaus.org/): repository.co dehaus.org: Unknown name or service [ERROR] Org.apache:apache:xml: 16[error][error] from the specified rEmote Repositories:[error] Central (http://repo1.maven.org/maven2, Releases=true, Snapshots=false), [ERROR] CE D (https                                                                                        ://repository.cloudera.com/cloudera/ext-release-local/, Relea Ses=true, Snapshots=false), [ERROR] Codehaus repository (http://repository.codehaus.org/, Releas Es=true, SNA pshots=false), [ERROR]                                                                                        Cloudera COM (Https://repository.cloudera.com/content/repositories/relea ses/, Releases=true, Snapshots=false), [ERROR] central maven (http://central.maven.or                                                                                        g/maven2/, Releases=true, snapshot                                                             S=false), [ERROR] Apache.snapshots.repo (https://repository.apache.org/content/groups/snap                           Shots, Releases=true, snapshots=true), [ERROR] DataNucleus (http://www.datanucleus.org/downloads /maven2, Releases=true, snapshots=                                                                                        False), [ERROR] Apache.snapshots (http://repository.apache.org/snapshots, Releases=false , snapshots=true): Unknown host repository.codehaus.org: unknown name or service [ERR                                                                                        OR], [help 1][error][error] to see the full stack trace of the errors, re-run Maven with The-e Swit Ch.[error] Re-run Maven using the-x switch to Enab Le full debug logging. [ERROR] [ERROR]                                                                                        For more information on the errors and possible solutions, please REA D the following articles:[error] [Help 1] http://cwiki.apache.org/conflUence/display/maven/mojoexecutione  Xception[error][error] After correcting the problems, you can resume the build with the Command[error] mvn <goals> -rf:oozie-docserror, Oozie distro creation failed

  

Helpless, and then went to compile, 3.3.2, the result:

It's not going to compile anymore.

Here, I go to the net to find the reason:

Told me it was: MAVEN's storage address, so I changed the storage configuration:

Oozie root directory,

Pom.xml, modify the storage in the:<repositories></repositories> as follows:

<repositories> <repository> <id>cloudera com</id> <url>https://repos Itory.cloudera.com/content/repositories/releases/</url> <snapshots> <enabled>f alse</enabled> </snapshots> </repository> <repository> <id&                Gt;central</id> <url>http://repo1.maven.org/maven2</url> <snapshots>            <enabled>false</enabled> </snapshots> </repository><repository>            <id>central maven</id> <url>http://central.maven.org/maven2/</url> <snapshots> <enabled>false</enabled> </snapshots> </repository > <repository> <id>codehaus repository</id> <url>http://repository . CODEHAUS.ORG/&LT;/URL&GT <snapshots> <enabled>false</enabled> </snapshots> </repository > <repository> <id>apache.snapshots.repo</id> <url>https://reposit            ory.apache.org/content/groups/snapshots</url> <name>apache Snapshots Repository</name> <snapshots> <enabled>true</enabled> </snapshots> </reposi tory> <repository> <id>datanucleus</id> <url>http://www.datanucleu                S.org/downloads/maven2</url> <name>Datanucleus</name> <snapshots> <enabled>false</enabled> </snapshots> </repository> </repositories&gt ;

 

When you're done, continue compiling.

[INFO] Apache Oozie Docs ...... ..... ................ FAILURE [9.477s]

Ping address seems to be possible,

Really do not know why, try a few times not, estimated where needs to change, I have no way, the waiting to solve

2. Change the program, with other compiled good,

So it's Cloudera company.

http://archive.cloudera.com/cdh5/cdh/5/

This: http://archive.cloudera.com/cdh5/cdh/5/oozie-4.1.0-cdh5.8.2.tar.gz

, unzip after download: TAR-ZXVF oozie-4.1.0-cdh5.8.2.tar.gz

This is a hadoop2.6 of support.

Then I'll replace him with my Hadoop version hadoop2.5.2

The Concrete Way is: (thank him)

This reference this address: http://www.mamicode.com/info-detail-490284.html

1. Unzip

CP Oozie-4.1.0-distro.tar.gz/home/hadoop

Cd/home/hadoop

Tar xvzf oozie-4.1.0-distro.tar.gz

/home/hadoop/oozie-4.1.0 is the root directory of the Oozie

2. Setting Environment variables

Vi/etc/profile

Export Oozie_home=/home/hadoop/oozie-4.1.0export path= $PATH: $OOZIE _home/bin

Since then, in the source/etc/profile brought him into force

3. Introduction of the JAR package

Create a Libext folder under Oozie_home

mkdir Libext

Copy all the jar packages for Hadoop into this directory

CP $HADOOP _home/share/hadoop/*/hadoop-*.jar./libext/

CP $HADOOP _home/share/hadoop/*/lib/*.jar./libext/

CP Mysql-connector-java-5.1.29-bin.jar./libext/

Delete Libext in Jasper*.jar, Servlet-api.jar, Jsp-api.jar, and oozie-4.0.1/oozie-server/lib/under the jar package conflict, the war packet will be reported:

Org.eclipse.jdt.internal.compiler.CompilationResult.getProblems () [Lorg/eclipse/jdt/core/compiler/iproblem

4. Create a War package

Bin/oozie-setup.sh Prepare-war

Will generate @oozie_home/oozie-server/webapps/oozie.war

After extracting the Ext-2.2.zip, generate the ext-2.2 folder and enter the folder into Oozie.war. His approach is to start the service after the Oozie.war package will be decompressed to Oozie, so that after the ext-2.2 directly dragged in.

(My practice is, the above Oozie.war, download to the desktop, with the decompression tool to open, and then drag Ext-2.2.zip to Oozie.war, and later found that, in fact, it is not necessary, I opened it)

Note: 1. See the following command on the web to generate Oozie.war, and have ext-2.2.zip into the war package

./addtowar.sh-inputwar $OOZIE _home/oozie.war-outputwar $OOZIE _home/oozie-server/webapps/oozie.war-hadoop 2.3.0 $ Hadoop_home-extjs/home/oozie/ext-2.2.zip

2. You have the zip and unzip command, otherwise it will be packet error, go to the root user, with yum-y install unzip and yum-y installed Zip installation can be

5. Modify the configuration

VI $OOZIE _home/conf/oozie-site.xml

<property>   <name>oozie.service.JPAService.jdbc.driver</name>   <value> com.mysql.jdbc.driver</value>    <description>        jdbc Driver class.    </description></property><property>   <name>oozie.service.jpaservice.jdbc.url</ name>   <value>jdbc:mysql://mysql-server:3306/oozie</value>    <description>        JDBC URL.    </description></property><property>   <name>oozie.service.jpaservice.jdbc.username </name>    <value>root</value>    <description>        DB user name.    </description></property><property>   <name>oozie.service.jpaservice.jdbc.password </name>    <value>mapengbo</value>    <description>        DB user password.    </description></property>

  

6. Create a database

Create a database named Oozie and assign the right

CREATE DATABASE Oozie;

Grant all on oozie.* to ' shirdrn ' @ ' oozie-server ' identified by ' 0o21e ';

FLUSH privileges;

Generate the required database tables and execute the

bin/ooziedb.sh Create-sqlfile Oozie.sql–run

View database Oozie generated a related table for Oozie.

7. Start the service

bin/oozied.sh start

Accessing the console Http://hadoop1:11000/oozie HADOOP1 for my host name

Four Configure Jobhistory and users for Hadoop

Modify $hadoop_home/etc/hadoop/mapred-site.xml

and $oozie_home/conf/hadoop-conf/core-site.xml Add the following configuration.

<property>       <name>mapreduce.jobhistory.address</name>        <value>node3:10020</ value>     </property>    <property>       <name>mapreduce.jobhistory.webapp.address</ name>        <value>node3:19888</value>     </property>     <property>       <name >mapreduce.jobhistory.intermediate-done-dir</name>       <value>${hadoop.tmp.dir}/mr/history-tmp </value>     </property>    <property>        <name>mapreduce.jobhistory.done-dir< /name>       <value>${hadoop.tmp.dir}/mr/history-done</value></property>

  

You need to add the following to the Hadoop core-site.xml:

<property>                <name>hadoop.proxyuser.root.hosts</name>                <value>*</value>         </property>         <property>                <name>hadoop.proxyuser.root.groups</name>                < Value>*</value>         </property>

  

Root for Hadoop user, hadoop.proxyuser.root.groups property Configure user-owned group name, configure complete restart Hadoop

You can also write this: Hadoop.proxyuser. [User].hosts and Hadoop.proxyuser. [User].groups

 

Start the Hadoop history jobhistory Service

$HADOOP _home/sbin/mr-jobhistory-daemon.shstart Historyserver//This is a reboot of the HADOOP cluster

Restart Oozie

bin/oozied.sh start

Five Client Test

TAR–ZXVF oozie-client-4.1.0.tar.gz//This I used to be compiled in the oozie4.2.0 compiled well, found that I downloaded the Cloudera there is no this

Address: Link: http://pan.baidu.com/s/1eSBOdEi Password: Q1NW

TAR–ZXVF oozie-examples.tar.gz

TAR–ZXVF oozie-sharelib-4.1.0.tar.gz

HDSF Dfs-put Examples hdfs:/myserver/user/hadoop/

HDSF dfs-put share/user/hadoop/--//This later found no, need to configure the local directory path in Oozie-site.xml,

Configure Oozie.service.WorkflowAppService.system.libpath

A. Modify the $oozie_home/conf/oozie-site.xml file and add the following:

<property><name>oozie.service.workflowappservice.system.libpath</name><value>file:/// Home/${user.name}/oozie-4.1.0-cdh5.8.2/share/lib</value></property>

B. Modify the $oozie_home/conf/hadoop-conf/core-site.xml file and add the following:

<property>   <name>yarn.resourcemanager.address</name>      <value>node1:8032</ Value> (should be the same as the configuration of Hadoop,                            This I am in http://Your mapreduce hostname £ 8088/conf, and change it to the same) </property><property>     <name>yarn.resourcemanager.scheduler.address</name>      <value>node1:8030</value> </property>

  

C. Modify the Oozie.service.HadoopAccessorService.hadoop.configurations property to adjust its value to *=hadoop_home/etc/hadoop

---//This I do not have how to configure, you can look at this person configured http://heylinux.com/archives/2836.html

D. Modify $oozie_home/examples/apps/map-reduce/ Job.properties (There is no jobtracker in yarn, the following jobtracker fills in the yarn.resourcemanager.address value, Oozie.wf.application.path is the Oozie sample program in HDFs Path

    namenode=hdfs://node1:9000    jobtracker=node1:8032    queuename=default    examplesroot=examples    Oozie.wf.application.path=${namenode}/user/${user.name}/${examplesroot}/apps/map-reduce    outputDir= Map-reduce

  

Invoking the OOZIE script in $oozie_home/oozie-client-4.0.1/bin, executing the workflow

./oozie Job-oozie Http://node3:11000/oozie-config $OOZIE _home/examples/apps/map-reduce/job.properties-run

Access console Http://hadoop1:11000/oozie

Completed!

Configure oozie4.10+hadoop2.5.2

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.