Flume installation and configuration, and flume installation ConfigurationFlumeInstallation and configuration
0. Follow jdk.
Download the jdk-1.8.0 and apache-flume Binary packagesSet the software path as follows:Jdk: // usr/local/jdk-1.8.0Flume:/opt/apache-
Apache Flume is a distributed, reliable, and efficient system that collects, aggregates, and moves data from disparate sources to a centralized data storage center. Apache Flume is not just used in log collection. Because data sources can be customized,flume can use the transfer of a large number of custom event data, including but not limited to website traffic
Flume ng Overview:Flume Ng is a distributed, highly available, reliable system that collects, moves, and stores disparate amounts of data into a single data storage system. Lightweight, simple to configure, suitable for a variety of log collections, and supports failover and load balancing. Where the agent contains Source,channel and Sink, three have formed an agent. The duties of the three are as follows:
Source: Used to consume (collect) th
more channel
Flume Channe: Save event know event is consumed by a Flume sink
Flume Sink: The event in the channel will be placed in an external source or sent to the Flume source of the next Flume agent.Note: Flume source
Reprint Please specify source: http://www.cnblogs.com/xiaodf/Flume as a Log collection tool, monitoring a file directory or a file, when new data is added, the acquisition of new data sent to the message queue.1 Installing the Deployment flumeTo collect local data from a data node, each node needs to have a flume tool installed to do data collection.1.1 Download and installGo to the official website to down
-env.sh, it'll be sourced# during Flume startup. # enviroment variables can is set here. Java_home=/usr/lib/jvm/java-7-Oracle # Give Flume more memory and pre-allocate, enable remote monitoring via jmx#java_opts= "-xms100m-xmx200m-dcom.sun.management.jmxremote"# Note that the Flume conf directory was always included in the classpath. #FLUME_CLASSPATH=""3) Verify
One: Flume Introduction and function
II: Flume installation and configuration and simple testing
A: Flume introduction and Functional Architecture 1.1 Flume Introduction: 1.1.1 Flume是Cloudera提供的一个高可用的,高可靠的,分布
1, Flume agent installation (using SPOOLDIR mode to obtain system, application and other log information)Note: Install with Jyapp userWhen a single virtual machine deploys multiple Java applications and needs to deploy multiple flume-agent for monitoring,The following configuration files need to be adjusted:The Spool_dir parameter in a
Label: Flume The demo is not saying. You can search by yourself.But now the internet is mainly Flume 1.4 version number of information. Flume 1.5 In a sensational big change. Assuming you're ready to try, I'm here to introduce you to the program minimization structure, and the data that uses Mongosink is stored in MongoDB. Completely independent of execution, wit
Flume's introduction is not much to say, we can search by ourselves. But the internet is mostly Flume 1.4 version or before the material, Flume 1.5 feeling change is very big, if you are ready to try, I here to introduce you to the minimization of the construction scheme, and use the Mongosink to the data into MongoDB. Completely single-machine operation, no master, no collector (plainly collector is an age
1.installationJdkrefer to the installation of the JDK here. 2.installationFlume2.1. DownloadFlume:http://flume.apache.org/download.html650) this.width=650; "Src=" https://s5.51cto.com/oss/201710/25/ Da9277a9d433278d21a9ccdef349d90a.png-wh_500x0-wm_3-wmp_4-s_3707767358.png "title=" 1.png "alt=" Da9277a9d433278d21a9ccdef349d90a.png-wh_ "/>Click the link: apache-flume-1.7.0-bin.tar.gz download. 2.2. Unpacking
Overview1-flume IntroductionSystem Requirements3-Installation and configuration4-Start and testI. Introduction to FlumeWebsite address: http://flume.apache.org/1-OverviewFlume is a distributed, reliable, and usable service for efficiently collecting, summarizing, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data streams. It has a reliable mechanism of
first part single node flume configuration
Installation Reference http://flume.apache.org/FlumeUserGuide.html
http://my.oschina.net/leejun2005/blog/288136
Here is a simple introduction, the command to run the agent
$ bin/flume-ng agent-n $agent _name-c conf-f conf/flume-conf.properties.template
1. The single node co
.sources.r1.command = Tail-f/home/hadoop/flume/flume/conf/source.txt (big difference in size tail-f solves a big problem for us)Sinks Component Type is loggerA1.sinks.k1.type = LoggerChannels component type is memoryA1.channels.c1.type = Memorya1.channels.c1.capacity = 1000a1.channels.c1.transactionCapacity = 100Connect the sources, the sinks and the pipes.A1.sources.r1.channels = C1A1.sinks.k1.channel = C1
I. Installation deployment of Flume: Flume installation is very simple, only need to decompress, of course, if there is already a Hadoop environment The installation package Is: http://www-us.apache.org/dist/flume/1.7.0/apache-
Flume collects.
Flume collection system structure diagram simple structure:Single Agent collects dataComplex structureTandem between multi-level agentsFlume Installation Deployment
Upload the installation package to the node on which the data source resides
Extract
TAR-ZXVF apache-
can be understood as a data flow pipeline (there are a lot of types, the documentation given in the example is used in memory, but the memory is not stable, so my test environment also changed to a file)A simple expression of three functions (not strictly stated): Source reads the log data and writes it to the channel, sink reads the data from the channel and writes it to its designated place. If sink write fails here, the data will accumulate in the channel until Sink returns to normal (this e
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.