Read-only disc memory CD-ROM is the normal CD that we often use. CD-ROM discs can be used to cross storage of large volumes of text, sound, graphics and images and other media digital information for quick retrieval, so the CD-ROM drive has become a standard configuration of multimedia computers.
Flume: Used to collect logs and transfer logs to KAKFAKafka: As a cache, store logs from FlumeES: As a storage medium, store logsLogstash: True filtering of logsFlume deploymentGet the installation package, unzip1 wget http://10.80.7.177/install_package/apache-flume-1.7.0-bin.tar.gz tar ZXF apache-flume-1.7.0-bin.tar.gz-c/usr/local/Modify the flumen-env.sh scri
1. Introduction to Flume
Flume is a distributed, reliable, andHigh AvailabilityA system that aggregates massive logs. It supports customization of various data senders to collect data. Flume also provides simple processing of data and writes it to various data receivers (customizable) capabilities.Design goals:
(1) Reliability
When a node fails, logs can be trans
ObjectiveFirst look at the definition of event in Flume official websiteA line of text content is deserialized into an event "serialization is the process of converting an object's state into a format that can be persisted or transmitted. Relative to serialization is deserialization, which transforms a stream into an object. These two processes combine to make it easy to store and transfer data ", the maximum definition of event is 2048 bytes, exceedi
From Bin/flume this shell script can see Flume starting from the Org.apache.flume.node.Application class, which is where the main function of Flume is.The Main method first parses the shell command, assuming that the specified configuration file does not exist and then dumps the exception.According to the command contains the "no-reload-conf" parameters, decided
Kafkasink.jar, copy it to the Flume/lib directory on the node where Flume resides, and then you need to Kafka_2.10-0.8.2.0.jar, Kafka-clients-0.8.2.0.jar, Metrics-core-2.2.0.jar, Scala-library-2.10.4.jar These 4 jar packages are copied to the Flume/lib directory on the node where Flume resides.3. Start the agent of
Flume load Balancing is the choice of a certain algorithm per sink output to the specified place, if the file output is very large, load balancing is still necessary, through the output of multiple channels to alleviate the output pressureFlume built-in load balancing algorithm by default is round robin, polling algorithm, ordered selectionHere's a look at the specific examples:# Name The components in this agenta1.sources = R1a1.sinks = K1 k2a1.chann
IconAs shown in the Red box section, I do stability testing, when the flume run a few days later, I found that the counter value gradually become larger, to a certain value, and then become smaller, there is a cycle of the process, and therefore the desire to produce research, the following to see:if (Txneventcount = = 0) { sinkcounter.incrementbatchemptycount (); } else if (Txneventcount = = batchsize) { Sinkcounter.incrementbatchc
internal selection of a valid sink for processingThe exception section, we found that triggered the informsinkfailed () method, let's take a look at the methodpublic void Informfailure (T failedobject) {//if There are no Backoff this method is a no-op. if (!shouldbackoff) {return; } failurestate state = Statemap.get (Failedobject); Long now = System.currenttimemillis (); Long delta = now-state.lastfail; /* * When do we increase the Backoff period? * We Basically calculate the ti
1.installationJdkrefer to the installation of the JDK here. 2.installationFlume2.1. DownloadFlume:http://flume.apache.org/download.html650) this.width=650; "Src=" https://s5.51cto.com/oss/201710/25/ Da9277a9d433278d21a9ccdef349d90a.png-wh_500x0-wm_3-wmp_4-s_3707767358.png "title=" 1.png "alt=" Da9277a9d433278d21a9ccdef349d90a.png-wh_ "/>Click the link: apache-flume-1.7.0-bin.tar.gz download. 2.2. Unpacking the installation package$ tar zxvf apache-
Multiplexing technology is intended to send an event to a specific channel based on configuration information.A source instance can specify multiple channels, but a sink instance can only specify one channel.Flume supports fanning out the flow from one source to multiple channels. There is modes of fan out, replicating and multiplexingFlume supports two modes of output from source to multiple channel: copy and Reuse1. In copy mode, the event data received by source is output to all channel confi
Basic concepts of flume, data stream model, and flume data stream1. Basic concepts of flume
AllFlumeAll related terms are in italic English. The meanings of these terms are as follows.
FlumeA reliable and distributed system for collecting, aggregating, and transmitting massive log data.
Web ServerOne generationEvents.
Agent flumeA node in the system contains thre
I. Introduction of FlumeFlume is a distributed, reliable, and highly available mass-log aggregation system that enables the customization of various data senders in the system for data collection, while Flume provides the ability to simply process the data and write to various data-receiving parties (customizable).Design objectives:(1) ReliabilityWhen a node fails, the log can be transmitted to other nodes without loss.
good performance where multiple disks is not available for checkpoint and data Directori Es.It is natural that the channel data is synchronized to disk and performance degrades, but the checkpoint mechanism is added to prevent data loss.For the deformed memory channel, which is the memory channel and the file channel used together, we do not explain here, because this mixed use, the official also give hints-not recommended in the production environment to use.The reason for this is that data lo
a certain range, it will flushprivate void Flusheventbatch (listFlush is the event in the EventList that is now being saved and emptied1. Put the event into the configured channelFor (event event:events) { listHere is the detailed procedure for putting the event into the channel, but here you notice that there are two selector getchannel methods, because there are two types of channel selector modes: Multiplexing and Replication if (restart) { logger.info ("Restarting in {}ms, ex
Install flume
1, to the official website download flume, download address: http://flume.apache.org/download.html
2, [root@bicloud77 home]# tar zxvf apache-flume-1.5.2-bin.tar.gz
3, [root@bicloud77 home]# CD Apache-flume-1.5.2-bin
4,[root@bicloud76 apache-
2017-09-06 Zhu Big Data and cloud computing technologies Any production system will produce a large number of logs during operation, and the log often hides a lot of valuable information. These logs are stored for a period of time and are cleaned up before the method is parsed. With the development of technology and the improvement of analytical ability, the value of log is re-valued. Before you analyze these logs, you need to collect the logs that are scattered across production systems. Thi
Document Location:Http://flume.apache.org/FlumeUserGuide.html#system-requirements
Java Runtime Environment-java 1.8 or later (Java version must be 1.8 or higher)
Memory-sufficient memory for configurations used by sources, channels or sinks (to have enough RAM for channel and source use)
Disk Space-sufficient disk Space for configurations used by channels or sinks (requires enough memory if channel is file type)
Directory permissions-read/write Permissions for directories us
1, download the latest flume on the official website of Flumewget http://124.205.69.169/files/A1540000011ED5DB/mirror.bit.edu.cn/apache/flume/1.6.0/ apache-flume-1.6.0-bin.tar.gz 2. Solve Flume installation packagecd/export/software/TAR-ZXVF apache-flume-1.6.0-bin.tar.gz-c/e
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.