One: Flume Introduction and function
II: Flume installation and configuration and simple testing
A: Flume introduction and Functional Architecture 1.1 Flume Introduction: 1.1.1 Flume是Cloudera提供的一个高可用的,高可靠的,分布式的海量日志采集、聚合和传输的系统,
Flume is a distributed, reliable, and highly available system for collecting, aggregating, and transmitting large volumes of logs. Support for customizing various data senders in the log system for data collection, while Flume provides the ability to simply process the data and write to various data recipients (such as text, HDFS, hbase, etc.).First, what is Flume
Questions Guide:1.flume-ng and Scribe, where is the advantage of Flume-ng?2. What issues should be considered in architecture design considerations?3.Agent How can I fix it?Does 4.Collector panic have an impact?What are the measures for 5.flume-ng reliability (reliability)?The U.S. mission's log collection system is responsible for the collection of all business
Scenario 1. What is Flume 1.1 backgroundFlume, as a real-time log collection system developed by Cloudera, has been recognized and widely used by the industry. The initial release version of Flume is now collectively known as Flume OG (original Generation), which belongs to Cloudera. But with the expansion of the FLume
[TOC]Non-clustered configurationThis situation is not cluster configuration, relatively simple, you can directly refer to my collation of the "Flume notes", the basic structure of the following:Flume multiple agents of a cluster a source structure descriptionThe structure diagram is as follows:The description is as follows:即可以把我们的Agent部署在不同的节点上,上面是两个Agent的情况。其中Agent foo可以部署在日志产生的节点上,比如,可以是我们web服务器例如tomcat或者nginx的节点上,foo的source可以配置为监控日志文件数据的变化,channel则
Copyright NOTICE: This article is Yunshuxueyuan original article.If you want to reprint please indicate the source: http://www.cnblogs.com/sxt-zkys/QQ Technology Group: 299142667
the concept of flume1. As a real-time log collection system developed by Flume, Cloudera has been recognized and widely used by the industry. The initial release version of Flume is now collectively known as
* Flume Framework FoundationIntroduction to the framework:* * Flume provides a distributed, reliable, and efficient collection, aggregation, and mobile service for large data volumes, flume can only be run in a UNIX environment.* * Flume is based on streaming architecture, fault-tolerant, and flexible and simple, mainl
START: Flume is a high-availability, highly reliable, open-source, distributed, high-volume log collection system provided by Cloudera, where log data can flow through flume to storage terminal destinations. The log here is a general term, refers to the file, Operation Records and many other data.First, flume basic Theory 1.1 Common distributed log Collection sys
1.flume conceptFlume is a distributed, reliable, highly available system for efficient collection, aggregation, and movement of large amounts of log data from different sources, and centralized data storage.Flume is currently a top-level project for Apache.Flume need Java running environment, require java1.6 above, recommended java1.7.Unzip the downloaded Flume installation package to the specified director
START: Flume is a high-availability, highly reliable, open-source, distributed, high-volume log collection system provided by Cloudera, where log data can flow through flume to storage terminal destinations. The log here is a general term, refers to the file, Operation Records and many other data.First, flume basic Theory 1.1 Common distributed log Collection sys
The previous introduction of how to use thrift source production data, today describes how to use Kafka sink consumption data.In fact, in the Flume configuration file has been set up with Kafka sink consumption dataAgent1.sinks.kafkaSink.type =Org.apache.flume.sink.kafka.KafkaSinkagent1.sinks.kafkaSink.topic=TRAFFIC_LOGagent1.sinks.kafkaSink.brokerList=10.208.129.3:9092,10.208.129.4:9092,10.208.129.5:9092agent1.sinks.kafkaSink.metadata.broker.list=10.
Original link: Kafka combat-flume to KAFKA1. OverviewIn front of you to introduce the entire Kafka project development process, today to share Kafka how to get the data source, that is, Kafka production data. Here are the directories to share today:
Data sources
Flume to Kafka
Data source Loading
Preview
Let's start today's shared content.2. Data sourcesThe data produced by Kafka i
Flume integrated Kafka:flume capture business log, sent to Kafka installation deployment KafkaDownload1.0.0 is the latest release. The current stable version was 1.0.0.You can verify your download by following these procedures and using these keys.1.0.0
Released November 1, 2017
Source download:kafka-1.0.0-src.tgz (ASC, SHA512)
Binary Downloads:
Scala 2.11-kafka_2.11-1.0.0.tgz (ASC, SHA512)
Scala 2.12-kafka_2
This article describes the initial process of using flume to transfer data to MongoDB, covering environment deployment and considerations.1 Environment Constructionrequires JDK, flume-ng, MongoDB java driver, Flume-ng-mongodb-sink(1) jdk:http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html(2) flune-ng:http://www.apache.org/dyn/close
Flume custom hbasesink class
Reference (to the original author) http://ydt619.blog.51cto.com/316163/1230586Https://blogs.apache.org/flume/entry/streaming_data_into_apache_hbaseSample configuration file of flume 1.5
# Name the components on this agenta1.sources = r1a1. sinks = k1a1. channels = c1 # Describe/configure the sourcea1.sources. r1.type = spooldira1.sour
Flume Simple Introduction
When you see this article, you should have a general understanding of the flume but to take care of the students just getting started, so still will say Flume, just start using flume do not need to understand too much inside things, only need to understand the following map can use the
1) Introduction
Flume is a distributed, reliable, and highly available system for aggregating massive logs. It supports customization of various data senders in the system for data collection. Flume also provides simple data processing, and write the capabilities of various data receivers (customizable.
Design goals:(1) ReliabilityWhen a node fails, logs can be transferred to other nodes without being lost.
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.