The previous introduction of how to use thrift source production data, today describes how to use Kafka sink consumption data.In fact, in the Flume configuration file has been set up with Kafka sink consumption dataAgent1.sinks.kafkaSink.type =Org.apache.flume.sink.kafka.KafkaSinkagent1.sinks.kafkaSink.topic=TRAFFIC_LOGagent1.sinks.kafkaSink.brokerList=10.208.129.3:9092,10.208.129.4:9092,10.208.129.5:9092ag
ERROR Log event analysis in kafka broker: kafka. common. NotAssignedReplicaException,
The most critical piece of log information in this error log is as follows, and most similar error content is omitted in the middle.
[2017-12-27 18:26:09,267] ERROR [KafkaApi-2] Error when handling request Name: FetchRequest; Version: 2; CorrelationId: 44771537; ClientId: ReplicaFetcherThread-2-2; ReplicaId: 4; MaxWait: 50
1. OverviewIn the "Kafka combat-flume to Kafka" in the article to share the Kafka of the data source production, today for everyone to introduce how to real-time consumption Kafka data. This uses the real-time computed model--storm. Here are the main things to share today, as shown below:
Data consumption
First attach the Kafka operation log profile: Log4j.propertiesSet the log according to the appropriate requirements.#日志级别覆盖规则 Priority: All off#1The . Sub-log Log4j.logger overwrites the primary log Log4j.rootlogger, where the log output level is set, threshold sets the Appender log receive level;2. Log4j.logger level below Threshold,appender receive level depends on threshold level;3the Log4j.logger level above the Threshold,appender receive level de
To start the Kafka service:
bin/kafka-server-start.sh Config/server.properties
To stop the Kafka service:
bin/kafka-server-stop.sh
Create topic:
bin/kafka-topics.sh--create--zookeeper hadoop002.local:2181,hadoop001.local:2181,hadoop003.local:2181-- Replication-facto
I. Kafka INTRODUCTION
Kafka is a distributed publish-Subscribe messaging System . Originally developed by LinkedIn, it was written in the Scala language and later became part of the Apache project. Kafka is a distributed, partitioned, multi-subscriber, redundant backup of the persistent log service . It is mainly used for the processing of active streaming data
I. OverviewThe spring integration Kafka is based on the Apache Kafka and spring integration to integrate KAFKA, which facilitates development configuration.Second, the configuration1, Spring-kafka-consumer.xml 2, Spring-kafka-producer.xml 3, Send Message interface Kafkaserv
Kafka-Storm integrated deploymentPreface
The main component of Distributed Real-time computing is Apache Storm Based on stream computing. The data source of real-time computing comes from Kafka in the basic data input component, how to pass the message data of Kafka to Storm is discussed in this article.0. Prepare materials
Normal and stable
Learning questions: Does 1.kafka need zookeeper?What is 2.kafka?What concepts does 3.kafka contain?4. How do I simulate a client sending and receiving a message preliminary test? (Kafka installation steps)5.kafka cluster How to interact with zookeeper? 1.
Flume and Kakfa example (KAKFA as Flume sink output to Kafka topic)To prepare the work:$sudo mkdir-p/flume/web_spooldir$sudo chmod a+w-r/flumeTo edit a flume configuration file:$ cat/home/tester/flafka/spooldir_kafka.conf# Name The components in this agentAgent1.sources = WeblogsrcAgent1.sinks = Kafka-sinkAgent1.channels = Memchannel# Configure The sourceAgent1.sources.weblogsrc.type = SpooldirAgent1.source
Background:In the era of big data, we are faced with several challenges, such as business, social, search, browsing and other information factories, which are constantly producing various kinds of information in today's society:
How to collect these huge information
how to analyze how it is
done in time as above two points
The above challenges form a business demand model, which is the information of producer production (produce), consumer consumption (consume) (processing analysis), an
I. Kafka INTRODUCTIONKafka is a distributed publish-subscribe messaging system. Originally developed by LinkedIn, it was written in the Scala language and later became part of the Apache project. Kafka is a distributed, partitioned, multi-subscriber, redundant backup of the persistent log service. It is mainly used for the processing of active streaming data (real-time computing).In big Data system, often e
1. Background information
Many of the company's platforms generate a large number of logs (typically streaming data, such as the PV of search engines, queries, etc.), which require a specific log system, which in general requires the following characteristics:
(1) Construct the bridge of application system and analysis system, and decouple the correlation between them;
(2) support the near real-time on-line analysis system and the off-line analysis system similar to Hadoop;
(3) with high scalabi
In the previous blog, how to send each record as a message to the Kafka message queue in the project storm. Here's how to consume messages from the Kafka queue in storm. Why the staging of data with Kafka Message Queuing between two topology file checksum preprocessing in a project still needs to be implemented.
The project directly uses the kafkaspout provided
Kafka producer production data to Kafka exception: Got error produce response with correlation ID-on topic-partition ... Error:network_exception1. Description of the problem2017-09-13 15:11:30.656 o.a.k.c.p.i.Sender [WARN] Got error produce response with correlation id 25 on topic-partition test2-rtb-camp-pc-hz-5, retrying (299 attempts left). Error: NETWORK_EXCEPTION2017-09-13 15:11:30.656 o.a.k.c.p.i.Send
Kafka ~ Validity Period of consumption, Kafka ~ Consumption Validity Period
Message expiration time
When we use Kafka to store messages, if we have consumed them, permanent storage is a waste of resources. All, kafka provides us with an expiration Policy for message files, you can configure the server. properies# Vi
KafkaSource Compilation reading environment constructionDevelopment Environment: Oracle Java 1.7.0_25 + idea + Scala 2.10.5 +gradle 2.1 + Kafka 0.9.0.1First,GradleInstallation Configuration Kafka code from 0.8.x Gradle to compile and build, you first need to install gradle gradle integrates and absorbs the maven > The main advantages are also overcome
Questions Guide
1. How to create/delete topic.
What processes are included in the 2.Broker response request.
How the 3.LeaderAndIsrRequest responds.
This article forwards the original link http://www.jasongj.com/2015/06/08/KafkaColumn3
In this paper, based on the previous article, the HA mechanism of Kafka is explained in detail, and the various HA related scenarios such as broker Failover,controller Failover,topic creation/deletion, broker initiati
Kafka's consumption model is divided into two types:1. Partitioned consumption model2. Group Consumption modelA. Partitioned consumption modelSecond, the group consumption modelProducer: PackageCn.outofmemory.kafka;Importjava.util.Properties;ImportKafka.javaapi.producer.Producer;ImportKafka.producer.KeyedMessage;ImportKafka.producer.ProducerConfig;/*** Hello world! **/ Public classKafkaproducer {Private FinalProducerproducer; Public Final StaticString TOPIC = "Test-topic"; PrivateKafkaproducer
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.