Big Data Learning Articles

Source: Internet
Author: User
Tags: solr, redis, cluster

ZooKeeper:

ZooKeeper analysis: http://www.cnblogs.com/sharpxiajun/archive/2013/06/02/3113923.html

HDFS:

The working process of the MapReduce program: http://www.aboutyun.com/thread-15494-1-2.html

Summary of HDFS small-file processing solutions: http://www.aboutyun.com/thread-14227-1-1.html

Hadoop learning summary: Introduction to HDFS: http://www.cnblogs.com/forfuture1978/archive/2010/03/14/1685351.html

MapReduce:

Hadoop notes: Why MapReduce v2 (YARN): http://www.cnblogs.com/LeftNotEasy/archive/2012/02/18/why-yarn.html

YARN: Architecture of the next-generation Apache Hadoop MapReduce framework: http://blog.csdn.net/colorant/article/details/9146201

Hadoop core architecture: internal mechanisms of HDFS, MapReduce, HBase, and Hive: http://blog.csdn.net/yczws1/article/details/19178265

Hadoop learning: technical details of WordCount, block, split, shuffle, map, and reduce: http://blog.csdn.net/yczws1/article/details/21899007

MapReduce scheduling and execution principles (article series):

Part 1: MapReduce scheduling and execution principles: job submission

Part 2: MapReduce scheduling and execution principles: job initialization

Part 3: MapReduce scheduling and execution principles: task scheduling

Part 4: MapReduce scheduling and execution principles: task scheduling (cont.)

JobTracker job startup process analysis: http://blog.csdn.net/androidlushangderen/article/details/41356521

Hadoop cluster job scheduling algorithm

Analysis of data skew in Hadoop: http://my.oschina.net/leejun2005/blog/100922

Hadoop source code analysis: How TextInputFormat handles rows that span splits: http://blog.csdn.net/bluishglc/article/details/9380087

Hive:

Basic Hive operations: http://www.aboutyun.com/thread-6867-1-1.html

Hive components and execution process: http://blog.csdn.net/lifuxiangcaohui/article/details/40262021

Technology in the big data era: Introduction to Hive: http://www.cnblogs.com/sharpxiajun/archive/2013/06/02/3114180.html

Hive architecture: http://blog.csdn.net/lifuxiangcaohui/article/details/40615843

Hive SQL performance optimization in the data warehouse

Implementation principles of Hive GROUP BY, JOIN, DISTINCT, etc.

HBase:

Hadoop core architecture: HBase: http://blog.csdn.net/yczws1/article/details/19178265

Introduction to the HBase system: overview

The detailed process of writing, storing, and reading data in HBase

HBase rowkey design A

Rowkey design of HBase (with examples)

The LSM tree: origin, design ideas, and its application to HBase indexing

Application of HBase in Sohu's content recommendation engine

HBase Modeling

Comparison of HBase and Oracle

An overview of the HFile storage structure and fast rowkey lookup

Summary of HBase secondary index schemes

Solr:

Lucene learning summary: the fundamentals of full-text retrieval

Solr study and summary (offline 1)

Solr study 2

Using Solr

Testing Solr-based multi-condition queries on HBase

Elasticsearch:

Elasticsearch study 1

Elasticsearch study 2

Elasticsearch usage code

Modifying Elasticsearch shard rules

Redis:

NoSQL and Redis

Redis cluster configuration

Playing with Redis in 15 days (bookmark: Redis learning series)

Kafka:

A quick introduction to the Kafka distributed message queue framework

Thoughts on Kafka reliability

Introduction to Kafka

In-depth analysis of Kafka

Flume, Kafka, and Storm log processing experience

"Collection layer": how to choose between Kafka and Flume

Getting started with Flume 1.5.0: installation, deployment, and example cases

Storm:

A quick introduction to Storm

Learning the design of real-time distributed stream computing from Storm and Spark

The difference between a distributed system and a cluster
