Most of the projects I work on are enterprise applications, so the data volume is nowhere near big-data scale. Still, big data has recently become widespread in the Internet industry, so should we, as programmers, learn these new technologies? My approach is to study the ideas behind big data processing, how they are implemented, and the scenarios they are used in, so that we can borrow some of those ideas in our own project development and raise our development level.
==============================================
Hadoop ecosystem:
CPU: Hadoop MapReduce ------------------------ parallel programming model
Memory: HBase, a column-oriented NoSQL database ---------------- fast key-value lookups by row key (rows are kept sorted by key rather than in a hash table)
Storage: HDFS, a distributed file system -------------------------- stores large amounts of data
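To make the MapReduce model above more concrete, here is a minimal single-process sketch in Python. It only imitates the three stages (map, shuffle, reduce) on a word count; it does not use Hadoop's API, and the function names map_phase, shuffle, reduce_phase and word_count are made up for illustration. In real Hadoop the map and reduce calls would run in parallel on many machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values of one key into a single result."""
    return key, sum(values)


def word_count(lines):
    # Each line could be mapped on a different machine; here we just loop.
    pairs = [pair for line in lines for pair in map_phase(line)]
    # Each key group could likewise be reduced independently.
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data is big", "data is data"]))
```

The idea worth borrowing for ordinary projects is that map and reduce share no state, so the work can be split across threads or machines without changing the logic.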
======================================
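HBase: NoSQL database
Hive: data warehouse (queried with the SQL-like HiveQL; it is not a NoSQL store)
HDFS: distributed file system
Sqoop: ETL tool (bulk data transfer between relational databases and Hadoop)
ZooKeeper: distributed coordination service ------ note that it coordinates the nodes of a cluster (configuration, naming, locks, leader election); it is not a clock or time service.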
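As an illustration of the HBase entry in the list above, the following toy Python class imitates the shape of its data model: a row key maps to a set of "family:qualifier" columns, and rows are kept sorted by key so that a range of keys can be scanned cheaply. This is only a sketch under those assumptions; TinySortedTable and its methods are hypothetical names, not the HBase client API.

```python
import bisect


class TinySortedTable:
    """Toy table: row key -> {"family:qualifier": value}, rows sorted by key."""

    def __init__(self):
        self._keys = []  # row keys, always kept in sorted order
        self._rows = {}  # row key -> dict of columns

    def put(self, row_key, column, value):
        """Store one cell; create the row on first write."""
        if row_key not in self._rows:
            bisect.insort(self._keys, row_key)
            self._rows[row_key] = {}
        self._rows[row_key][column] = value

    def get(self, row_key, column):
        """Point lookup by row key; returns None if the cell is missing."""
        return self._rows.get(row_key, {}).get(column)

    def scan(self, start, stop):
        """Yield (row_key, columns) for start <= row_key < stop, in key order."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, stop)
        for key in self._keys[lo:hi]:
            yield key, self._rows[key]


if __name__ == "__main__":
    table = TinySortedTable()
    table.put("user2", "info:name", "Li")
    table.put("order1", "info:total", "42")
    table.put("user1", "info:name", "Wang")
    print(table.get("user1", "info:name"))
    print([key for key, _ in table.scan("user", "userz")])
```

Because keys are sorted, choosing a row key with a common prefix (such as "user" here) keeps related rows next to each other, which is what makes range scans efficient.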
======================================
If you are studying big data, these are the technologies you should learn.
Partly adapted from: http://blog.csdn.net/woshiwanxin102213/article/details/19688393