With the development of Internet technology, today's networks generate a huge amount of information every day, including semi-structured and unstructured data. By analyzing this massive volume of information, organizations can find out what their customers really need and why they need it. Apache Hadoop has now become the driving force behind the development of the big data industry.
Facebook engineers believe they run the largest Hadoop-based data collection platform. Jay Parikh, vice president of infrastructure engineering at Facebook, says most of Facebook's website data is stored in a single 100 PB cluster, and that Facebook's cluster is unique compared to other companies' clusters.
The Facebook product team evaluates products by scanning 105 TB of data every 30 minutes. Facebook also manages millions of photos and billions of Like button log entries, which it uses to recommend content to users based on their preferences.
The following is Facebook's daily data traffic:
2.7 billion Like button actions
300 million photos uploaded to Facebook
70,000 queries executed (manual or automated)
More than 500 TB of new data
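To put the figures above in perspective, the short sketch below converts them into average per-second rates. It is only back-of-envelope arithmetic: it assumes decimal units (1 TB = 10^12 bytes) and a perfectly even load across the day, neither of which the article states, and the function and variable names are illustrative.

```python
# Back-of-envelope averages derived from the figures quoted in the article.
# Assumptions (not stated in the source): decimal units, uniform load.
TB = 10**12  # bytes in one terabyte (decimal)
GB = 10**9   # bytes in one gigabyte (decimal)
SECONDS_PER_DAY = 24 * 60 * 60


def per_second(total, seconds):
    """Average rate when `total` units are spread evenly over `seconds`."""
    return total / seconds


# 105 TB scanned every 30 minutes
scan_rate_gb_s = per_second(105 * TB, 30 * 60) / GB

# "More than 500 TB" of new data per day, so this is a lower bound
growth_rate_gb_s = per_second(500 * TB, SECONDS_PER_DAY) / GB

likes_per_s = per_second(2.7e9, SECONDS_PER_DAY)
photos_per_s = per_second(300e6, SECONDS_PER_DAY)
queries_per_s = per_second(70_000, SECONDS_PER_DAY)

if __name__ == "__main__":
    print(f"scan rate:   {scan_rate_gb_s:.1f} GB/s")
    print(f"data growth: {growth_rate_gb_s:.2f} GB/s (lower bound)")
    print(f"likes:       {likes_per_s:.0f} per second")
    print(f"photos:      {photos_per_s:.0f} per second")
    print(f"queries:     {queries_per_s:.2f} per second")
```

Under these assumptions, the scan workload averages roughly 58 GB per second and data growth at least about 5.8 GB per second. Real traffic is bursty, so peak rates would be higher than these averages.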
Original source: CNET (compiled by Li; revised by Zhang Zhiping)
(Responsible editor: The good of the Legacy)