The Data Revolution Speaker (the father of Hadoop Doug Cutting lectures at Tsinghua University)

Source: Internet
Author: User

2014-12-12 14:30two-way multifunctional hall of Fit building, Tsinghua Universitythe whole lecture lasted about one hours, about two and a half hours before Doug cutting a total of about 7 ppt, after half an hour of interaction.
Doug Cutting a total of about 7 Zhang Ppt,ppt there is no content, each PPT only a title, the text is a picture, the content is mainly about their own open source business, Lucene, Hadoop and so on.
PPTOne: Means for Change:hardwareWith Moore's law, it's very fast to talk about processors and store these hardware updates. This is a hardware foundation.

PPT: Fuel for Change:datahere is a logic that leads to the importance of open source. first put forward to software is eating the industry, software development, resulting in a variety of data, and the amount of data is very large, very high value, so need to have tools to process this data, This leads to the next ppt:opensource.
PPT Three: Seeds for Change:open Source
about the benefits of open source software is about to say a bit, did not speak particularly much, is generally easy to open, useful and therefore used. One of the ideas that he started his own open source business was that when he was doing lucene, he found that he was not fit to engage in businesses, so give it away~~This PPT also mentions three important component, does not hear what is three parts, probably is the entire computer industry? three are: Hardware, Data, software
PPTfour: New datastyle:hadoopthis ppt was introduced hadoop,hadoop about a bit. Many of Gfs,hadoop's ideas refer to GFs. Google published the paper, put forward its theory, everyone is very interested, but not Google's reasons, so it is not very convenient to use. At this time Hadoop came out, opensource convenient, easy to get. Has its natural affinity for the people. Doug Cutting mentioned that he went to Yahoo, because Yahoo needs to deal with a lot of data, and a lot of hardware can be used, and they are very fit.
PPT Five: Style Catches on:ecosystemThe introduction of Hive, pig, spark, etc., not too much to say.
PPT Six: Victor emerges:enterprise Data HubRoughly speaking of his work in Cloudera, introducedEnterprise Data Hub is important. Remember to say a word: I am Lucky in the "right" and "the" time. (grammar feeling a bit awkward)mentioned this is the future tool.
PPT Seven: the Data multi-toolIt's almost over, and speaking of some of the existential implications of Hadoop, an example of this is the PPT picture, which is a mobile phone. The general meaning is: mobile phone can do a lot of things, such as photography, but the function of photography than some professional camera. But one thing can be sure, we use mobile phones more time than the camera, why, because the phone has been around you, you can use, and in addition to photography, I can also share photos, in general, is already there, and convenient. Hadoop is similar, and now has a lot of computational frameworks, such as Spark, Storm. This situation does not have to deny other existence, Hadoop people will be more familiar with, and the application is very broad, when you need, you may have a Hadoop cluster environment, some calculations may be better spark performance, but Hadoop can do, easy to use. This reminds me of the operating system, not necessarily windows is the best, but everyone is accustomed to, that is enough, and then appear a new operating system, unless you make me feel that I do not want to use Windows, Windows is enough, do not have to replace it, similar truth.
Finally, the question Time, big should record a few questions:1. Security issues. Doug Cutting the approximate meaning of the answer: technical solutions +social solution.
2.relational database and NoSQLThis is not really a new problem,Doug Cutting said one point: each had its usesthe existence of 3.spark,storm, such as Spark is memory, Hadoop is now HDFs, do you want to learn from spark?Doug Cutting's approximate answer is, this is ecosystem, each component has its role, each good job can, I am happy to see Spark
Also, this is open source software, not a company controlled by another Hadoop control spark, two companies in the competition. Because it is open source, the ultimate goal is for everyone to use. 4. What is BigdataDoug Cutting answered a long string and finally sounded the point: not the Size,it's the style.
Here , Bigdata is a way of thinking, a kind of treatment of the embodiment. Can I understand how much of the data is not important and what is important is the approach to processing? 5. Cloudera and Hortonworks were asked. Doug Cutting also answered some polite words, and then said: Happy competition.

also: Ask for a book. Go a little later, you can findDoug cutting himself signed and photographed. Doug cutting people very good, very kind, in addition particularly high, about 1.8-meter feel his chin around, the pressure is too big, he signed at the time is kneeling down on the ground, see I was very touched.
The book says: Enjoy Hadoop and sign yourself. found that I wrote the article has been crawled by other sites, so leave an address: http://blog.csdn.net/picassolovecoding


The Data Revolution Speaker (the father of Hadoop Doug Cutting lectures at Tsinghua University)

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.