"BDTC lecturer" the father of Hadoop Doug Cutting:lucene to the open source of Hadoop

Source: Internet
Author: User
Keywords Large data hadoop bdtc bdtc2014 dougcutting

Doug cutting, with his passion for work and down-to-earth attitude, pioneered Lucene and Nutch's two successful Open-source search engine projects, as well as the founder of Hadoop, the current popular data computing framework. Doug graduated from Stanford University in 1985 and the first internship at Xerox laid the groundwork for his future research on search engine projects and success. At the end of 1997, Doug made a great breakthrough in theory to practice through Lucene, the first open source function library to provide Full-text search. On this basis, Doug has implemented Nutch, Hadoop. For the realization of his dream, Doug worked in Architext and Yahoo!, until 2009 as Cloudera's chief architect.


Lucene&nutch

Lucene is the first function library to provide Full-text text search, providing a simple and powerful application interface, and a high-performance, scalable information Search library. As a mature and free open source project, Lucene has been widely welcomed in the Java Information Retrieval Program Library. Developers can not only use it to build specific full-text search applications, but also integrate them into various system software, it provides a lot of API functions can be applied to a variety of practical applications.

Nutch is Doug on the basis of lucene to continue to deepen the idea of open source is a real application, it is built on the Lucene core of the implementation of Web search, the purpose is to reduce the complexity of the use of the process, And in the cost of a few cases to configure a world-class web search engine, to achieve out-of-the-box features. Site indexing and search are extended to search the global web, just like Google and Yahoo.

Hadoop

Hadoop is Doug based on Google Mapreducesystem developed an open source version, is an open source for large data distributed storage and processing platform, is a new era of application development of the necessary skills. Hadoop is a distributed platform that allows users to easily structure and use the following advantages:

high reliability, high scalability, high efficiency, low cost

Hadoop has performed exceptionally well from the moment it was first applied, greatly improving the speed of web search. Doug's goal is to develop Hadoop as a redhat in the cloud computing world. Looking at the focus of the current computing framework, Hadoop's success has been completely detached from Doug's imagination.

Doug is a legendary figure in cloud computing and big data, and has magically turned the search technology into a product. However, his secret of success is not mysterious--the passion for work and the earnest sureness of work. However, it was this well-known quality that made him successful, and almost all of them used his work directly or indirectly.

In December 2014 12-14th, the 2014 China Large Data Technology Congress (and the second CCF conference on large data) is inviting the father of Hadoop, Doug Cutting, to have the opportunity to take you through the best practices of Cloudera, a well-known Hadoop company.

More lecturers and schedule information please pay attention to the 2014 China Data Technology Conference (and the second CCF large data academic conference) official website. Another, now buy BDTC tickets to enjoy a maximum of 1500 yuan discount, the event until October 17. Advance Advance

Free Subscription "CSDN cloud Computing (left) and csdn large data (right)" micro-letter public number, real-time grasp of first-hand cloud news, to understand the latest big data progress!

CSDN publishes related cloud computing information, such as virtualization, Docker, OpenStack, Cloudstack, and data centers, sharing Hadoop, Spark, Nosql/newsql, HBase, Impala, memory calculations, stream computing, Machine learning and intelligent algorithms and other related large data views, providing cloud computing and large data technology, platform, practice and industry information services.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.