Google Daniel Jeffrey Dean

Source: Internet
Author: User
Tags scale image

I joined Google in mid-1999, and I's currently a Google Fellow in the Systems Group. My areas of interest include large-scale distributed systems, performance monitoring, compression techniques, information Retrieval, application of machine learning to search and other related problems, microprocessor architecture, compiler opt Imizations, and development of the new products, this organize existing information in new and interesting ways. While in Google, I ' ve worked on the following projects:

The design and implementation of the initial version of Google's advertising serving system.
The design and implementation of five generations of our crawling, indexing, and query serving systems, covering two and T Hree orders of magnitude growth in number of documents searched, number of queries handled per second, and frequency of Dates to the system. I recently gave a talk at WSDM ' about some of the issues involved in building large-scale retrieval systems (slides).
The initial development of Google ' s AdSense for Content product (involving both the production serving system design and I Mplementation as OK as work on developing and improving the quality of ad selection based on the contents of pages).
The development of Protocol buffers, a way of encoding structured data in a efficient yet extensible format, and a Compil Er that generates convenient wrappers for manipulating the objects in a variety of languages. Protocol buffers are used extensively at Google for almost all RPC protocols, and for storing structured information in a Variety of persistent storage systems. A version of the protocol buffer implementation has been open-sourced and is available at Http://code.google.com/p/protobu f/.
Some of the initial production serving system work for the "Google News product", working with Krishna Bharat to move the PR Ototype system He put together into a deployed system.
Some aspects of our search ranking algorithms, notably improved handling for dealing with off-page signals such as Anchort Ext.
The design and implementation of the the ' the ' the ' the ' Our automated job scheduling system for managing a cluster of Mac Hines.
The design and implementation of prototyping infrastructure for rapid development and experimentation with new ranking ALG Orithms.
The design and implementation of MapReduce, a system for simplifying the development of large-scale data processing applic Ations. A paper about MapReduce appeared in OSDI ' 04.
The design and implementation of BigTable, a large-scale semi-structured storage system used underneath a number of Google Products. A paper about BigTable appeared in Osdi ' (Best Paper Award).
The design and implementation of spanner, a globally-distributed database system supporting consistent operations and Dyna Mically-configurable cross-datacenter replication of data. The spanner is used as the primary storage system for Google ' s advertising system, among. Apaper about spanner appeared in OSDI ' (Best Paper Award).
The design and implementation of Distbelief, a system for large-scale distributed training of deep neural network. The system has been used for both unsupervised training and supervised training in a variety of image recognition, speech Recognition, natural language modeling and other tasks. A paper about the use of distbelief for training a large-scale image recognition model appeared in ICML ' 12. John Markoff of the New York Times wrote a article about this work in June, 2012.
Some of the production system design for Google Translate, our statistical machine translation system. In particular, I designed and implemented a system for distributed high-speed access to very large language models (too LA Rge to fit in memory on a single machine).
Some internal tools to make it easy to rapidly search our internal source code repository. Many of the ideas from-internal tool were incorporated into our Google Code Search product, including the ability to Use regular expressions for searching large corpora of source code.
I enjoy developing software with great colleagues, and I ' ve been fortunate to have worked with many wonderful and talented People on "All" I work here at Google. To help ensure which Google continues to hire people with excellent technical skills, I ' ve also been fairly involved Engineering hiring process.
I received a Ph.D. Computer Science from the University of Washington, working and Craig Chambers on Whole-program opt Imization Techniques for Object-oriented languages in 1996. I received a b.s., summa cum laude from the University of Minnesota into Computer Science & Economics in 1990. From 1996 to 1999, I worked for Digital equipment Corporation ' s Western the Lab in Palo Alto, where I worked on Low-o Verhead Profiling Tools, design of profiling hardware for Out-of-order microprocessors, and web-based information Retrieva L. From 1990 to 1991, I worked for the world Health organization ' s Global programme on AIDS, developing software to do statis Tical modelling, forecasting, and analysis of the HIV pandemic.

In 2009, I is elected to the national Academy of Engineering, and I is also named a Fellow of the Association for Comput ING machinery (ACM). In, I received the ACM Sigops Mark Weiser Award along with my long-time colleague Sanjay.

Selected Slides from talks:

MIT Big Data Lecture Series, September, 2012:living with big data:challenges and opportunities (joint talk with Sanjay G Hemawat)
Berkeley Amplab Cloud Seminar Talk, March, 2012:achieving Rapid, times in Response Online Services
Stanford Computer Science Department distinguished Computer Scientist Lecture Lecture, November, 2010:building Software S Ystems at Google and lessons learned
Symposium on Cloud Computing (SOCC) keynote, June, 2010:evolution and Future directions of large-scale Storage and comput ation Systems at Google
Web Search and Data Mining Conference (WSDM) Keynote, February, 2009:challenges in building, large-scale information Eval Systems
Google Faculty Summit Talk, July, 2008:some potential for Areas
Stanford CS295 class Lecture, Spring, 2007:software Engineering Advice from building large-scale distributed
Selected Publications:

Spanner:google ' s globally-distributed Database [PDF]
In Proceedings of OSDI, Hollywood, CA, 2012.
James C. Corbett, Jeffrey Dean, Michael Epstein, Andrew fikes, Christopher Frost, JJ Furman, Sanjay Ghemawat, Andrey Gubar EV, Christopher Heiser, Peter Hochschild, Wilson Hsieh, Sebastian Kanthak, Eugene Kogan, Hongyi Li, Alexander Lloyd, Serge Y Melnik, David Mwaura, David Nagle, Sean Quinlan, Rajesh Rao, Lindsay Rolig, Yasushi Saito, Michal Szymaniak, Christopher Taylor, Ruth Wang, and Dale Woodford
Abstract
Mapreduce:simplified Data processing on Large clusters,
Communications of the ACM, Vol. 1 (2008), pp. 107-113
Jeffrey Dean and Sanjay Ghemawat.
Large Language Models in Machine translation
In Proceedings of the 2007 Joint Conference on empirical Methods in Natural Language processing and computational Natural Language Learning (EMNLP-CONLL), pp. 858-867.
Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean.
BIGTABLE:A distributed Storage System for structured Data [PDF]
In Proceedings of OSDI 2006, Seattle, WA, 2006.
Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson c. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber.
Abstract
Mapreduce:simplified Data processing on Large clusters [PDF]
In Proceedings of OSDI, San Francisco, CA, 2004.
Jeffrey Dean and Sanjay Ghemawat
Abstract
Web Search for a planet:the Google Cluster architecture [PDF]
In IEEE Micro, vol. 2, pages 22-28, March, 2003.
Luiz Barroso, Jeffrey Dean, and Urs Hölzle
Abstract
A Comparison of techniques to find mirrored Hosts on the WWW [HTML]
Jasis (Journal of the American Society for Information Science) 51 (12): 1114:1122 (2000). Also presented at 1999 ACM Digital Library Workshop on organizing Web spaces (wows), Berkeley, CA, August 1999.
Krishna Bharat, Andrei Broder, Jeffrey Dean, and Monika R. Henzinger
The Swift Java compiler:design and implementation
Compaq Western Laboratory. 2000/2, April 2000.
Daniel J. Scales, Keith H. Randall, Sanjay Ghemawat, and Jeff Dean.
Hardware Support for Out-of-order instruction Profiling on Alpha 21264a [PPT]
In Proceedings of 11th Hot Chips Symposium (1999), Palo Alto, CA, Aug., 1999.
Jennifer Anderson, Lance Berc, Jeffrey Dean, Sanjay Ghemawat, Shun-tak Leung, Mitch Lichtenberg, George vernes, Mark Vande Voorde, Carl A. Waldspurger, William Weihl, and Jon White
Finding Related Pages in the World Wide Web [HTML]
In Proceedings's eighth World Wide Web Conference (WWW8), Toronto, Canada, May, 1999.
Jeffrey Dean and Monika Henzinger
Transparent, low-overhead Profiling on modern processors [PostScript]
Invited paper in 1998 Workshop on Profile and feedback-directed compilation, Paris, France, October, 1998. Also gave invited talk at the workshop.
Jennifer Anderson, Lance Berc, George Chrysos, Jeffrey Dean, Sanjay Ghemawat, Jamey Hicks, Shun-tak Leung, Mitch lichtenbe RG, Mark Vandevoorde, Carl A. Waldspurger, and William WEIHL
Profileme:hardware Support for Instruction-level Profiling on Out-of-order processors [HTML]
In Proceedings of the 30th Annual Symposium on microarchitecture, Triangle Park, North Carolina, December, 1997.
Jeffrey Dean, Jamey Hicks, Carl Waldspurger, William Weihl, and George Chrysos
Continuous Profiling:where Have All the Cycles Gone?
In Proceedings of 16th Symposium on Operating Systems Principles (1997), St Malo, France, October, 1997. Selected as one of the four best papers at SOSP. An expanded version appears in a special issue of transactions on Computer Systems, vol. 4, pp. 357-390 (novemb Er, 1997).
Jennifer Anderson, Lance Berc, Jeffrey Dean, Sanjay Ghemawat, Monika Henzinger, Shun-tak Leung, Dick Sites, Mark Vandevoor De, Carl Waldspurger, and William WEIHL
Call Graph Construction in object-oriented Languages [HTML]
In Proceedings of 1997 Conference object-oriented programming Languages, Systems, and Applications (OOPSLA '), Atlanta, G A, October, 1997.
David Grove, Greg Defouw, Jeffrey Dean, and Craig chambers
Continuous Profiling (It ' s 10:43; Do you Know Where Your Cycles Are?)
In Proceedings of 9th Hot Chips Symposium (1997), Palo Alto, CA, Aug., 1997.
William Weihl, Jennifer Anderson, Lance Berc, Jeffrey Dean, Sanjay Ghemawat, Monika Henzinger, Shun-tak Leung, Dick Sites, Mark Vandevoorde, and Carl Waldspurger
Whole-program optimization of object-oriented Languages [HTML]
Ph.D dissertation, University of Washington, Dept. of Computer Science and Engineering, November, 1996.
Vortex:an Optimizing Compiler for object-oriented Languages [HTML]
In Proceedings of 1996 conference object-oriented programming Languages, Systems, and Applications (OOPSLA '), San Jose, CA, October, 1996.
Jeffrey Dean, Greg Defouw, David Grove, Vassily Litvinov, and Craig chambers
Expressive, efficient Instance Variables [HTML]
University of Washington Technical, February 1996.
Jeffrey Dean, David Grove, Craig Chambers, and Vassily Litvinov
Optimization of object-oriented Programs Using Static Class hierarchy analysis [HTML]
In Proceedings of 1995 European Conference on Object-oriented Programming (Ecoop '), Aarhus, Denmark, August, 1995.
Jeffrey Dean, David Grove, and Craig chambers
A Framework for selective recompilation in the Presence of Complex intermodule dependencies [HTML]
In Proceedings of the Seventeenth International Conference on Software Engineering (ICSE), Seattle, WA, April, 1995.
Craig Chambers, Jeffrey Dean, and David Grove
profile-guided Receiver Class prediction [HTML]
In Proceedings of 1996 Conference of Object-oriented programming Languages, Systems, and Applications (OOPSLA '), Austin, TX, October, 1995.
David Grove, Jeffrey Dean, Charlie Garrett, and Craig chambers
Selective specialization for object-oriented Languages [HTML]
In Proceedings of 1995 Conference on programming Language Design and implementation (Pldi '), June, 1995.
Jeffrey Dean, Craig Chambers, and David Grove
Identifying profitable specialization in object-oriented Languages [HTML]
In Proceedings of the 1994 Workshop on Partial Evaluation and semantics-based program manipulation (PEPM '), Orlando, FL, June, 1994.
Jeffrey Dean, Craig Chambers, and David Grove
Towards Better inlining decisions Using inlining trials [HTML]
In Proceedings of the 1994 Conference on Lisp and functional Programming (L&FP '), Orlando, FL, June, 1994.
Jeffrey Dean and Craig Chambers
EPI info:a general-purpose microcomputer program for public Health information Systems
In American Journal of Preventive Medicine, vol. 7, pp. 178-182, 1991.
Andrew Dean, Jeffrey Dean, Anthony Burton, and Richard Dicker
Software for Data Management and analysis in epidemiology
In Journal's the World Health Forum, vol. 1, 1990.
Anthony Burton, Jeffrey Dean, and Andrew Dean.
Personal:

I ' ve lived in lots of places in my Life:honolulu, HI; Manila, the Phillipines; Boston, MA; West Nile District, Uganda; Boston (again); Little Rock, AR; Hawaii (again); Minneapolis, MN; Mogadishu, Somalia; Atlanta, GA; Minneapolis (again); Geneva, Switzerland; Seattle, WA; and (currently) Palo Alto, CA. I ' m hard-pressed to pick a favorite, Though:each Place has its plusses and minuses.
One of my life goals are to play soccer and basketball on every continent. So far, I ' ve do so in the North America, South America, Europe, Asia, and Africa. I ' m worried that Antarctica might is tough, though.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.