Corporate Giants Embrace Hadoop: Data Is King
Source: Internet
Author: User
According to a survey conducted by TDWI, 34% of companies now make decisions through big data analysis. Amazon, Cloudera and IBM have all released Hadoop-as-a-service products, and a similar product from Microsoft will be available next year. This shows that big data and Hadoop are gaining momentum and will only become more important in the future.
As early as 2009, Amazon launched AWS Elastic MapReduce, which supports running Apache Hadoop on EC2 and S3. The service provides the most basic hardware and software required for big data analysis.
Some time later, Cloudera released CDH3. It builds on Amazon's MapReduce service and ships as a tuned Hadoop AMI. Because CDH3 integrates a good deal of additional software, it can be used to handle Hadoop tasks.
So far, the crown for the most mature solution goes to IBM. IBM launched InfoSphere BigInsights, software based on Watson technology that can run Hadoop on SmartCloud Enterprise. It is not just a platform for running big data jobs; it also provides the ability to analyze the data, which is one of the most complex parts of the process. It includes the following open source projects:
JAQL: An advanced query language based on JavaScript Object Notation (JSON), which also supports SQL.
Hive: A data warehouse infrastructure that supports bulk querying and analysis of Hadoop files.
HBase: A column-oriented data store that supports large sparse tables in Hadoop.
Flume: A facility that collects data and loads it into Hadoop.
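To make the idea of a "large sparse table" more concrete, here is a minimal Python sketch. It is only an illustration, not HBase's actual API or storage format: the `SparseTable` class, its method names, and the `family:qualifier` column naming are all assumptions made for this example. The point it shows is that a sparse table stores only the cells that actually hold a value, so rows with many empty columns cost nothing for the missing ones.

```python
from collections import defaultdict


class SparseTable:
    """Toy sparse table (hypothetical, not the HBase API).

    Only cells that were explicitly written are stored.
    """

    def __init__(self):
        # row key -> {column name -> value}
        self._rows = defaultdict(dict)

    def put(self, row, column, value):
        """Store a single cell."""
        self._rows[row][column] = value

    def get(self, row, column, default=None):
        """Read a single cell; missing cells return the default."""
        return self._rows.get(row, {}).get(column, default)

    def cell_count(self):
        """Number of cells actually stored."""
        return sum(len(columns) for columns in self._rows.values())


table = SparseTable()
table.put("user1", "info:name", "Ada")
table.put("user1", "info:email", "ada@example.com")
table.put("user2", "info:name", "Lin")  # user2 has no email cell at all
```

In this sketch, reading `table.get("user2", "info:email")` simply falls back to the default because no such cell was ever stored, and `table.cell_count()` counts only the three cells that were written.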
Recently, at PASS Summit 2011, software giant Microsoft also opened its arms to Hadoop. Microsoft announced that it will collaborate with Hortonworks, a company spun out of Yahoo, to bring Apache Hadoop to the Windows Server and Windows Azure platforms, and that it intends to integrate open source Apache Hadoop with SQL Server 2012 to provide big data processing capabilities. To make Apache Hadoop a compelling platform for storing and processing data, Microsoft plans to offer an Azure-based Hadoop service at the end of 2011 and a Windows-based distribution some time in 2012.
EMC, together with its partners Intel, Mellanox Technologies, Micron, Seagate, Supermicro, Switch, and VMware, has launched another solution, the EMC Greenplum Analytics Workbench. It provides a platform with more than 10,000 virtual nodes and petabyte-scale storage capacity, and is used primarily to test Hadoop.
At the IOD 2011 conference in Las Vegas, Datameer demonstrated DAS, its product built on the Hadoop platform. "Many companies have introduced Hadoop-based products, but they all need a connector. With our solution, users can directly experience the efficiency and convenience of the Hadoop architecture," said Yuet, Datameer's director of business development.
In short, the big players are investing actively in Hadoop, and it looks set to become ubiquitous. With Hadoop as the benchmark, large-scale data processing (big data processing) technology is maturing, driving enterprises to shift from "business is king" to "data is king."