VMware's open source Serengeti project has focused on making Hadoop workloads run well on virtualized platforms. VMware is now about to release a commercial version of Serengeti, called Big Data Extensions.
Big Data Extensions ships as a vSphere plugin. Administrators can deploy, monitor, and manage Hadoop clusters directly from vCenter. The product is intended to improve the performance of Hadoop, which has become a popular open source platform for big data analysis.
"Hadoop is becoming a regular application running on vSphere, just like any other workload," said Fausto Ibarra, senior director of products at VMware.
Until now, Hadoop and other big data platforms have typically required dedicated hardware, which is too costly for small and medium-sized enterprises and raises reliability concerns. VMware launched the Serengeti project last year to address these issues, and its commercial version, Big Data Extensions, will support running Hadoop inside the enterprise.
Ibarra also said VMware has a dedicated team contributing code to the Hadoop community. The team optimizes Hadoop's data distribution algorithms so that Hadoop runs better on virtualized platforms. VMware has also been working with Hadoop distribution vendors to work out best practices for virtualization.
Big Data Extensions currently supports the following Hadoop distributions:
Apache Hadoop 1.2
Cloudera 3 Update 6
Cloudera 4.2
Hortonworks Data Platform 1.3
MapR 2.1.3
Pivotal HD 1.0
Big Data Extensions will be released before the end of the year. VMware also announced that Pivotal HD, the Hadoop distribution from its parent company EMC, has already been certified by VMware.
Reprint Source: http://itknowledgeexchange.techtarget.com/server-virtualization/vmware-adds-hadoop-support-to-vsphere/
Original Author: Colin Steele