"This is the first time that a yarn architecture can run in a Windows environment," says Jim Walker, head of product marketing at Hortonworks. "Running Hadoop on Windows is important to all customers." ”
The Apache Hadoop yarn is the foundation of Hadoop 2.0, released last October. Yarn, as a Hadoop operating system, uses a single data platform for batch processing and transforms it into a multipurpose platform that can be batch, interactive, online, and stream processed at the same time.
New yarn
For data stored on the Hadoop Distributed File System (HDFS), yarn is the primary resource manager and access media that enables organizations to store data in a single location and then interact with it in a variety of ways to maintain the same level of service.
"HDP 2.0 for Windows is a leap because it takes Apache Hadoop functionality to Windows," Hortonworks's product manager Rohit Bakhshi said. Yarn allows users to interact with all data in a variety of ways at the same time, such as leveraging real-time and batching, to make Hadoop truly a multipurpose platform and to have a place in the modern data architecture. ”
He added: "Windows data centers now rely on highly available namenode to detect and recover any hardware, operating system, or JVM failures from the east, and to provide reliable access to data to the left and right HDP processing components." ”
Hortonworks close collaboration with Windows
Hortonworks engineers have been working closely with Microsoft engineers to bring HDP 2.0 to the Windows data Center.
"They are good partners," Walker said. "We really can't find a better partner, they understand the importance of Hadoop in the data center, that's the key to changing the rules of the game and they're helping to make it happen." ”
Whether you're running Linux or windows, Walker says, you can now access the latest and most powerful version of Hadoop.
"It's no different," Walker says, "which boils down to whether you're using Windows or Linux." Now most of them are using Windows, which is a huge benefit for Windows customers. Microsoft wants to implement Hadoop's internal deployment and the availability of cloud computing, and allows you to seamlessly move workloads between the two. The portability here is vital. ”
In addition, HDP 2.0 for Windows is about ensuring that businesses that rely on Excel can now connect to Hadoop 2.0 data sources to drive their business.
"We want to add Hadoop functionality to the world's most powerful business analytics tool," Walker said. "I think Excel is the world's largest analytics tool, plus the functionality of Hadoop, and then the integration of power BI, which will give data analysts, Developers and operating system personnel bring hope. ”