Hadoop

Want to know hadoop? we have a huge selection of hadoop information on alibabacloud.com

Open source cluster computing environment Apache Spark

The Apache Spark abbreviation Spark,spark is an open source cluster computing environment similar to Hadoop, but there are some differences between them, and these useful differences make Spark more advantageous in some workloads, in other words, Spark   With the memory distribution dataset enabled, it can optimize the iteration workload in addition to providing interactive queries. The Apache Spark is implemented in the Scala language, and it uses Scala as its application ...

HP 50 million dollar strategic investment Hadoop Big data platform

Absrtact: HP announced a strategic partnership with the big data platform and injected 50 million dollars into the Hortonworks, and HP CTO Martin Fink will join the board. Just this March, Hortonworks received $100 million in D-round funding from Blackstone, Yahoo, B-P announced a strategic partnership with the big data platform, and injected 50 million dollars into the Hortonworks, and HP CTO Martin Fink will join the board. Just this year 3 ...

New trends in large data

A New Trend for the big Data Peng Qin, Bin Dai, Benxiong Huang and Guan Xu to concurrently process large-scale Data, MapReduce with an O Pen Source Implementation named Hadoop is proposed. In practical ...

HP 50 million dollar strategic investment large data platform Hortonworks

Beijing Time July 25 news, according to Science and technology blog Recode reported that HP is about to announce the Big Data platform Hortonworks strategic investment of 50 million U.S. dollars. Hortonworks is a Hadoop start-up that was spun out of Yahoo in 2011. As part of the deal, HP executive vice president and chief Technology Officer Martin Frink (Martin Fink) will join the Hortonworks board. Hortonworks has previously been awarded by the private equity firm Blackstone Group and Pa ...

HP 50 million dollar strategic investment Hadoop Big data platform

Absrtact: HP announced a strategic partnership with the big data platform and injected 50 million dollars into the Hortonworks, and HP CTO Martin Fink will join the board. Just this March, Hortonworks received $100 million in D-round funding from Blackstone, Yahoo, B-P announced a strategic partnership with the big data platform, and injected 50 million dollars into the Hortonworks, and HP CTO Martin Fink will join the board. Just this year 3 ...

Research and implementation of clustering and convex package algorithm under MapReduce framework

Research and realization of clustering and convex-package algorithm in MapReduce framework Chengdu University of Technology Zhaoju first, this paper makes a research on the generation and value growth of large data, and explains the necessity of improving the execution efficiency of the data mining algorithm, and introduces the technology and tools that support the large-data processing now. Then the paper studies the running mechanism of Hadoop file system, the stored procedure and the programming model of MapReduce framework, and the operation principle. Secondly, in a certain size of Hadoop cluster on the data distributed processing, so as to assess the whole cluster of sex ...

A survey of large data technology

A review of large data technology Zhihui the generation of Zhangquan data brings new challenges to the massive information processing technology. In order to understand the connotation of large data in a more comprehensive way, this paper elaborates from three aspects, such as the concept characteristic of large data, the general processing process and the key technology. The background of large data is analyzed, and the basic concept of large data, Typical 4 "V" features as well as the focus of application areas, summed up the general process of large data processing, for the key technologies, such as MapReduce, GFS, BigTable, Hadoop and data visualization, ...

Mobile communications are moving from the voice age to the data age

Absrtact: Wang, director of mobile Internet Products Division, Unicom Research Institute, November 22, Wang, director of mobile Internet products division at China's Hadoop Summit Technology summit, said the innovation based on Hadoop enabled Unicom to respond to complaints from consumers China Unicom Research Institute Mobile Internet Products Division director Wang November 22, China Unicom Research Institute of Mobile Internet Products Division director Wang at China HADOOP Summit Technology Summit, said, based on Hado ...

Implementation and performance of Hadoop Reference design: Introduction to third party products

British Industry Tatsu server products K800 (ROMLEY-EP) is a ROMLEY-EP platform based on standard 2U server, high http://www.aliyun.com/zixun/aggregation/17968.html "> Memory capacity, High network speed, a variety of SATA expansion configuration, support onboard dual gigabit + dual Gigabit optional configuration to meet the diverse needs of customers. Maximum support 16 memory, capacity up to 512GB, easy to meet customer high ...

Research on the recommended algorithm of limited Boltzmann machine based on cloud computing

Research on the recommended algorithm of limited Boltzmann machine based on cloud Zhengzhi Yun Li Buyuan The exponential growth of the blunt data of Li Lun and the complexity of the algorithm itself make the limited Boltzmann machine face the problem of computational efficiency. Based on the detailed analysis of the restricted Boltzmann machine, the proposed algorithm of limited Boltzmann machine based on cloud platform is put forward by combining the parallel computing architecture of the limited Boltzmann machine and the Hadoop platform. The algorithm solves the problem of data relativity by copying mechanism, and decomposes the traditional limited Boltzmann process into several hadoo ...

The technique of cable-bell filter based on parallel programming

Cable-Bell filter technology based on parallel programming computation Xu Changlong Wang Smart Shuo Hua with the increase of the data volume of remote sensing image, the computation time of the edge filtering operation in a single environment is also greatly increased. According to the characteristics of remote sensing data, combined with MapReduce parallel distributed computing model, this paper proposes a method of migrating this operation into Hadoop cluster environment to complete the Bayes filtering operation of massive image data. The experimental results show that the cluster operation can shorten the computation time, and the calculation time will decrease with the increase of cluster node number. ...

More big data is not in the cloud

Amazon's chief technology officer, Vonna Vogle, has opened a topic about using cloud computing to complete big data, what do you want him to do? This view is compelling, including the need for large data analysis, especially for real-time analysis. Companies want to have that ability, for Vogel, which means they need the public cloud--especially the Amazon's public cloud. Vogel also said that we all hope that infrastructure like Hadoop will be able to hide behind an analysis layer like the Amazon redshift. Vogel right now.

Cloudera acquisition of large data encryption start-ups Gazzang

Absrtact: Hadoop vendor Cloudera has just acquired a start-up Gazzang that specializes in cryptography for next-generation data storage environments, but details of the deal are not disclosed. This is Cloudera's first major acquisition. Gazzang was founded in 2010 and is headquartered in Austin Hadoop supplier Cloudera has just acquired a start-up Gazzang that specializes in cryptography for next-generation data storage environments, but details of the deal are not disclosed. This is Cloudera's first stroke ...

Implementation and performance of the Hadoop reference design: The realization of British business and gigabyte

561.html "> Reference design Implementation name Node/second name Node specification: datanode/http://www.aliyun.com/zixun/aggregation/17034.html" >tasktracker Specification: Cabinet specification: Gigabyte reference design Implementation name Node/second name Nod ...

Research on advertising recommendation method based on classification model

Research on advertising recommendation method based on classification model the main work of Zhe thesis of Beijing Jiaotong University is as follows. First, we implemented a visual statistical and analytical tool for advertising log data provided by an Internet company, using the Hadoop platform to analyze the data and discover the dependencies between features and advertising. Secondly, an improved method of using the single tag classification model based on the non advertising feature and the advertising feature dependency is proposed, which utilizes the mutual information to select the combination feature to join the dependency relationship between the features. Third, put forward ...

Design of K-prototypes algorithm based on distributed

Based on distributed k-prototypes algorithm design Li Clustering algorithm has been widely used in many fields, for most datasets, the properties of which are not exactly numerical type, which brings difficulty to clustering. The emergence of K-prototypes algorithm solves the difficulty of hybrid attribute clustering, but its calculation is tedious and brings a lot of difficulties to programmers. The emergence of the Hadoop distributed system provides the possibility for compiling the parallel k-prototypes algorithm, which can improve the program parallelism and greatly improve the program ...

Analysis of Hadoop1.0 and 2.0 design principles

Brief analysis of Hadoop1.0 and 2.0 design principles Yao Wei Horse and good introduction of the history of Hadoop and its version evolution process, elaborated Hadoop 1. The HDFs design concept, architecture, read/write Data flow and MapReduce architecture, task execution process, and HADOOP1 in 0. 0 insufficiency problem; 0 insufficiency problem, Hadoop2. 0 enhancements to the solution, including Namenode HA program, H ...

On average 24 times times faster than Hive, Impala Sword refers to Stinger

Before yarn, Hadoop was only available for offline processing scenarios. Based on real-time demand, organizations have developed their own streaming framework, this time we are talking about two sql-on-hadoop projects, as well as two well-known Hadoop solution Providers--impala vs. Stinger. Singer:stinger first appeared in Hive 0.11 (HDP 1.3), with a total of 3 phase goals, of which phase I and II had been delivered. Through the hortonwo ...

Hstreaming invested in millions of dollars to create a real time Hadoop system

Hstreaming start-up, headquartered in San Francisco, has recently received its first venture investment, the Atlas venture, which has invested 1 million of billions of dollars to build a real-time Hadoop system. The company, which has only three people, has a history of two years. If you ask a person in Hadoop about how to use Hadoop to go beyond the current batch platform, the main answer is "real time." In fact, next month "Structure:da ...

Spark Tutorial - Building a Spark Cluster - Configuring Hadoop Standalone Mode and Running Wordcount (2)

Previous: http://www.aliyun.com/zixun/aggregation/13383.html "> Spark Tutorial - Building a Spark Cluster - Configuring Hadoop Standalone Mode and Running Wordcount (1) 2. Installing rsync Our version of Ubuntu 12.10 Rsync installed by default, we can install or update rsy through the following command ...

Total Pages: 9 1 .... 5 6 7 8 9 Go to: Go

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.