Data Processing

Read about data processing, The latest news, videos, and discussion topics about data processing from alibabacloud.com

Poliqarp 1.3.11 Publishing Corpus tool

Poliqarp is a large corpus tool for processing. It includes a binary corpus for efficient search of compilation and corpus construction for word retrieval, support for locating tagsets, ambiguity in text and Unicode. The Poliqarp 1.3.11 version fixes a bug in Meta http://www.aliyun.com/zixun/aggregation/14345.html "> Data processing code to avoid a certain corpus corruption." ...

A survey of large data technology

A review of large data technology Zhihui the generation of Zhangquan data brings new challenges to the massive information processing technology. In order to understand the connotation of large data in a more comprehensive way, this paper elaborates from three aspects, such as the concept characteristic of large data, the general processing process and the key technology. The background of large data is analyzed, and the basic concept of large data, Typical 4 "V" features as well as the focus of application areas, summed up the general process of large data processing, for the key technologies, such as MapReduce, GFS, BigTable, Hadoop and data visualization, ...

XML function library: Xml_get_current_column_number

Xml_get_current_column_number learned the first few fields that are currently resolved. Syntax: int xml_get_current_column_number (int parser); Return value: integer function type: Data processing &http://www.aliyun.com/zixun/aggregation/37954.html ">nbsp; content description This function is used to obtain the current ...

Coopy 0.6.3 publishes distributed data processing tools

Coopy is a tool for dealing with distributed data. It supports multiple formats for comparison, patching, merging, and table http://www.aliyun.com/zixun/aggregation/9591.html "> versioning, typically supported in a format that includes CSV, Excel, Ysql, Sqlite Wait a minute. Coopy 0.6.3 This version supports the comparison/patching/merging features of GUI and Socialcalc tables, and Fixes tab-delimited and semicolon-delimited ...

Research on the effect of compression on Hadoop performance

Research on the effect of compression on Hadoop performance Shanglihui, Miuli compression is an important method of I/O tuning, which reduces the I/O calculation load, thereby improving I/O performance. Today, disk I/O can never grow faster than the CPU speed of Moore's law, so I/O is often a bottleneck in data processing. In Hadoop, how to use compression for I/O tuning has not been fully studied. Through experiments, this paper draws a compression strategy to help users of Hadoop to determine when and where ...

Wave K Relocation Project, officially launched in Beijing yesterday

To break the host foreign monopolies, safeguard the national information security of the Wave K relocation project, yesterday in Beijing officially launched. The "K Relocation Project" will fully accelerate shuttle K1 to replace the IOH (IBM, Oracle and HP) minicomputer process.   This is the first time for domestic manufacturers in the industrial core area to occupy the monopoly position of the IOH to launch a challenge. Mainframe is a high-end server dedicated to large-scale transaction data processing, and it is the core equipment of national information security. China's mainframe is faced with foreign technology monopolies and international manufacturers of market monopolies, long-term dependence on imports, this becomes our information Ann ...

XML function library: Xml_get_current_byte_index

Xml_get_current_byte_index has now been resolved to the first few bit groups. Syntax: int xml_get_current_column_number (int parser); Return value: integer function type: Data processing &http://www.aliyun.com/zixun/aggregation/37954.html ">nbsp; content description This function is used to obtain the current X ...

HPCC 3.4.2 Publishing Parallel processing computing platform

HPCC is a high configured Computing cluster abbreviation, namely High-performance computing cluster, is a huge parallel processing computing platform to solve the problem of large data processing. It uses large-scale parallel http://www.aliyun.com/zixun/aggregation/20795.html "> Processing technology to store and process large amounts of data, processing hundreds of millions of records per second." A large number of data across different data sources can be accessed, analyzed, and ...

With the flower shun more than 100 million super raise fund construction Data project

Flush (300033) January 31 Evening announcement, the company plans to exceed the funds in the 111 million yuan for the same flower shun data processing base phase one project. The total investment of the project is 153 million yuan, the company intends to set up a wholly-owned subsidiary in Hangzhou Yuhang as a Project REAL (blog) application of the main body.

Hadoop raises big data revolution three giants Qi exerting force

Introduction: Open source data processing platform with its low-cost, high scalability and flexibility of the advantages has won the majority of network Giants recognized. Now Hadoop will go into more business. IBM will launch a DB2 flagship database management system with built-in NoSQL technology next year. Oracle and Microsoft also disclosed last month that they plan to release a Hadoop-based product next year. Two companies are planning to provide assistance with deployment services and enterprise-level support. Oracle has pledged to preinstall Hadoop software in large data devices. Big Data Revolution ...

Research and practice on large data application of telecom operators

Research and practice of large data application of telecom operators China Mobile Research Institute--The large Data development survey--the demand of the operator's data processing--the research and practice of China Mobile in large data processing

Hadoop raises big data revolution three giants Qi exerting force

Introduction: Open source data processing platform with its low-cost, high scalability and flexibility of the advantages has won the majority of network Giants recognized. Now Hadoop will go into more business. IBM will launch a DB2 flagship database management system with built-in NoSQL technology next year. Oracle and Microsoft also disclosed last month that they plan to release a Hadoop-based product next year. Two companies are planning to provide assistance with deployment services and enterprise-level support. Oracle has pledged to preinstall Hadoop software in large data devices. Large Data Leather ...

HPCC 3.4 Publishing Parallel processing computing platform

HPCC is a high configured Computing cluster abbreviation, namely High-performance computing cluster, is a huge parallel processing computing platform to solve the problem of large data processing. Large-scale parallel http://www.aliyun.com/zixun/aggregation/20795.html "> Processing technology for storing and processing large amounts of data, processing hundreds of millions of records per second. A large number of data across different data sources can be accessed, analyzed, and in seconds ...

Design and implementation of multi-star table storage and cross identification based on Hadoop

Design and realization of multi-star table storage and cross-identification based on Hadoop the Zhang Xiaxu of Shandong University is particularly important in the face of astronomical data, how to efficiently store and cross-identification of multiple-star table. Massive astronomical data processing must use the large data processing technology such as distributed, parallel computation to be able to solve effectively. In this paper, the use of Hadoop to deal with astronomical data is studied, the main work is divided into the following three parts: 1. Using the HBase component of Hadoop, the effective storage of different star table data is constructed, and the efficiency of cluster disk utilization and the query of Star table information is improved. ...

Application of large data technology in environmental information

Application of large data technology in environmental information Li Anzan Wang Wang Zhangquan Chu Yan in the project of "Integrated demonstration of water environment management technology in Liaohe River basin", with the accumulation of time, Environmental monitoring data processing system has been collecting more and more data. However, the data processing system of environmental monitoring in Liaoning Province can not effectively deal with the increasing amount of data. The data center of Environmental monitoring data processing system is improved by using large data technology. Using HDFS powerful data storage and management functions, In response to the increase in data volume, using MapReduce and Hado ...

Moving large data processing to the cloud

Absrtact: Large data start-up sisence 30 million U.S. dollars in Thursday to obtain a C-round financing, DFJ substituting, the investment party has battery Ventures, Genesis and Opus. Sisence last year has received 10 million U.S. dollars of financing, cumulative financing has reached 50 million U.S. large data start-ups sisence in Thursday to obtain 30 million U.S. dollars in C-round financing, the DFJ substituting, the cast side has bat ...

MapReduce programming combat

What MapReduce is? MapReduce is a programming model for Hadoop (this large http://www.aliyun.com/zixun/aggregation/14345.html "> Data Processing Environment). Since it is called a model, it means it has a fixed Form MapReduce programming model, Hadoop ecological environment for data analysis and processing of fixed programming. This fixed programming form is described as follows: ...

HPCC 3.2.0.2 Publishing Parallel processing computing platform

HPCC is a high configured Computing cluster abbreviation, namely High-performance computing cluster, is a huge parallel processing computing platform to solve the problem of large data processing. Large-scale parallel http://www.aliyun.com/zixun/aggregation/20795.html "> Processing technology for storing and processing large amounts of data, processing hundreds of millions of records per second. A large number of data across different data sources can be accessed, analyzed, and ...

Design and implementation of a large amount of small XML data file processing technology based on Hadoop

Design and implementation of a large amount of small XML data file processing technology based on Hadoop University Kong Xin This paper focuses on the following: 1 a distributed mass of small XML data processing system (distributed Massive smallxml files SYSTEM,DMSX), the main idea of the system is to use a large number of small data XML files in the Hadoop system for efficient processing. 2 The system through the use of producer-elimination ...

The role of site and domain

Absrtact: SEO, search engine optimization, as the name implies, must understand the search engine some key factors to our today's topic for further analysis, as a 2 years seoer, to my personal understanding of the content and related information, search SEO, search engine optimization, as the name suggests, Must understand the search engine some key factor to be possible to our today's topic carries on the further analysis, as a 2 year seoer, with my personally to this part content understanding and the related information review, the search engine is a huge data processing system, synchronizes ...

Total Pages: 9 1 .... 3 4 5 6 7 .... 9 Go to: Go

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.