Flume NG and Configuration Practice
Flume, I reposted an article to share with you.
Flume, a real-time log collection system developed by Cloudera ...
Data into an ODPS
I. Introduction Apache Flume is a distributed, reliable and available system, which can efficiently collect, aggregate and ...
E-MapReduce data migration solution - E-MapReduce
incremental upstream data sources such as RDS incremental data and Flume. Network interconnection between ...
Import Kafka data into OSS using E-MapReduce service
article, the example is run directly in the E-MapReduce cluster. This example uses the open-source Flume tool as a transit to ...
A closer look at Hadoop: Hadoop architecture, features of various components and significant for big
)? Hadoop delivers the ability to handle big data at a cheaper cost (the big data volume is usually 10 GB to 100 GB, or ...
Configure DataHub Writer - DataWorks V1.0
DataX-On-Flume collects the buffer data and submits it to the target end in batches when the collected data size reaches ...
Configure DataHub Writer - DataWorks V2.0
N/A maxCommitSize To improve writing efficiency, DataX-On-Flume collects the buffer data ...
Data upload and download tools - MaxCompute
Flume is a distributed and reliable system, which efficiently ...
Big Data - Product Comparison
plug-ins: OGG, Flume, LogStash, Fluentd Data storage File compression store RaidFile mechanism Azure Blob ...
Drilling into Big Data – A Gold Mine of Information
tool to push it into Hadoop. - Apache Flume - A Data Flow used for efficiently collecting, aggregating, and pushing large amounts ...