Batch Processing and real-time processing

Source: Internet
Author: User

========================================================== ======================================

Scenario:

Report task: Not a real-time report, data needs to be processed

Recommendation System: requires real-time

Currently, I do not have much knowledge about real-time systems and have little development scenarios. Therefore, I only want to learn

========================================================== ======================================

Distributed System: one system is deployed in multiple sets, multiple storage sets, multiple caches, and multiple databases (RDBMS, nosql )----The cluster system has a large load.

Non-distributed system: it is not just a server, a database, or a cache server

For the high availability of the system, you can use the database system design ideas to make it have a transaction mechanism (Retry Mechanism, commit mechanism, skip mechanism, recovery mechanism)

MANAGER: allocates and schedules resources. It is an idea used to simulate and display the world in software design (tracker, mannager, lanuch, worker, and context)

========================================================== ======================================

Batch Processing: collects a certain amount of business data for unified processing. For example, report data, and monthly statistics on the data for the current month --Scheduled task or batch processing framework (Spring Batch)Hadoop

Real-time processing: Once data is generated, it can be processed.Message Queue (listener and subscription)Storm

Note: These are applicable to scenarios with extremely large data volumes.

However, distributed processing may be a little troublesome, so some frameworks have emerged: hadoop, storm, etc.

Generally, real-time processing uses the event mechanism (CEP)

========================================================== ======================================

Hadoop: a popular term, a distributed Batch Processing Framework

Storm: A recent real-time processing framework

========================================================== ======================================

The thought of concatenating and transforming or splitting:

Split asynchronous execution and merge serial execution

 

========================================================== ======================================

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.