Introduction to real-time Big Data Processing storm
Author: chszs, reprinted with note. Blog homepage:Http://blog.csdn.net/chszs
New technologies such as social media, e-mails, blogs, GIS, RFID, and smart phones provide new opportunities for service-based information construction.
The common method and framework for processing big data is Apache hadoop, which runs data processing tasks in batches. Now, if you need to process data in real time and display data in real time, this batch-based approach is not very suitable.
Storm is an open-source framework from Twitter, whose goal is to process big data streams in real time. Storm can reliably process unlimited data streams and process hadoop Batch Tasks in real time. Storm is very simple and supports allProgramming LanguageIt is full of fun to use.
Storm has many application scenarios: real-time analysis, online machine learning, continuous computing, distributed RPC, distributed ETL, scalability, and fault tolerance. This ensures that your data is processed, easy to build and manipulate.
For the storm address, see:
Http://storm-project.net/
Then we will continue to deepen our work.