Based on Python, there are several scientific and data analysis libraries, which are very convenient to use. Combined with OpenStack (http://www.openstack.org), RabbitMQ (http://www.rabbitmq.com), celery (HTTP/ www.celeryproject.org) can create an analytics platform for real-time data.
OpenStack is a python-based cloud computing platform that enables the scheduling and management of virtual machines, as well as the virtualized storage of data. RABBITMQ is a message bus server, which supports the data fast receiving and dispatching of the message and the Dispatch and dispatch of the task data, celery implements the parallel worker data analysis model. The combination of three software modules can accomplish the tasks related to virtual machine management, task management, real-time data receiving and sending, and perform data analysis using scientific computing libraries such as NumPy.
Build a python-based streaming data analytics platform