Highly recommended! Large collection of top-level open source tools in large data areas
Source: Internet
Author: User
KeywordsLarge data open source tools strong open source
With the maturity of large data and predictive analysis, the advantage of open source as the biggest contributor to the underlying technology licensing solution is becoming more and more obvious.
Now, from small start-ups to industry giants, vendors of all sizes are using open source to handle large data and run predictive analytics. With the help of open source and cloud computing technology, startups can even compete with big vendors in many ways.
Here are some of the top open source tools for large data, grouped into four areas: data storage, development platforms, development tools, and integration, analysis, and reporting tools.
Apache Hadoop Platform –impala (open source large data analysis engine); Lingual (ANSI SQL); Pattern (analytics); Cascading (Open source large Data application development framework)
Apache Lucene and SOLR platforms
OpenStack (Building private cloud and public cloud)
Red Hat (standard Linux distribution with Hadoop server)
REEF (Microsoft's Hadoop developer platform)
Storm (integrated with various queuing systems and database systems)
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.