today, from small startups to industry giants, vendors of all sizes are using open source to handle big data and run predictive analytics. This article describes some of the top open source tools for big data, divided into four areas: data storage, development platform, development tools and integration, analysis and reporting tools.
As big data and predictive analytics mature, the benefits of open source as the largest contributor to the underlying technology licensing solution are becoming more pronounced.
Today, from small startups to industry giants, vendors of all sizes are using open source to handle big data and run predictive analytics. With open source and cloud computing technology, startups can even compete with big players in many ways.
Here are some of the top open source tools for big data, divided into four areas: data storage, development platform, development tools, and integration, analysis, and reporting tools .
Data storage:
- Apache Hadoop–cloud Foundry (VMware), Hortonworks, hadapt
- NOSQL database –mongodb, Cassandra, Hbase
- SQL database –mysql (Oracle), MariaDB, PostgreSQL, Tokudb
Development Platform:
- Apache Hadoop Platform –impala (open source Big data analytics engine); Lingual (ANSI SQL); Pattern (analytics); Cascading (open source Big Data application Development framework)
- Apache Lucene and SOLR platforms
- OpenStack (Building private and public clouds)
- Red Hat (standard Linux distribution with Hadoop server)
- REEF (Microsoft's Hadoop developer platform)
- Storm (integrated with various queueing systems and database systems)
Development tools and integrations:
- Apache Mahout (machine learning programming language)
- Python and R (predictive analytics programming language)
Analysis and reporting tools:
- Jaspersoft (Reporting and Analysis Server)
- Pentaho (data integration and Business Analytics)
- Splunk (It analytics platform)
- Talend (Big Data integration, data management and application integration)
The above is the big data we summed up a good tool, we hope to help you.
English Original: Blackducksoftware
Large collection of top-level open source tools in the Big Data world