Large collection of top-level open source tools in the Big Data world

Source: Internet
Author: User

today, from small startups to industry giants, vendors of all sizes are using open source to handle big data and run predictive analytics. This article describes some of the top open source tools for big data, divided into four areas: data storage, development platform, development tools and integration, analysis and reporting tools.

As big data and predictive analytics mature, the benefits of open source as the largest contributor to the underlying technology licensing solution are becoming more pronounced.

Today, from small startups to industry giants, vendors of all sizes are using open source to handle big data and run predictive analytics. With open source and cloud computing technology, startups can even compete with big players in many ways.

Here are some of the top open source tools for big data, divided into four areas: data storage, development platform, development tools, and integration, analysis, and reporting tools .

Data storage:
    • Apache Hadoop–cloud Foundry (VMware), Hortonworks, hadapt
    • NOSQL database –mongodb, Cassandra, Hbase
    • SQL database –mysql (Oracle), MariaDB, PostgreSQL, Tokudb
Development Platform:
    • Apache Hadoop Platform –impala (open source Big data analytics engine); Lingual (ANSI SQL); Pattern (analytics); Cascading (open source Big Data application Development framework)
    • Apache Lucene and SOLR platforms
    • OpenStack (Building private and public clouds)
    • Red Hat (standard Linux distribution with Hadoop server)
    • REEF (Microsoft's Hadoop developer platform)
    • Storm (integrated with various queueing systems and database systems)
Development tools and integrations:
    • Apache Mahout (machine learning programming language)
    • Python and R (predictive analytics programming language)
Analysis and reporting tools:
    • Jaspersoft (Reporting and Analysis Server)
    • Pentaho (data integration and Business Analytics)
    • Splunk (It analytics platform)
    • Talend (Big Data integration, data management and application integration)

The above is the big data we summed up a good tool, we hope to help you.

English Original: Blackducksoftware

Large collection of top-level open source tools in the Big Data world

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.