More than 20 big data tools commonly used by Java to Adult College data people

Source: Internet
Author: User
Tags amazon dynamodb

Recently I asked a lot of Java developers about what big data tools they used in the last 12 months.

This is a series of topics for:

    • Language
    • Web Framework
    • Application Server
    • SQL data Access Tool
    • SQL database
    • Big Data
    • Build tools
    • Cloud Provider

Today we are going to talk about big data. According to Wikipedia, big data is a generalized term for datasets, and the dataset is so large and complex that traditional data processing applications are not competent.

In many cases, it is sufficient to use SQL database to store/retrieve data. But in other cases, either the SQL database is not large enough or there are better tools. It all depends on the usage.

Now let's talk about the different non-SQL tools for storing/processing data--nosql database, memory cache, full-text search engine, real-time streaming, graph database, and so on.

mongodb--a popular, cross-platform, document-oriented database.

elasticsearch--is a distributed RESTful search engine built for the cloud.

cassandra--, an open source distributed database management system originally developed by Facebook, is designed to handle large amounts of data spanning multiple commercial servers, providing high availability without a single point of failure.

redis--is an open source (BSD license), memory data structure store that is used as a database, cache, and message broker.

hazelcast--Java-based open source memory data grid.

ehcache--is a widely used open source Java distributed cache for general purpose caches, Java EE, and lightweight containers. Ehcache Related Introduction

hadoop--is an open-source software framework written in Java for distributed storage and for distributed processing of very large datasets on computer clusters.

solr--an open-source enterprise search platform, written in Java, from the Apache Lucene project.

Spark--apache is the most active project in the Software Foundation, an open-source cluster computing framework.

memcached--a general-purpose distributed memory caching system.

Apache hive--provides a SQL-like layer on top of Hadoop.

Apache kafka--, a high-throughput, distributed publish-subscribe messaging system, was originally developed on LinkedIn. Windows off Cygwin run Apache Kafka

akka--a toolkit and runtime for building highly parallel, distributed, resilient message-driven applications on the JVM.

hbase--an open-source, non-relational, distributed database, modeled after Google's BigTable, written in Java, and running on HDFs.

neo4j--Open Source graphics database implemented in Java.

couchbase--is an open source, document-oriented, distributed NoSQL database optimized for interactive applications.

Apache storm--Open Source distributed real-time computing system.

couchdb--is a document-oriented, open-source NoSQL database that uses JSON to store data.

Oracle coherence--A memory-based data grid solution that enables organizations to scale mission-critical applications predictably by providing channels for fast access to frequently used data.

titan--an extensible graphics database that is optimized for storing and querying graphs with hundreds of millions of vertices and edges distributed across multiple machine clusters.

Amazon dynamodb--is a fast, flexible, fully managed NoSQL database service for all applications that require consistent, single-digit millisecond latency at any scale.

Amazon kinesis--is a real-time platform for streaming data on AWS.

datomic--a fully transactional, cloud-enabled, distributed database written in Clojure.

There is a word called three people will have my teacher, in fact, as a developer, there is a learning atmosphere
It's especially important to talk to a communication circle. This is my big data exchange Learning Group 531629188
Whether you are small white or Daniel Welcome to settle in, are looking for a job can also join
, we exchange study together, the words are not rough, learn from each other, common progress, together refueling it.

More than 20 big data tools commonly used by Java to Adult College data people

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.