Apache Gora Introduction

Source: Internet
Author: User
Tags cassandra

Introduced

Gora is an open-source ORM framework that provides the memory data model and the persistence of data for big data. Currently Gora supports the storage of column data, key-value data, document data and RDBMS data, and also supports the use of Apache Hadoop to analyze Big data

Characteristics   

Although there are many good ORM frameworks for relational databases, the data model-based framework such as JDO still has some shortcomings, such as storage and persistence of the column data model. Gora makes up for this problem by making it easy for users to model and persist large data at the time of memory, and to support Hadoop for analyzing big data.

Gora is a representation and persistence framework for big data, which has the following characteristics

    1. Data persistence: It can persist column data, such as hbase,cassandra,hypertable; Key-value data for persistence, such as Voldermort,redis, SQL database persistence, such as MYSQL,HSQLDB, can also be HDFs storage of files.

    2. Data access: Easy access to data using the Java API

    3. Indexes: You can persist objects to lucene or SOLR, and you can use the Gora API to query

    4. Analysis: You can use Apache pig,hive,cascading to analyze your data

    5. Mr Support: native support for Hadoop's Mr Framework, which has been used on Nutch 2.0.

    6. Support for data persistence, indexing and analysis using pig,lucene,hive

For more information, please refer to: http://gora.apache.org

Disadvantages

Currently, in addition to NUTCH2, the use of Gora in other open source products seems to have not seen

The NUTCH2 series extends the storage layer through Gora, with the option to store data in HBase, Accumulo, Cassandra, MySQL, Datafileavrostore, Avrostore, but some of them are immature.

At present, Gora still needs to be improved. For the pursuit of the ultimate performance of friends, nutch2.x is not stable, the proposed use of nutch1.x, using HDFS and MapReduce data localization and natural parallelism, can be optimized to very fast speed.

Apache Gora Introduction

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.