I recently wrote an HBase-based MapReduce program. A summary follows:
1. Use TableMapper to read the table.
2. The first way to write to a table is the TableMapReduceUtil.initTableReducerJob method, which allows output in the map phase as well as in the reduce phase. The difference is whether the reducer class is set to null or to an actual reducer. Below is an example of a table copy:
package com.run.test;
import java.io.IOException;
import java.util.List;
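The difference between a null reducer and an actual reducer can be pictured with a small in-memory model. This is only a sketch in plain Java with no HBase dependency: the class TableCopyModel, the maps standing in for tables, and the method name copy are all hypothetical. In a real job, TableMapper and TableMapReduceUtil.initTableReducerJob do this work.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.BinaryOperator;

// Hypothetical in-memory model; each map stands in for a table (row key -> value).
class TableCopyModel {

    // The "mapper" is the identity, as in a table copy.
    // reducer == null : map-only job, each pair is written straight to the target.
    // reducer != null : pairs are grouped by row key (the shuffle), reduced, then written.
    static Map<String, String> copy(Map<String, String> source, BinaryOperator<String> reducer) {
        Map<String, String> target = new TreeMap<>();
        if (reducer == null) {
            source.forEach(target::put);              // output happens in the map phase
            return target;
        }
        Map<String, List<String>> shuffled = new TreeMap<>();
        source.forEach((key, value) ->
                shuffled.computeIfAbsent(key, k -> new ArrayList<>()).add(value));
        // Every group holds at least one value, so the Optional is never empty.
        shuffled.forEach((key, values) ->
                target.put(key, values.stream().reduce(reducer).get()));
        return target;
    }

    public static void main(String[] args) {
        Map<String, String> source = new TreeMap<>();
        source.put("row1", "a");
        source.put("row2", "b");
        System.out.println(copy(source, null));                 // map-only copy
        System.out.println(copy(source, (first, last) -> last)); // copy through a reducer
    }
}
```

For a plain copy both paths produce the same target table; the map-only path simply skips the grouping step, which is why a null reducer is the usual choice when no aggregation is needed.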
Environment: Hadoop: hadoop-2.2.0; HBase: hbase-0.96.0
1. org.apache.hadoop.hbase.client.Put
In 0.94.6: public class Put extends Mutation implements HeapSize, Writable, Comparable
In 0.96.0: public class Put extends Mutation implements HeapSize, Comparable
Workaround: change public class Monthuserlogintimeindexreducer extends Reducer to public class Monthuserlogintimeindexreducer extends Reducer
2. org.apache.hadoop.hbase.client.Mutation.familyMap
org.apache.hadoop.hbase.client.Mutation.familyMap type cha
SQL operation is a traditional table.
3. HBase: In layman's terms, HBase functions like a database. A traditional database manages centralized local data files, whereas HBase manages distributed data files on top of HDFS, supporting operations such as adding and deleting. In other words, HBase is just a persistent fil
Transferred from: http://blog.jobbole.com/86269/
Although hierarchical databases are still widely used on mainframes today, relational databases (RDBMS) (SQL) have captured the database market and have performed quite well. The money we deposit does not end up in someone else's account, we are guaranteed our own seat on the plane, we are not blamed for what we did not do, and so on. The data integrity of a relational database is due to its adherence to the principles of ACID (atomicity,
require huge overhead and impose a performance cap, and a single machine can never support all of the load of Google or Facebook. Given this situation, we need a new kind of database, because relational databases do not run well on clusters. You could build a relational database cluster, but such clusters use shared storage, which is not the type we want. So began the NoSQL era, in which Google, Facebook, and Amazon are trying to handl
, the term "NoSQL" was formally introduced, and there are now 225 solutions in total. NoSQL differs from RDBMS in two main ways. First, it is schemaless, so schema changes are flexible. Second, scalability: a native RDBMS is suited only to standalone machines and small clusters. NoSQL was distributed from the outset, addressing both read-write and
Transferred from: http://www.cnblogs.com/alephsoul-alephsoul/archive/2013/04/26/3044630.html
Guide: Kristóf Kovács is a software architect and consultant who recently published an article comparing various types of NoSQL databases. The article was compiled by Agile translator Tang Yuhua. For reprints, please refer to the following statement. Although SQL database is a
limit, etc., and is the NoSQL database most like MySQL, whereas HBase supports only three kinds of lookup: access by a single row key, access by a row key range, and a full table scan.
5. MongoDB updates are update-in-place: a record is updated in situ unless the updated record no longer fits in its original location. In HBase, modifying and adding use the same command: Put.
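The three kinds of lookup and the single Put command can be pictured with a sorted map keyed by row key. This is only an in-memory sketch, assuming rows are kept in row-key order and that a range includes its start key and excludes its stop key; the class RowStoreModel and its method names are hypothetical and are not the HBase client API.

```java
import java.util.Collections;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical model: one sorted map standing in for an HBase table.
class RowStoreModel {
    private final TreeMap<String, String> rows = new TreeMap<>();

    // Put covers both "add" and "modify": a new key is inserted, an existing one is overwritten.
    void put(String rowKey, String value) {
        rows.put(rowKey, value);
    }

    // 1. Access by a single row key (null if the row does not exist).
    String get(String rowKey) {
        return rows.get(rowKey);
    }

    // 2. Access by a row key range: startRow inclusive, stopRow exclusive.
    SortedMap<String, String> scan(String startRow, String stopRow) {
        return Collections.unmodifiableSortedMap(rows.subMap(startRow, stopRow));
    }

    // 3. Full table scan.
    SortedMap<String, String> scanAll() {
        return Collections.unmodifiableSortedMap(rows);
    }

    public static void main(String[] args) {
        RowStoreModel table = new RowStoreModel();
        table.put("user#001", "alice");
        table.put("user#002", "bob");
        table.put("user#003", "carol");
        table.put("user#002", "bobby");   // same command, now acting as an update
        System.out.println(table.get("user#002"));
        System.out.println(table.scan("user#001", "user#003"));
        System.out.println(table.scanAll().size());
    }
}
```

Because every lookup goes through the row key, the design of the row key decides which queries are cheap; anything that cannot be expressed as a key or key range falls back to the full scan.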
Part 1: Concepts
1. Why use NoSQL
Impedance mismatch: the difference between the relational model of a relational database and the data structures in memory.
Integration database: multiple applications, typically developed by different teams, store their data in a common database.
Application database: its contents can be accessed directly only from the codebase of one application, and this codebase is maintained by a single team.
Use NoSQL reaso
offline data manipulation, which means that it is not suitable for real-time online queries or operations in real-world production environments because, in a word, it is slow.
Hive originated at Facebook and plays the role of a data warehouse in Hadoop. It is built on top of the Hadoop cluster and provides a SQL-like interface for operating on data stored in the cluster. You can do SELECT, JOIN, and so on with HiveQL. If you have data warehousing requirements, are good at writing SQL, and don't want to write MapReduce jobs, you can use Hive instead.
HBase
HBase runs on HDFS as a c
HBase tables generally have the following features: 1. Large: a table can have hundreds of millions of rows and 1 million columns. 2. Column-oriented: storage and permission control are organized by column (family), and each column (family) is retrieved independently. 3 thin
requirement.
7. NoSQL Database Classification
Type: Column storage
Representative products: HBase, Cassandra, Hypertable
Characteristics: Stores data by column. Its biggest advantages are convenient storage of structured and semi-structured data, easy data compression, and a very large IO advantage for queries that touch only one or a few columns.
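The IO advantage for single-column queries can be illustrated by counting how many stored values must be read to sum one column. A minimal sketch, assuming a toy table of three numeric columns held in memory; the class ColumnLayoutDemo and its read counter are hypothetical, and the model ignores compression and disk blocks.

```java
// Hypothetical comparison of a row layout and a column layout for one query.
class ColumnLayoutDemo {

    // Row layout: each row keeps all of its columns together,
    // so reading one column still fetches every value of every row.
    // Returns {sum, valuesRead}.
    static long[] sumColumnRowLayout(long[][] rows, int col) {
        long sum = 0;
        long valuesRead = 0;
        for (long[] row : rows) {
            for (int c = 0; c < row.length; c++) {
                valuesRead++;
                if (c == col) {
                    sum += row[c];
                }
            }
        }
        return new long[] {sum, valuesRead};
    }

    // Column layout: each column is stored contiguously, so only that column is read.
    // Returns {sum, valuesRead}.
    static long[] sumColumnColumnLayout(long[][] columns, int col) {
        long sum = 0;
        long valuesRead = 0;
        for (long value : columns[col]) {
            valuesRead++;
            sum += value;
        }
        return new long[] {sum, valuesRead};
    }

    public static void main(String[] args) {
        // The same 4 x 3 table stored both ways.
        long[][] rows = {{1, 10, 100}, {2, 20, 200}, {3, 30, 300}, {4, 40, 400}};
        long[][] columns = {{1, 2, 3, 4}, {10, 20, 30, 40}, {100, 200, 300, 400}};
        long[] byRow = sumColumnRowLayout(rows, 1);
        long[] byColumn = sumColumnColumnLayout(columns, 1);
        System.out.println("row layout:    sum=" + byRow[0] + " valuesRead=" + byRow[1]);
        System.out.println("column layout: sum=" + byColumn[0] + " valuesRead=" + byColumn[1]);
    }
}
```

Both layouts return the same sum, but the row layout touches every value in the table while the column layout touches only the values of the queried column, and the gap widens as the number of columns grows.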
a commercial version. Runs on Linux, Windows, OS X, FreeBSD, and Solaris. Provides an intuitive graphical interface for designing databases. Thanks to its open source community, MySQL has a wealth of documentation and tutorials to help you get started and solve problems. Supports partitioning, replication, XPath, stored procedures, triggers, and views.
3. Microsoft SQL Server
First release: 1989
License: proprietary
SQL: yes
The most used commercial databases are lim
save list (ordered) and set (unordered) data, and can also perform sorting (SORT) and other advanced functions. Operations such as INCR and SETNX are atomic, and features such as master-slave replication are also supported. Redis is written in C and can be used like memcached, in front of a traditional database. It supports many programming languages and is used by many popular projects, such as GitHub and Engine Yard, a cli
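The atomic behavior of INCR and SETNX can be modeled in a few lines. This is a sketch of the semantics only, not a Redis client: the class AtomicOpsModel and its methods are hypothetical, and a ConcurrentHashMap stands in for the server's keyspace.

```java
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical model of two atomic Redis-style operations on a string keyspace.
class AtomicOpsModel {
    private final ConcurrentHashMap<String, String> store = new ConcurrentHashMap<>();

    // INCR: add 1 to the integer stored at key (a missing key counts as 0)
    // and return the new value. merge() applies the update atomically.
    long incr(String key) {
        return Long.parseLong(store.merge(key, "1",
                (current, one) -> Long.toString(Long.parseLong(current) + 1)));
    }

    // SETNX: set the key only if it does not exist yet; true means the value was set.
    boolean setnx(String key, String value) {
        return store.putIfAbsent(key, value) == null;
    }

    String get(String key) {
        return store.get(key);
    }

    public static void main(String[] args) throws InterruptedException {
        AtomicOpsModel model = new AtomicOpsModel();
        Thread[] workers = new Thread[4];
        for (int i = 0; i < workers.length; i++) {
            workers[i] = new Thread(() -> {
                for (int n = 0; n < 1000; n++) {
                    model.incr("hits");
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            worker.join();
        }
        System.out.println(model.get("hits"));              // 4000: no increment is lost
        System.out.println(model.setnx("lock", "owner-1")); // true: key was free
        System.out.println(model.setnx("lock", "owner-2")); // false: key already set
    }
}
```

Atomicity is what makes these two commands useful as building blocks: INCR gives a shared counter without a read-modify-write race, and SETNX lets exactly one caller claim a key.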
In the latest release of the Hortonworks HDP Sandbox, version 2.2, HBase fails to start with an error. The storage path of the new HBase version differs from earlier versions, but the startup script still uses the old command line to start HBase, so the hbase-daemond.sh file cannot be found and startup fails. See, the 2.2 version of the sandbox release a little h
Kristóf Kovács, a software architect who has worked for a number of large companies, made a comprehensive comparison of the mainstream NoSQL databases (Cassandra, MongoDB, CouchDB, Redis, Riak, Membase, Neo4j, and HBase) on his blog.
Although SQL databases are very useful tools, their monopoly is about to be broken after a 15-year solo run. It was only a matter of time: forced to use rela
Today we mainly analyze the characteristics of HBase and present some real application scenarios in which HBase solves efficiency problems in applications.
HBase Features Overview
HBase is an open source implementation of Google's Bigtable, built on HDFS, providing high reliability, high performance, Columnst