Graph Database
Source: Euler and graph theory.
Data model: nodes and relationships; key-value pairs (properties) can also be handled.
Examples: AllegroGraph, InfoGrid, and Neo4j
Advantage: solves complex graph problems.
Relational database
Source: proposed by E. F. Codd in "A Relational Model of Data for Large Shared Data Banks"
Data model: relations (tables)
Examples: VoltDB, Clustrix, MySQL
Advantages: high performance, scalable OLTP, support for SQL
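The graph data model listed above (nodes and relationships that can carry key-value pairs) can be sketched in a few lines of Python. This is only a toy in-memory illustration, not how Neo4j or any of the listed products stores or queries data; the `PropertyGraph` class, its method names, and the sample people are all hypothetical.

```python
from collections import deque


class PropertyGraph:
    """Toy property graph: nodes and relationships both carry key-value pairs."""

    def __init__(self):
        self.nodes = {}  # node id -> dict of properties
        self.edges = {}  # node id -> list of (relationship type, target id, properties)

    def add_node(self, node_id, **props):
        self.nodes[node_id] = props
        self.edges.setdefault(node_id, [])

    def add_relationship(self, src, rel_type, dst, **props):
        if src not in self.nodes or dst not in self.nodes:
            raise KeyError("both endpoints must exist before linking them")
        self.edges[src].append((rel_type, dst, props))

    def shortest_path(self, start, goal):
        """Breadth-first search along outgoing relationships.

        Returns the list of node ids from start to goal, or None if unreachable.
        """
        if start not in self.nodes or goal not in self.nodes:
            return None
        previous = {start: None}
        queue = deque([start])
        while queue:
            current = queue.popleft()
            if current == goal:
                path = []
                while current is not None:
                    path.append(current)
                    current = previous[current]
                return path[::-1]
            for _rel_type, neighbor, _props in self.edges[current]:
                if neighbor not in previous:
                    previous[neighbor] = current
                    queue.append(neighbor)
        return None


graph = PropertyGraph()
graph.add_node("alice", name="Alice")
graph.add_node("bob", name="Bob")
graph.add_node("carol", name="Carol")
graph.add_relationship("alice", "KNOWS", "bob", since=2010)
graph.add_relationship("bob", "KNOWS", "carol", since=2015)

print(graph.shortest_path("alice", "carol"))  # ['alice', 'bob', 'carol']
```

A multi-hop question such as "how is Alice connected to Carol?" is a single traversal here, whereas a relational schema would typically answer it with one self-join per hop.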
structures often require fast, repeated scans of the database. The introduction of RDDs enables Spark to process graph-based data structures more efficiently, making it possible to store and process large-scale graph networks. Similar systems dedicated to graphs include Neo4j. Compared with joins in a traditional database, GraphX can handle larger and deeper topological relationships and can run across multiple cluster nodes; it is indeed
people are willing to migrate to the NoSQL platform. On the other hand, we should consider how to develop standards for different types of NoSQL databases, such as the document database MongoDB and the graph database Neo4j.
A query language designed for graph databases is not necessarily applicable to document databases or key-value stores. Of course, in most cases our queries are very simple and easy to support. These non-relation
WebSockets
Although not required, this is useful knowledge to have in your tool belt. Learn how to write a live web application using WebSockets and try it in some sample applications. You can use it in the blog application above to enable real-time updates to the list of blog posts.
22nd Step-Learn GraphQL
Learn how to create an API using GraphQL. Learn how it differs from REST and why it is called REST 2.0.
23rd Step-Look at graph databases
Graph models represent a very flexible way o
Elasticsearch provides the river module to obtain data from other data sources. This function exists as a plug-in. The existing river plug-ins include:
River plugins
1. Supported by Elasticsearch
CouchDB River Plugin
RabbitMQ River Plugin
Twitter River Plugin
Wikipedia River Plugin
2. Supported by the community
ActiveMQ River Plugin (by Dominik Dorn)
Amazon SQS River Plugin (by Alex Bogdanovski)
CSV River Plugin (by Martin Bednar)
Dropbox River Plugin (by David Pilato)
FileSystem River Plugin
popular document oriented data store.
MongoDB: a good introduction to MongoDB architecture.
Graph
Neo4j: most popular graph database.
Titan: open source graph database under the Apache license.
ACID
I see a lot of evolution happening in the open source community, which is trying to catch up with what Google has done; 3 of the prominent papers below are from Google, which has solved the globally distributed consistent data store problem.
Megastore: a high
universality of NoSQL's advantages can compensate for the shortcomings of relational databases in this area
Storage of unstructured and semi-structured data
Performance
The industry typically uses NoSQL as a memory-focused service constrained by I/O and network, with request response times in milliseconds and single-machine QPS on the order of 100,000 (depending on data size and storage complexity)
Some common types of NoSQL products
K-V (Memcached, Redis): this kind of NoSQL
Interesting readings
Big Data Benchmark: benchmark of Redshift, Hive, Shark, Impala and Stinger/Tez.
NoSQL Comparison: Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase vs Couchbase vs Neo4j vs Hypertable vs Elasticsearch vs Accumulo vs VoltDB vs Scalaris comparison.
Interesting Papers 2013-2014
2014, Stanford: Mining of Massive Datasets.
2013, AMPLab: Presto: Distributed Machine Learning and Graph Processing with Sparse Ma
++ based BigTable DBMS that saves communication and can run independently or on a Hadoop-like distributed file system.
Document storage
CouchDB: a document-oriented database system that is easy to use and supports multi-host replication.
Elasticsearch: a Java-based database, popular for log aggregation and e-mail archiving projects.
MongoDB: another document-oriented database system.
RavenDB: a document-based database with ACID/transaction functionality.
RethinkDB: an open-source distributed document storage database, focused on JSON.
Graph
FlockDB: Twitter's distributed, fault-tolerant graph database
();
println (a);
println (b);
Output:
3
2
1
2
3.1.4. References
http://en.wikipedia.org/wiki/Abstract_syntax_tree
https://javacc.java.net/
http://www.antlr.org/
3.2. Operating system knowledge
Ubuntu
CentOS
Use Linux and become familiar with shell scripts
3.3. Data storage knowledge
3.3.1. Relational database
MySQL
How to view the execution plan
How to set up MySQL master/slave replication
What is B
continuous integration.
Persistence layer
The persistence layer manages the various stores suited to the needs and patterns of the data. It ensures that heterogeneous data stores are set up and managed. This includes relational databases such as MySQL and PostgreSQL; key-value data stores such as Hadoop, Riak, and Redis; column-store databases such as HBase and Cassandra; document databases such as MongoDB and Couchbase; and graph databases such as
, queues, barriers, and latches. Many well-known and successful projects depend on ZooKeeper, including HBase, Hadoop 2.0, Solr Cloud, Neo4j, Apache Blur (incubating), and Accumulo.
ZooKeeper is a distributed, hierarchical file system that facilitates loose coupling between clients and provides a consistent view of znodes, which are similar to files and directories in a traditional file system. It provides basic operations, such as creating, deletin
Sesame HTTP: Non-relational database storage, Sesame database storage
NoSQL stands for Not Only SQL and refers broadly to non-relational databases. NoSQL stores are based on key-value pairs and do not need parsing at the SQL layer. There is no coupling between data items, and performance is very high.
Non-relational databases can be subdivided as follows.
Key-value storage databases: representatives include Redis, Voldemort, and Oracle BDB.
Column storage databases: representatives include Cassandra, HBase, and Riak.
simplify bytecode generation. Official Website
Byteman: manipulates bytecode at runtime through a DSL (rules), mainly for testing and troubleshooting. Official Website
Javassist: an attempt to simplify bytecode editing. Official Website
Cluster Management
A framework for dynamically managing applications in a cluster.
Apache Aurora: a Mesos framework for running long-running services and scheduled tasks (cron jobs). Official Website
Singularity: Singularity is a Mesos framewor
, Solr Cloud, Neo4j, Apache Blur (incubating), and Accumulo.
ZooKeeper is a distributed, hierarchical file system that facilitates loose coupling between clients and provides a consistent view of znodes, which are similar to files and directories in a traditional file system. It provides basic operations, such as creating, deleting, and checking the existence of a znode. It provides an event-driven model in which the client can observe changes to a particular znode, su
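The znode operations described above (create, delete, existence checks, and observing changes) can be illustrated with a small in-memory model. This is a sketch under stated assumptions and is not the ZooKeeper client API: the `ZNodeTree` class and its method signatures are hypothetical, everything runs in a single process, and watches are modeled as one-shot callbacks.

```python
class ZNodeTree:
    """Toy in-memory znode hierarchy; hypothetical, NOT the ZooKeeper client API."""

    def __init__(self):
        self._data = {"/": b""}  # path -> data stored at that znode
        self._watches = {}       # path -> list of one-shot callbacks

    @staticmethod
    def _parent(path):
        parent = path.rsplit("/", 1)[0]
        return parent or "/"

    def exists(self, path, watch=None):
        """Return whether the znode exists; optionally register a one-shot watch."""
        if watch is not None:
            self._watches.setdefault(path, []).append(watch)
        return path in self._data

    def create(self, path, data=b""):
        if path in self._data:
            raise KeyError(f"znode already exists: {path}")
        if self._parent(path) not in self._data:
            raise KeyError(f"parent znode does not exist: {path}")
        self._data[path] = data
        self._fire(path, "created")

    def delete(self, path):
        if path == "/":
            raise ValueError("the root znode cannot be deleted")
        if path not in self._data:
            raise KeyError(f"no such znode: {path}")
        if any(other.startswith(path + "/") for other in self._data):
            raise ValueError(f"znode has children: {path}")
        del self._data[path]
        self._fire(path, "deleted")

    def _fire(self, path, event):
        # Watches are removed before being called, so each one fires at most once.
        for callback in self._watches.pop(path, []):
            callback(event, path)


events = []
tree = ZNodeTree()
tree.exists("/app", watch=lambda event, path: events.append((event, path)))
tree.create("/app")                 # triggers the watch registered above
tree.create("/app/config", b"v1")   # nobody is watching this path
tree.delete("/app/config")

print(events)                       # [('created', '/app')]
print(tree.exists("/app/config"))   # False
```

The watch is registered on a path that does not exist yet, which is how a client can wait for another client to create a znode without polling.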
://en.wikipedia.org/wiki/Abstract_syntax_tree
https://javacc.java.net/
3.2. Operating System Knowledge
Ubuntu, CentOS: use Linux and become familiar with shell scripts
3.3. Data Storage Knowledge
3.3.1. Relational Database
MySQL: how to view the execution plan; how to set up MySQL master/slave replication; binlog. Derby, H2, PostgreSQL, SQLite
3.3.2. NoSQL
Cache: Redis, Memcached. LevelDB, Bigtable, HBase, Cassandra, MongoDB. Graph database: Neo4j
3.3.3. References
Ht
Chapter One: Spring Data JPA
What is Spring Data? Spring Data is an open-source framework for simplifying database access and supporting cloud services. Its main goal is to make data access easy and fast, and to support MapReduce frameworks and cloud data services. Spring Data contains multiple sub-projects:
Commons: provides a shared infrastructure for use by individual sub-projects and supports cross-database persistence
JPA: simplifies the creation of JPA data access tiers and persistent layer cap
Running startup.bat in the Tomcat bin directory starts Tomcat normally, but after installing the Tomcat plugin in Eclipse and configuring the Tomcat path, starting it fails with the error: A Java Exception has occurred
Workaround: in Eclipse, open [Window] > [Preferences], find [Tomcat] > [JVM Settings], and locate [Classpath (before generated classpath)] on the configuration page on the right. Click [Jar/Zip] and, in the file selection dialog that pops up, select the bin/tomcat-juli.jar file under the Tomcat directory. Start Tomcat again and the error is resolved.