At present, real-time or quasi-real-time Big data models are more and more, technology is not advanced is not the first reason for popularity, the prosperity of community circles is the most important. Mainly has
- Redshift-An MPP from Amazon supports PB-level databases
- Hive-translates SQL into Map-reduce tasks based on the SQL engine above Hadoop;
- Shark-a SQL engine compatible with hive SQL based on the Spark computing framework;
- Impala-SQL compatible with hive SQL, implemented by the class MPP execution engine;
- Stinger/tez-stinger is Hontonworks, with Cloudera Daleitai products, the next Generation computing framework Tez added to pull the big flag;
Even if it is a real-time product, positioning is also very clear, this is an OLAP product, and hbase and other products, but with the figure calculation engine has a certain degree of communication. In the current situation, Spark has a great advantage. Big Data product Development Update iteration Soon, these are how many of these Google Dremel traces of the product will be, we continue to focus on
Real-time technology for big data