This was the 9th DTCC Database Technology Conference. There was a fair amount of vendor product promotion, but overall there was plenty of substantive content.
For most sessions I chose the talks on big data practice and stream computing. The talks from NetEase and Didi were quite good.
I learned that most people now use Spark Streaming or Flink for stream computing.
We had been building our real-time data warehouse with Kafka + Storm + Spark.
I don't yet know in what ways Spark Streaming and Flink are more advanced than Storm; that is still an open question for me. Development does seem to be more concise with them. Is that because of SQL-style development?
For HBase queries, we use Solr as a secondary index, and Kylin for multidimensional analysis.
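A rough sketch of the secondary-index pattern mentioned above, using plain Python dictionaries as stand-ins for HBase (the primary store, looked up by row key) and Solr (the index that maps a field value back to row keys). The class name, field names, and row keys are all hypothetical, and no real HBase or Solr client calls are used; it only illustrates the two-step lookup.

```python
from collections import defaultdict


class SecondaryIndexedStore:
    """In-memory stand-in: `rows` plays the role of the HBase table,
    `index` plays the role of the Solr index (hypothetical names)."""

    def __init__(self, indexed_fields):
        self.indexed_fields = tuple(indexed_fields)
        self.rows = {}                  # row key -> row dict (primary store)
        self.index = defaultdict(set)   # (field, value) -> set of row keys

    def put(self, row_key, row):
        # Drop stale index entries if this row key is being overwritten.
        old = self.rows.get(row_key)
        if old is not None:
            for field in self.indexed_fields:
                if field in old:
                    self.index[(field, old[field])].discard(row_key)
        # Write the row, then index each configured field.
        self.rows[row_key] = dict(row)
        for field in self.indexed_fields:
            if field in row:
                self.index[(field, row[field])].add(row_key)

    def query(self, field, value):
        # Step 1: ask the index for the matching row keys.
        # Step 2: fetch the full rows by key from the primary store.
        if field not in self.indexed_fields:
            raise ValueError(f"field {field!r} is not indexed")
        keys = sorted(self.index.get((field, value), ()))
        return [(key, self.rows[key]) for key in keys]


store = SecondaryIndexedStore(indexed_fields=["city"])
store.put("user#001", {"name": "Li", "city": "Beijing"})
store.put("user#002", {"name": "Wang", "city": "Hangzhou"})
store.put("user#003", {"name": "Zhao", "city": "Beijing"})

beijing_keys = [key for key, _ in store.query("city", "Beijing")]
print(beijing_keys)
```

The point of the pattern is that a query on a non-key field never scans the primary table: the index returns row keys, and the store is then read by key only.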
Most people now seem to use Kudu + Impala for queries, although Kylin is also widely used. Unfortunately, most speakers did not talk about the pitfalls they hit when using Kylin, which was a bit of a pity.
Speaking of Kylin, I also met its founding team. However, they did not seem to have a solution for my question about the poor performance of multi-table joins combined with multi-field GROUP BY; they only said that I might be using it the wrong way. I did not press further.
About the DTCC Database Technology Conference