In the era of big data, the wealth that big data creates shows up not only in social value but also in commercial value. In the commercial field, the real wealth of big data lies in the industrial chain rather than the consumption chain, so integrating big data with industry will be the foundation for putting it into practice.
Everyone on the Internet browses news, sends and receives email, shops online, interacts with friends on social networking sites, and even uploads selfies from mobile apps. Each of your web "footprints" is recorded and stored as data, and this data, which is growing geometrically, is gradually reshaping both the Internet and our lives.
Over the past year, big data has undoubtedly become the hottest term in IT. Multinational IT giants such as EMC, IBM, and Oracle have launched big data strategies and products; the United States has launched a big data initiative; the United Nations has issued a big data report; and many cities and industrial parks in China have announced plans to develop a big data industry. Controversy always accompanies hot topics, however. Opponents argue that big data is just another concept hyped and sold by vendors, another land grab.
Because of a shortage of data, limited capacity for data cleaning and analysis, and bottlenecks in data visualization, big data has been slow to take hold in China over the past few years. Now, as the infrastructure is gradually built out, the development of big data has reached a new tipping point.
At present, the widespread use of multimedia, social media, and the Internet of Things is greatly increasing the amount of information that enterprises can obtain. Sensors built into processing machinery collect operational data; marketers scan social media or use smartphone location data to understand customers' consumption habits; data may be exchanged over networks with supply chain partners; and employees can share best practices on the intranet. According to IDC forecasts, the world's total data volume will reach 35 ZB by 2020 (1 ZB is 10^21 bytes, or about one trillion gigabytes).