Forbes: Hadoop, the big data tool you have to understand
Apache Hadoop has become the driving force behind the growth of the big data industry. Technologies such as Hive and Pig are often mentioned alongside it, but what do they all do, and why do they have such strange names (Oozie, ZooKeeper, Flume)?
Hadoop brings the ability to process big data cheaply (big data here usually means 10-100 GB or more, across a variety of data types, both structured and unstructured). But how does this differ from what came before?
Today's enterprise data warehouses and relational databases are good at handling structured data and can store large amounts of it, but at a considerable cost. Their requirement that data be structured limits the kinds of data that can be processed, and this inflexibility hampers agility when a data warehouse is confronted with massive amounts of heterogeneous data. In practice, valuable data sources within the organization often go unmined. This is the biggest difference between Hadoop and traditional data processing methods.
This article focuses on the components of the Hadoop ecosystem and explains what each one does.
NoSQL Movement: Database Architecture Choice
In a conversation last year, Basho's CTO Justin Sheehy said that NoSQL is a movement, not a technology. I agreed immediately, because earlier discussions of NoSQL had never sat comfortably with me.
So why is NoSQL a movement rather than a technology? Justin's argument is straightforward: NoSQL is a movement because it is about choice in database architecture. Focusing on any single technology obscures the essence of the NoSQL movement.
Since the 1980s, relational databases such as SQL Server, Oracle, and DB2 have been the dominant back end for business systems. These relational database products are excellent and have much in common.
Looking back over the past 15 years of software development, we have built many excellent large-scale database applications, including web applications. However, much has changed in the database field since the relational database was born: exploding data volumes, the expectation of query responses on the order of seconds, 24/7 uptime, the growing importance of capturing high-speed data streams, unstructured data, and a willingness to sacrifice ACID guarantees.
The NoSQL movement has made us think about which database architecture we actually want. Perhaps we will eventually accept that there is no universal answer.
C# and Java still dominate in the NoSQL database environment
With the development of data services and NoSQL databases, web applications increasingly need NoSQL databases, and NoSQL now appears poised to break the dominance of traditional relational databases.
A survey conducted by Couchbase last December showed that many companies are preparing to adopt NoSQL technology over the next year.
Google published its Bigtable paper in 2006, and Amazon brought us Dynamo. The innovations of these internet giants pioneered the development of NoSQL technology.
NoSQL's influence has spread to relatively traditional industries such as finance, insurance, automotive, transportation, media, and manufacturing. Companies including Zynga, SmugMug, AOL, and TheKnot.com favor NoSQL databases such as Couchbase and MongoDB. On the current trend, NoSQL development will reach a new level in 2012.
The Couchbase survey received responses from 1,351 people, who indicated that they are interested in NoSQL technology and may deploy NoSQL databases this year. They said NoSQL may be better suited to large, complex web applications, which should make NoSQL more common in the future. The survey also shows that NoSQL has a good market outlook: about half of the companies surveyed said they would spend more on NoSQL technology in the first half of 2012.
The 6 innovative technologies that are about to change the world
Over the past five or six years, we have witnessed the rise of smartphones, tablets, touch screens, Internet TVs, free Wi-Fi, Facebook, Twitter, and other new arrivals. Which epoch-making technologies will emerge in the coming years? What changes will they bring to our lives?
Just as mobile phones and the Internet have changed our lives, six technologies (cloud computing, data analytics, Hadoop, IPv6, low-power sensors, and 3D printing) will also bring us a very different experience.
Deltacloud becomes an Apache top-level project
The Apache Software Foundation (ASF) has announced that Apache Deltacloud has graduated from the Apache Incubator to a top-level project (TLP).
Deltacloud, an open source API launched by Red Hat in September 2009, defines a RESTful web service designed to provide a unified way of interacting with cloud service providers and cloud resources.
In addition, Deltacloud includes API implementations for the most popular cloud services, such as Amazon, Eucalyptus, GoGrid, IBM, Microsoft, OpenStack, and Rackspace. Beyond the server-side API implementations, the project provides client libraries in multiple languages.
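The idea of one interface in front of many providers can be sketched in a few lines of Python. This is only an illustration of the general pattern, not Deltacloud's actual API: the class names, the two made-up providers, and the state names are all assumptions invented for the example.

```python
from abc import ABC, abstractmethod
from typing import List, NamedTuple


class Instance(NamedTuple):
    """Provider-neutral view of a machine (hypothetical shape)."""
    provider: str
    instance_id: str
    state: str


class CloudDriver(ABC):
    """Hypothetical per-provider adapter; not a Deltacloud class."""

    @abstractmethod
    def list_instances(self) -> List[Instance]:
        """Return this provider's machines in the common format."""


class ProviderADriver(CloudDriver):
    """Made-up provider that reports dicts with lowercase status strings."""

    def list_instances(self) -> List[Instance]:
        raw = [{"id": "a-001", "status": "running"}]
        return [Instance("provider-a", r["id"], r["status"].upper()) for r in raw]


class ProviderBDriver(CloudDriver):
    """Made-up provider that reports tuples and calls a running machine ACTIVE."""

    def list_instances(self) -> List[Instance]:
        raw = [("b-9", "ACTIVE"), ("b-10", "STOPPED")]
        translate = {"ACTIVE": "RUNNING", "STOPPED": "STOPPED"}
        return [Instance("provider-b", rid, translate[st]) for rid, st in raw]


def list_all(drivers: List[CloudDriver]) -> List[Instance]:
    """Client code talks to one interface, whatever provider sits behind it."""
    result: List[Instance] = []
    for driver in drivers:
        result.extend(driver.list_instances())
    return result


if __name__ == "__main__":
    for inst in list_all([ProviderADriver(), ProviderBDriver()]):
        print(inst.provider, inst.instance_id, inst.state)
```

Each driver hides its provider's own data format and vocabulary, so supporting another cloud means adding a driver rather than changing the client.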
Is Oracle weak in the cloud computing era?
Oracle has been indifferent to cloud computing for years. Larry Ellison first expressed disdain for it and then started peddling it, or at least his own version of it. But Oracle's growth remained good, and the industry's chatter seemed irrelevant. Big companies still bought big-name software systems.
But this winter, things changed.
Oracle's software license revenue rose a bare 2% in December, which was blamed on customer budget cuts and worries over the European debt crisis. Sales in Europe, the Middle East, and Africa account for one-third of Oracle's revenue. Shares plunged 8%, but perhaps more to the point, Oracle shares have fallen 22% since reaching their high in May 2011. Investors seem to be signaling that Oracle's recent woes do not stem merely from customers' tight-fistedness. Is it because big, bulky IT organizations are changing their purchasing habits?
Kaixin changes: organizational restructuring and a mobile Internet push
Kaixin (Happy Net) enjoyed a brilliant period brought on by games such as vegetable stealing, and then endured public talk of declining user traffic. It has now finished adjusting its strategic direction and organizational structure, and is quietly beginning to break out of the cocoon of the old Kaixin.
1. Internal restructuring
At the beginning of 2012, news that Kaixin had abolished its wireless department prompted much speculation in the industry, but this was in fact only one part of Kaixin's restructuring.
2. A push into mobile Internet
Whether judged by the personnel changes in the mobile Internet area or by the assessment targets set for each department, mobile Internet is undoubtedly the top priority for 2012.
3. Alliance with Tencent: low-key and pragmatic
On October 31 last year, Tencent made a strategic investment in Kaixin, and the two sides then began strategic cooperation. Although a deal between two leading companies in the domestic social networking market has drawn attention, Tencent and Kaixin have been unusually low-key and have not even held a press conference to publicize the prospects for cooperation.
Microsoft cuts SQL Azure prices to attract more users
Microsoft recently lowered the price of its SQL Azure database platform to attract more users to its cloud computing services.
At the same time, Microsoft released a new 100 MB database option, which provides the full range of database features at half the previous price. According to Microsoft, customers with databases larger than 1 GB can save 48% to 75%.
Microsoft wants to bring Active Directory into the cloud
Over the next few months, Windows users and Microsoft partners will see more information about Windows Azure Active Directory. The concept is the same as that of Active Directory on Windows Server.
Active Directory is the centralized directory service in Microsoft Windows Server, responsible for managing medium and large network environments, and has been built into Windows Server products since Windows 2000 Server. It handles the network objects in an organization. Objects can be users, groups, computers, domain controllers, mail, profiles, organizational units, forests, and so on; any object defined in the Active Directory schema can be stored in the Active Directory data store and accessed through the Active Directory Service Interfaces (ADSI). In fact, many Active Directory management tools use this interface to read and work with Active Directory data.
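The rule that only schema-defined objects can be stored is easy to picture with a toy model. The Python sketch below is not ADSI and does not reflect the real Active Directory schema; the `Directory` class, the `SchemaError` exception, and the object classes and attributes in `TOY_SCHEMA` are all hypothetical, chosen only to show a store that checks every entry against a schema before accepting it.

```python
from typing import Dict, Set


class SchemaError(ValueError):
    """Raised when an entry does not match the schema (hypothetical)."""


class Directory:
    """Toy directory store that accepts only schema-defined objects."""

    def __init__(self, schema: Dict[str, Set[str]]) -> None:
        # schema maps an object class name to the attributes it allows
        self._schema = schema
        self._entries: Dict[str, dict] = {}

    def add(self, name: str, object_class: str, **attributes: str) -> None:
        # Reject object classes the schema does not define
        if object_class not in self._schema:
            raise SchemaError(f"unknown object class: {object_class}")
        # Reject attributes the schema does not allow for this class
        unknown = set(attributes) - self._schema[object_class]
        if unknown:
            raise SchemaError(f"attributes not in schema: {sorted(unknown)}")
        self._entries[name] = {"object_class": object_class, **attributes}

    def get(self, name: str) -> dict:
        # Raises KeyError if no entry with this name was stored
        return self._entries[name]


# Made-up schema for illustration only
TOY_SCHEMA = {"user": {"mail", "department"}, "computer": {"os"}}

if __name__ == "__main__":
    directory = Directory(TOY_SCHEMA)
    directory.add("alice", "user", mail="alice@example.com")
    print(directory.get("alice"))
    try:
        directory.add("lobby-printer", "printer")
    except SchemaError as err:
        print("rejected:", err)
```

In this toy model, supporting a new kind of object means extending the schema first; the store itself does not change.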
A few days ago, Kaixin dissolved its wireless division, and the division's staff were fully reassigned to the platform, new product, and game departments. Guo said that this is precisely because wireless is the most important strategic direction for 2012: spreading the wireless division's strength across the departments effectively raises the mobile Internet capability of every product line.
From the second half of 2011, Kaixin began adjusting its internal organizational structure. It has so far established three lines of business: the platform department, the game department, and the new product department. Before this, Kaixin had no new product department, and its game and platform businesses had not been "separated."