On the advantages and disadvantages of real-time data statistics and its value analysis

Source: Internet
Author: User
Keywords Real-time

Intermediary transaction SEO diagnosis Taobao guest Cloud host technology Hall

With the continuous development of the Internet, information more attention to real-time, micro-bo popular, search engines have launched real-time search function, but for the site analysis of real-time data is more meaningful?

Actually look at the data to look at the report people often want the data the more real-time the better, they want to master the site every hour or even every 10 minutes of change, able to the current situation of the site at hand, can find problems and quick response. But actually, if you ask them, what can we do when they know the real-time changes in the site's data, or if there's a sudden surge or a surge in the amount of site traffic at some point in time? I reckon most people can't answer it. Just a while ago in the site to do real-time data statistics related work, so some ideas to share here.

Advantages and disadvantages of real-time statistics

Whether real-time statistics are useful or useless, first look at what needs to be done to get real-time statistics, and what real-time data can bring to us, which is the pros and Cons of real-time statistics.

First of all, from a technical point of view, it is clear that real-time data statistics need more resources to occupy, because the site analysis of most of the data need to be computed from the click Stream data, and there is no ready-made results data can be obtained straight line display. The data obtained from the click Stream needs to be calculated and aggregated, which is no doubt that these operations require more cost, especially for large web site large data processing, while real-time data increases the complexity of implementation, and may increase the data inaccuracy to some extent.

But real-time data statistics can show the ability to process data at the technical level, while providing richer report presentations, and even using dynamic trend charts for real time refreshes on the report, which naturally goes without saying, so sometimes many technicians are happy to do the work.

From the point of view of data application and analysis, at present, many real-time data statistics are used to show the changes in real-time traffic, which time period of the highest number of visits, or the site's overall activity is the highest, at the same time can be analyzed every hour of traffic or the number of users distribution, but these analysis of the site in the end how much significance What can we do even if we know that the website has the most online users at 8 or 9 o ' night? The stress tests on the site obviously don't need to be done this way.

So personally think that real-time statistics more is the real-time status of the site monitoring, for analysis, not much practical significance, as to the site optimization and decision support how much effect, at least I have not thought.

Note to Avinash Kaushik in the book mentioned a sentence: "Real-time data:it ' s not really relevant, and It's expensive to Boot." I really agree with this statement. Many people will feel that getting real-time data will be more conducive to real-time response, fine-grained data also provides a more detailed base for data analysis, and we can do more analysis based on this, but we need to see how much the cost and true value of real-time data can be. Avinash Kaushik also enumerates the negative effects of 5 typical data acquisition in real-time, which can be summed up in the following points:

Do not blindly pursue the quantity of data, more should pay attention to the quality of data, and through effective analysis to reflect the value of the data;

does not conform to the principle of 10/90, the cost of real-time data in the acquisition obviously cannot and the analysis value reach 1:9 proportion;

Too much attention to real-time data will be constrained in the choice of analysis tools, can not use a truly excellent web analytics tools;

Technical system resource occupancy, task scheduling and complex process;

May increase the inaccuracy of the data in some way.

Of course, real-time data is not a bit of value, but for its cost considerations, there is no need for each analysis indicators in real-time statistics, or spend a lot of energy to focus on real-time data.

Value of real-time data

In fact, whether Google Analytics or Baidu statistics, have provided some indicators of the hourly statistics. Baidu Statistics will be real-time data statistics on the site shows that the user as long as a login can see the day of PV, UV and other data changes trend:

  

And in Google Analytics, we may find that GA is usually shown in the days of each metric, but in fact, GA also has the whole point of statistics, but the hidden more "deep" in the Visitors-visitor trending, in visits, Pageviews, Bounce rate and other reports will be found in the upper right time range Select the following time to summarize the granularity of more than one option--hour, you will see the trend of data changes every day after you choose:

  

Real-time data is not useless, avinash Kaushik that real-time data can create value when a company has fast analytics, fast decision-making, and fast execution capabilities. I'm here to give you a few of the applications that I think if we can get statistics per hour, then we can know in which time the site has the highest number of user visits, you can do some promotional activities in this time period, and through real-time data statistical analysis of the effect of activities to make a quick and reasonable response. For example, "second kill" activities need to be in a very short period of time to complete the statistics and show the results, of course, the prerequisite is the need to be in the background of the statistical system can withstand conditions.

Finally, borrow Avinash Kaushik a sentence as a summary: if only to see real-time data for real-time statistics, rather than based on real-time data to make corresponding action, then real-time data is quite expensive.

It's your turn, do you have any ideas on real time data analysis? Maybe I can make the real time data that I realized before produces more valuable results in addition to real-time monitoring, please comment.

This article uses»in agreement, reprint please specify the Source: Website data analysis» About real Time data statistics

Original address: http://webdataanalysis.net/personal-view/real-time-data/

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.