I like to stay alert when I browse the Web. Staying alert means looking at many elements, each of which provides information. An isolated element may be unimportant, but the information assembled from several elements can often be valuable. So, starting from these elements:
Never ignore URLs
The URL is an important piece of information, and an analyst with professional sensitivity never ignores the characteristics of a URL. A URL carries clues about the site. Straight to the examples:
$ The domain name in the URL
Many people are cheated on the Internet, and a big reason is that they never pay attention to the domain name in the URL.
For example, http://www.taobao.ipx32.com/about.html looks like a crude deception, yet it leads many netizens to mistake it for the Taobao site and fall into the trap. This page belongs to the ipx32.com domain, not to Taobao.
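The check described here can be sketched in a few lines of Python. This is only a rough illustration: the helper name `naive_registered_domain` is made up for this example, and it assumes a single-label suffix such as .com. Suffixes like .com.cn need a public-suffix list, which the sketch leaves out.

```python
from urllib.parse import urlsplit

def naive_registered_domain(url: str) -> str:
    """Return the last two labels of the host name.

    Assumption: the site sits under a single-label suffix such as .com.
    Multi-label suffixes like .com.cn are not handled by this sketch.
    """
    host = urlsplit(url).hostname or ""
    labels = host.split(".")
    return ".".join(labels[-2:])

url = "http://www.taobao.ipx32.com/about.html"
print(naive_registered_domain(url))  # ipx32.com
```

Reading the host name from right to left is the point: "taobao" appears only as a subdomain label, so it says nothing about who owns the site.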
While browsing, always pay attention to the domain name in the page URL: it tells you whether clicking a link takes you away from the current site, and lets you focus on the features of the new domain. Cross-domain links are common in marketing for major clients, and different domains mean that webmasters or advertisers use different monitoring tools and strategies. For example, IT portals often host feature pages for hardware vendors such as Intel, Asus, and HP (possibly carrying Google Analytics or Nielsen monitoring code), with links that lead visitors to a new domain such as intel.com.cn or hp.com.cn (possibly carrying Omniture, HBX, or WebTrends monitoring code). This kind of cross-domain URL is obvious.
$ The URL contains parameters
It's not uncommon for a URL to include parameters, but each parameter has a meaning, and paying attention to them helps you understand the features of the site more fully. For example:
http://www.chinawebanalytics.cn/?p=917
This is the address of Sidney's new blog; from it you can work out that the post ID is 917.
http://adsclick.qq.com/adsclick?oid=1112901&loc=QQ_SX_JY_Test6&url=http://www.52-abc.com/
This is an ad link on the right side of the QQ home page; its rich parameters reveal how the ad slots are named and the destination address of the jump.
http://www.soso.com/q?sp=S&sc=web&cid=w.q.in.sb.web&ty=1&bn=&op=entry&kw=&w=WA
This is the results page for "WA" on Tencent's search engine, which tells us that the search keyword is passed in the w parameter.
Some other parameters may not be understandable at first glance; if needed, a few more test requests will usually pin them down.
In short, paying attention to URL parameters is a basic skill for an analyst.
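As a small sketch of how the parameters in the example URLs above can be pulled apart, Python's standard `urllib.parse` module is enough. The helper name `query_params` is made up for this illustration, and what each parameter actually means still has to be worked out by observation.

```python
from urllib.parse import urlsplit, parse_qs

def query_params(url: str) -> dict:
    """Return query parameters as {name: [values]}, keeping empty values."""
    return parse_qs(urlsplit(url).query, keep_blank_values=True)

search = ("http://www.soso.com/q?sp=S&sc=web&cid=w.q.in.sb.web"
          "&ty=1&bn=&op=entry&kw=&w=WA")
params = query_params(search)
print(params["w"])   # ['WA']
print(params["kw"])  # ['']

blog = "http://www.chinawebanalytics.cn/?p=917"
print(query_params(blog)["p"])  # ['917']
```

Keeping blank values matters here: an empty `kw=` next to `w=WA` is itself a clue that the keyword travels in `w`.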
$ The file type in the URL
The URL contains information about the file type.
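The extension can be read off the URL path mechanically. Here is a minimal sketch, with a made-up helper name `url_extension`; it only looks at the last path segment and makes no claim about what the server really runs.

```python
import posixpath
from urllib.parse import urlsplit

def url_extension(url: str) -> str:
    """Return the lower-cased extension of the URL path, or '' if none."""
    path = urlsplit(url).path
    return posixpath.splitext(path)[1].lower()

print(url_extension("http://www.taobao.ipx32.com/about.html"))  # .html
print(url_extension("http://www.ectend.com/index.php"))         # .php
print(repr(url_extension("http://www.ectend.com")))             # ''
```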
html/htm: the page has been made static, which makes it easier for search engines to crawl. Most portals and CMS systems now offer static publishing, and some simple personal pages also use HTML file names.
jsp/php/aspx/asp: this is a dynamic page whose source files live on the server side; you can look up the characteristics of each of these file types yourself.
No type: some addresses, such as http://www.ectend.com, rely on the home page configured on the server; the effect is actually the same as http://www.ectend.com/index.php.
.do: http://www.ems.com.cn/Qcgzoutqueryaction.do?reqcode=gotosearch indicates a site developed with JSP. You don't need to know this in full; it is only a supporting clue. Sites like this generally have some technical content, but their technical strength is neither great nor advanced; they are common on the functional pages of state-backed enterprises, or in site back ends. Good sites today hide such URLs well on their front-end pages, so they don't appear.
In addition, some sites built on open source projects have distinctive URLs, such as WordPress: http://www.ectend.com/index.php/excellent-analytics/; or zh.wikipedia.org/wiki/wikipedia: Home.
$ URL naming features
The way a URL is named can also reveal strange or interesting things, such as:
Taobao's URLs are very interesting: they are based on Base64 encoding (thanks for the guidance that corrected my original error; it was very rewarding and shows once again how much is hidden in a URL) and contain a lot of "-" characters. I presume that each "-" delimits a variable, and that when there is no value between two of them, the variable is undefined.
http://list.taobao.com/browse/50018957-50018960/ N-1-1---------------------0---------Yes---------------------2-------b--40--commend-0-all-50018960.htm?ssid=r18 ? ad_id=&am_id=&cm_id=&pm_id=
Product URLs can also be very distinctive:
http://www.vancl.com/Product_1E10000/RuanNiuPiXiDaiXiuXianXie+HeiSe.html is the product name in pinyin plus the color. We can also spot carelessness by their staff: Chinese full-width brackets （） are different from English brackets (), and only the English ones can appear as-is in a URL, while the Chinese ones are converted to percent-encoded form. Evidently VANCL staff follow no unified standard when entering products:
http://www.vancl.com/CategoryList-1440-1--1/GaoJiMianTangQuanMianChenShan%EF%BC%88ZunGuiKuan%EF%BC%89.html
http://www.vancl.com/CategoryList-1324-1--1/ShangWuXiuXianKu(Biaozhunkuan).html
Web page source file
For any page that makes you curious, be sure to view its source. A quick browse of the source lets you judge the technical strength of the site, the quality of its designers, and which monitoring tools are deployed and how. Pay attention to the following four points:
Whether the code is clean or redundant
Which monitoring tools are deployed
The location and order of the deployed monitoring code
Whether any suspicious code has been added
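The second and third points can be roughed out in code. The sketch below scans a page's source text for marker strings and reports where each one first appears. The marker strings, the `MARKERS` table, and the sample page are assumptions made for illustration; real deployments vary, so treat any hit only as a hint to read the source more closely.

```python
# Hypothetical marker strings (assumptions); real deployments vary.
MARKERS = {
    "Google Analytics": ["google-analytics.com"],
    "Omniture": ["omniture", "s_code.js"],
    "WebTrends": ["webtrends"],
}

def detect_monitoring_tools(html: str) -> dict:
    """Map each detected tool to the offset of its first marker in the source."""
    lowered = html.lower()
    found = {}
    for tool, markers in MARKERS.items():
        positions = [lowered.find(m) for m in markers if m in lowered]
        if positions:
            found[tool] = min(positions)
    return found

# Made-up sample page used only to exercise the function.
sample = """<html><head>
<script src="http://www.google-analytics.com/ga.js"></script>
</head><body>...
<script src="/js/s_code.js"></script>
</body></html>"""

# Print tools in the order their code appears in the page.
hits = detect_monitoring_tools(sample)
for tool, offset in sorted(hits.items(), key=lambda kv: kv[1]):
    print(tool, offset)
```

Sorting by offset gives the deployment order within the page, which is the "location and order" question from the list above.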