See "Cao Peng seo-search engine optimization" Video tutorial Notes

Source: Internet
Author: User

First of all, the most worthwhile learning is not the knowledge of the video, but the speakers of these words

1. All aspects of SEO in this video are involved, but listening to it will increase your insight, but it will take more time to tap into more knowledge and systematize your knowledge. Of course, it is very useful for learning all knowledge.

2.SEO is an avant-garde and very active subject, it is expensive in the new, your experience in the accumulation of information in a timely manner. A lot of computer software knowledge is this, on the one hand to a solid foundation, on the other hand do not forget to follow the update of knowledge

The following notes are very incomplete, so to want to learn SEO knowledge system, need to take their own initiative through a variety of ways to learn the principle of search engine, the principle of crawler work ... More knowledge of expansion

SEO: Search engine optimization

SEO The biggest benefit: traffic. Find information on the Internet, more and more people are through search engines, 80% of the people will only look at the first page of search results, 40% people will only see the first page of the top four, only about 20% people will turn a few pages back, if your site search results compared, then your site's access to traffic will be greatly affected

SEO Purpose: In order to make it easier for netizens to find your website or webpage


General Introduction : The introduction of the search engine (focus on Google), search engine optimization (how the search engine crawler crawling Internet network, search engine is how to sort the search results; what is preferable seo, what is not advisable SEO Natural rankings vs. bid rankings)

optimization strategy : keywords; keywords tool; Web page analysis; Search engine submission

Note : Domain name, link, use of flash, CSS, end

Use Google search time search rules, tips: + 、-、 ""

! "The anatomy of a large-scale hypertextual Web search engine", written by Google's founders, explains the secrets of search engines, and it's sure to benefit you.

! Google's sorting method, PageRank

Yellow Pages and search engines: Yellow Pages are artificially compiled, slow number of updates, search engines are automatically crawling; search engine retrieval is the Web page, Yellow Pages Search is the site, compared with the search engine, the Yellow pages are included in the threshold is relatively high; yellow pages for the search engine to provide data, the yellow pages are good, And there are still a lot of people using the Yellow pages. So do SEO can not ignore the yellow pages of the problem

Content ads and search ads:

Search Engine crawler:

Webpage Snapshot:

How search engines rank pages. It basically looks at three things: 1. Web content; 2. Frequency and concentration of keywords appearing; 3. Popularity of websites

White hat SEO: Look carefully at Google's Webmaster Support Center, and constantly updated. Some things that should not be done, if done, affect the site's search rankings.

Black hat seo: Use cheat method to achieve search engine ranking optimization, do not do this

What is a keyword: when you search, what you enter in the input box is the keyword, and for a website, the words that are most relevant to the most concise description of your site's content are keywords.

Keyword selection suggestions: First list some of your own keywords; view your website stats or server logs; Refer to other people's opinions (potential customers, colleagues ...). ); Use optimization tools

Stop words: Words that are too often used, have no clear meaning, are ignored by search engines, such as the ...

Long Tail theory

Keyword tools: wordtracker tools, keyword Discovery tools,

HTML and SEO have a big relationship:
1. The title of the Web page, <title> tag, for SEO is the most important, try to include this page of keywords, tell others what this page is to do, not too long or too short. Intitle: operator
2.META tag Keywords (keyword) and description (description), because many people abuse these two tags, wrote too much stuff in it, so search engines are increasingly not recognizing these two things
3. The 4. The text of the Web page contains more keywords the better, as much as possible but does not affect people's reading
5. Pictures in the Web

Submit website URL to search engine; Submit website URL to yellow Pages

How to choose a domain name
1. If the domain name contains keywords, it will greatly improve the ranking

Reverse link Lookup: Google link: keywords; link survey software

! Dynamic Web pages, should be in the Web address as far as possible to avoid the appearance of, =, & symbols, the default URL of the dynamic website by writing programs to the search engine has a good format

Robot.txt file is placed in the root directory of the site, tell the search engine crawler, this site is not willing to be crawled to the directory, content

Reprint "Detailed search engine working principle"

A qualified SEO engineer, will understand the working principle of the search engine, the principle of Baidu and Google almost almost, just some of the details of the different, such as word segmentation technology, because the domestic search is generally Baidu, so we will be targeted to the course of Baidu, of course, the basic class is just the same applies to Google!

The working principle of the search engine is actually very simple, first search engine roughly divided into 4 parts, the first part is Spider Crawler, the second part is the data analysis system, the third part is the index system, the fourth is the query system, of course, this is only the basic 4 parts!

Below we talk about the search engine workflow:

What is a search engine spider, what is a reptile program?

Search engine spider program, in fact, is a search engine automatic application, its role is what? In fact, it is very simple, is to browse the information in the Internet, and then the information are crawled to the search engine server, and then set up an index library and so on, we can be a search engine spider as a user, and then this user to visit our site, and then the content of our site to save to their own computer! Better understand.

How does a search engine spider crawl a webpage?

Find a link → download this page → add to temporary library → Extract the links in the Web page → on the download page → Loop

First, the search engine spiders need to find links, as to how to find the simple, that is, link links through links. Search engine spiders found this link will be downloaded and stored in the temporary library, of course, at the same time, will be extracted from the page all the links, and then the loop.

Search engine spiders are almost 24 hours without rest (here for it to feel tragic, no vacations.) Ha ha. What about the Web pages that spiders download back? This requires the second system, which is the search engine analysis system.

Do spiders crawl Web pages in search engines regularly?

This question asks good, then the search engine spider crawls the webpage to have the law exactly? The answer is YES!

If spiders go to crawl Web pages, then the cost of death, the Internet on the web, every day to increase so much, how can spiders crawl over it? So, spiders crawl the web is also a regular!

Spider Crawl Web Strategy 1: Depth First

What is depth first? Simply put, is the search engine spider on a page to find a connection and then climb down the connection, and then on the next page to find a connection, and then crawl down and all crawl, this is the depth-first crawl strategy. Everybody, look.

In is the depth first, we if the Web page A in the Search engine authority is the highest, if the authority of the D Web page is the lowest, if the search engine spiders in accordance with the depth of the first strategy to crawl the Web page, then will be the reverse, is the D page of the authority to the highest, this is the depth of priority!

Spider Crawl Web Strategy 2: Width first

Width first better understanding, is the search engine spiders first the entire page of the link to crawl all at once, and then in the grasp to remove the entire link of a page.

, it is the width first! This is actually what we usually said flat structure, we may be in a mysterious corner to see an article, warn you, the page layer can not be too much, if too many will lead to the inclusion of difficult, this is to deal with the search engine spider width priority strategy, in fact, this reason.

Spider Crawl Web Strategy 3: Weight Priority

If the width of priority than depth first good, in fact, is not absolute, can only say that each has its own advantages, now search engine spiders are generally two crawl strategies together, that is, depth priority + width first, and in the use of these two strategies to crawl, to refer to the weight of this connection, if the weight of the connection is good, Then use depth first, if the weight of the connection is very low, then the width first!

So how does the search engine spider know the weight of this connection?

Here are 2 factors: 1, the level of more and less, 2, the connection of the chain of how much and quality;

So if the hierarchy of too many links is not to be crawled it? This is not absolute, here to consider a number of factors, we in the back of the advanced level will be reduced to a logical strategy, then I in detail to everyone to say!

Spider Crawl Web Strategy 4: Revisit crawl

I think this is a better understanding, that is, for example, yesterday's search engine spiders to crawl our web page, and today we add a new content in this page, then search engine spiders today to crawl new content, this is the re-visit crawl! The revisit crawl is also divided into two, as follows:

1. Full re-visit

The so-called full revisit refers to the spider last crawl of the link, and then in this one months one day, all re-visit crawl once!

2. Single re-visit

A single revisit is generally for a page to update the frequency of relatively fast more stable pages, if we have a page, 1 months is not updated.

So the search engine spider the first day you are like this, the next day, or this look, then the third day the search engine spiders will not come, will be in a period of time to come, such as every 1 months in one, or when all revisit the time in the update.

The above, is the search engine Spider crawl Web strategy! So we said above, in the search engine spider to crawl back the Web page, began the second part, that is, the data analysis of this part.

Data Analysis System

Data analysis system, is to deal with the search engine spider crawl back to the Web page, then the data analysis this piece is divided into a few:

1, the structure of the Web page

Simply put, the HTML code is all erased, extract the content.

2. Noise Cancelling

What does noise elimination mean? In the structure of the Web page, has deleted the HTML code, left the text, then the noise is to leave the theme of the page, delete useless content, such as copyright!

3. Check the weight

Check the weight is good understanding, is the search engine to find duplicate pages and content, if you find duplicate pages, delete.

4. Participle

Participle is God horse thing? is the search engine spider in the previous steps, and then extract the content of the text, and then divide our content into N words, and then arranged out, into the index library! It also calculates how many times this word appears on this page.

5. Link analysis

This step is what we usually do to do fidgety work, search engine will query, how many backlinks of this page, how many links are exported and the chain, and then give this page how much weight and so on.

Data indexing System

After the above steps, the search engine will put the processed information into the search engine index library. Then the index library is roughly divided into the following two systems:

Positive row Index System

What is a positive row index? To put it simply, the search engine adds a number to all URLs, and this number corresponds to the content of the URL, including the URL's outer chain, keyword density, and so on.

Simple working principle of search engine overview

Search engine spider discovery connection → Crawl Web page According to Spider's Crawl strategy → then hand it to the analysis system-Analyze page → build Index Library

OK, this class is finished. It's not easy. I, today is just a simple talk about search engine work, because the search engine of a very complex system, not a few 10 minutes can be a full range of sermons, we will be in advance or advanced tutorial slowly talk about!

See "Cao Peng seo-search engine optimization" Video tutorial Notes

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.