Baidu Webmaster College a few days ago released a case, which mentioned some details, especially worthy of webmaster attention.
It this page to crawl to do the optimization, directly to the picture binary content into the HTML resulting in page length too long, size 164K, resulting in content is not included in Baidu.
650) this.width=650; "src=" http://upload.admin5.com/2017/0621/1498025417891.jpg "border=" 0 "alt=" Baidu: page length greater than 128k will affect or even not included "style=" border:0px; "/>
Baidu: page length greater than 128k will affect or even not included
Web site if the needle crawler optimization, then the length of the Web page should be within 128K, not too long. Otherwise the crawler crawl content, the page content is truncated, the captured part can not be identified to the main content, resulting in the page is considered empty short and not included.
The implication, this may be Baidu technical defects caused, if the Web page in 128K above, crawler can not crawl can not be included . If you webmaster site content is too long, try to delete some of the less important information to ensure that the content included.
Baidu engineers suggest:
1, do not recommend the site use JS generated main content, such as JS rendering error, it is likely to lead to page content reading error, page can not crawl
2, such as the site for Crawling crawl optimization, recommended page length within 128k, not too long
3, the crawler to optimize for crawling, please put the subject content in front, to avoid the capture of truncation caused by the content crawl is not complete
Source: Lu Songsong Blog, Welcome to share, (qq/:13340454)
Original address: http://lusongsong.com/blog/post/8966.html
Baidu: page length greater than 128k will affect or even not included