Using a Python crawler to scrape Zhihu user information

Source: Internet
Author: User

I used a Python crawler to find the Zhihu ID of a girl who didn't want to tell me her ID ....

After I did this, she felt that I understood her better and better, hehe.

One day I noticed that the girl I had long admired had shared a Zhihu column article in her circle of friends, so I knew she browses Zhihu too. If I followed her, I would know what she has been reading lately, what she is thinking about, what she likes, and which topics she enjoys talking about. How wonderful (*^^)v.
But when I searched for her name ... nothing came up at all!
When the two of us were chatting, we talked about the article she had shared.
I casually said: "So you don't use your real name on Zhihu. I was naive enough to use mine."
She smiled and said: "You can change it."
"The Zhihu team won't let me change it!!!" I replied. "Why don't we follow each other? ^_-"
So she opened Zhihu, looked at my profile page, and did not follow me ... Maybe I had too little there to meet her standards, or she did not want me to know what she reads, or perhaps she wants Zhihu to be her own space and does not want to be seen by the people around her ... Disappointed.

I went back and thought it over: she said the name can be changed, so she may once have used her real name. I had found a loophole!
A Zhihu display name can be changed, but the ID cannot!
In everyone's profile page address, the part after people/ is that user's ID:
http://www.zhihu.com/people/zhang-san-12-45
For example, many people are named Zhang San, so numbers are appended to the ID. Many people share the pinyin of her name as well; I tried, and the numbers never go above 100. IDs take forms such as zhang-san, zhang-san-1, zhang-san-12-43, and so on.
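
To make the ID format concrete, here is a minimal sketch that splits a profile address of the form described above into its pinyin syllables and its numeric suffixes. The function name `split_profile_id` is hypothetical, and the sketch assumes the `/people/<id>` URL layout quoted in this post; it only parses a string and does not contact the site.

```python
from urllib.parse import urlparse


def split_profile_id(url):
    """Split a profile URL into (pinyin syllables, numeric suffixes).

    Hypothetical helper: assumes the /people/<id> layout quoted in the post.
    """
    path = urlparse(url).path  # e.g. "/people/zhang-san-12-45"
    segments = [s for s in path.split("/") if s]
    if len(segments) < 2 or segments[0] != "people":
        raise ValueError("not a profile URL: %r" % url)
    parts = segments[1].split("-")
    # Non-numeric parts are the name syllables; numeric parts are the suffixes.
    name = [p for p in parts if not p.isdigit()]
    numbers = [int(p) for p in parts if p.isdigit()]
    return name, numbers


if __name__ == "__main__":
    print(split_profile_id("http://www.zhihu.com/people/zhang-san-12-45"))
```

An ID with no appended numbers, such as zhang-san, simply yields an empty list of suffixes.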

OK, now I can start looking for her account! Since she has changed her name, the condition must be that the pinyin of the nickname differs from the pinyin of her real name. The pypinyin module handles this check, so there are far fewer pages for me to look through by hand.

    1. Download @egrcc's zhihu-python from GitHub
    2. Look for her ...

# coding: utf-8
from zhihu import User
from pypinyin import lazy_pinyin

user_url = ''
user_id = ''
l = [u'bu', u'xu', u'kan']  # the pinyin of her name goes here; better not to expose her (*/ω\*)

for num in range(100):  # first search suffixes of the form -N, N below 100
    try:
        user_url = 'http://www.zhihu.com/people/bu-xu-kan-' + str(num)
        user = User(user_url)
        user_id = user.get_user_id()
        if l != lazy_pinyin(user_id.decode('gbk')):  # check whether she still uses her real name
            print user_id, ' ', num
    except Exception:
        pass  # no such profile, or the request failed: skip it

for i in range(100):
    for j in range(100):  # then search suffixes of the form -I-J, both below 100
        try:
            user_url = 'http://www.zhihu.com/people/bu-xu-kan-' + str(i) + '-' + str(j)
            user = User(user_url)
            user_id = user.get_user_id()
            print user_id, ' ', i, '-', j
        except Exception:
            pass  # no such profile, or the request failed: skip it

The crawl ran for a long time before the results came out. There were not many of these nicknames; I looked through their profile pages and luckily found the girl I like:

XXXXXXXX 26
XXXXXXXX 27
XXXXXXXX 42
XXXXXXXX 72
XXXXXXXX 94
she is here! 6-36
XXXXXXXX 6-76
XXXXXXXX 7-86
XXXXXXXX 10-35
XXXXXXXX 28-67
XXXXXXXX 32-28
XXXXXXXX 32-66
XXXXXXXX 34-75

Since then, I have been able to look at her profile page every day ~ As for whether I have won her over ....

After opening her profile page, I found that she likes science fiction, is also interested in novels, and follows topics about dressing well, all of which suits my taste. Lately she has been following more relationship questions, and I wonder whether that is because I have been in touch with her more and it has stirred her feelings (/ω\)

I'll keep at it ~
