I used the Python crawler to find the ID of the sister paper that I didn't want to tell me about her ID ....
After I had done this, she felt that I knew her more and more, hehe.
One day, I found that my long-awaited sister paper in the circle of friends to share the story of the column, I know she also brush know. If I focus on her, I'll know what she's been up to, what she's been thinking about, what she likes and what it's like to talk about, and it's amazing (* ^^) v.
But enter her name ... You can't find it at all, okay? (? ' Know?)
When we were two, we talked about the article she shared,
I naturally said: "You do not know what you use the real name Ah, I am so naïve to use the real name."
She smiled and said: "That can be changed,"
"What do you know about the team not letting me change it!!!", I replied, "Why don't we just ^_-each other?"
Hey, so she opened Zhihu, looked at my homepage, and did not pay attention to me ... Maybe it's too little. Not up to her request, or she does not want me to know what she is looking at, perhaps she wants to know is Shanta, do not want to be seen by people around ... (?-﹏-?) Disappointed.
I went back to think, she said the name can be changed, then she may have used the real name, found a flaw!
Know the name can be changed, but the ID is not changed!
Everyone's homepage address, people behind that is TA's ID,
Http://www.zhihu.com/people/zhang-san-12-45
For example, Zhang San has many names, and numbers are added later. Her name is the same as more pinyin, I tried, this number is not more than 100. It is composed of Zhang-san, zhang-san-1 zhang-san-12-43 and so on.
OK, now I can start looking for her account! Now that she has changed her name, the condition must be: the nickname's pinyin is not the real name. This with the Pypinyin module can be solved, this way, I need to manually view the page is much less.
- Download @egrcc Zhihu-python on GitHub
- Looking for her, ing
# Coding:utf-8 fromZhihuImportUser fromPypinyinImportPinyin, Lazy_pinyinImportPypinyinuser_url ="'USER_ID ="'L = [u ' bu ',u ' Xu ',u ' kan ']#这里是她名字的拼音, or do not expose her good, (*/ω\*) forNuminchRange -):search within #先在-100 Try: User_url =' http://www.zhihu.com/people/bu-xu-kan-'+ str (num) user = user (user_url) user_id = user.get_user_id ()ifL! = Lazy_pinyin (User_id.decode (' GBK ')):#看看她有没有用原名 PrintUSER_ID,"'Numexcept:Pass forIinchRange -): forJinchRange -):search within #在-100-100 Try: User_url =' http://www.zhihu.com/people/bu-xu-kan-'+ STR (i) +'-'+ STR (j) user = User (user_url) user_id = user.get_user_id ()PrintUSER_ID,"'I'-'Jexcept:Pass
Climbed for a long time, the results came out, these nicknames are not many, I turned over their homepage and fortunately found my favorite sister paper:
xxxxxxxx 26 XXXXXXXX 27 XXXXXXXX 42 XXXXXXXX 72 XXXXXXXX 94 she is here! 6 -36 XXXXXXXX 6 -76 XXXXXXXX 7 - 86 XXXXXXXX 10 -35 XXXXXXXX 28 -67 XXXXXXXX 32 -28 XXXXXXXX 32 - 66 XXXXXXXX 34 -75
Since then, I can see her homepage every day ~ As to whether I have caught her ....
After I opened her homepage, I found that she liked sci-fi, was also interested in the fiction, and paid attention to dressing, which was in my appetite. Lately, she's been paying more attention to emotional issues, and I don't know if it's because I've been getting in touch with her lately and it's aroused her feelings, (/ω\)
I will refuel ~
Python crawler crawls user information that is user-aware