Data set: Download
Absrtact: MICROBLOGPCU is collected from Sina Weibo, which can be used to study machine learning methods and social relationships.
This data set is used by the original author to explore spammers in Weibo (the person who sent the spam message), and their demo is here
Property information for the dataset:
Weibo_user.csv
-USER_ID: User ID
-user_name: User Nickname
-gender: Sex, Male,female,other
-class: Account Level
-message: Account registration location or other personal information
-post_num: Zip Code
Number of-follower_num:followers
Number of-followee_num:followee
-follow Ratio:followee_num/follower_num;
-is_spammer:manually Annotated label, 1 means spammer,0 represents non-spammer;
User_post.csv
-POST_ID: The ID of Weibo
-post_time: Release Time
-POSTER_ID: The ID of the publishing user
-repost_num: Number of forwards
-commnet_num: Number of reviews
Followe-followee.csv
-follower:the nickname of follower;
-follower_id:the user ID of follower;
-followee:the nickname of Followee;
-followee_id:the User ID of Followee;
post.csv and user_post.csv are similar, and the post in it is retrievalled by a certain key word related to a topic;
Take out some of the data to make a diagram
Install.packages (' igraph ') library (igraph) follower_followee<-read.csv (' Microblogpcu/follower_followee.csv ') Follower_followee_part<-follower_followee[1:1000,]gg<-graph.data.frame (Data.frame (er=follower_followee_ part$follower_id,ee=follower_followee_part$followee_id) plot (GG, Vertex.label=na, # #不显示标签 Edge.arrow.mode= '-', # #不使用箭头 vertex.size = 5 # #设置结点圆的大小 )
Reference: Https://archive.ics.uci.edu/ml/datasets/microblogPCU
Reprint Please specify source: http://blog.csdn.net/zhyoulun/article/details/46442899
[Data set] Sina Weibo data set MICROBLOGPCU