The leak this time is a 14 MB file containing about 131,000 (13.1万) highly accurate records.
It is honestly pretty scary. Even if you change your password, your ID number, mobile number, email address, and previously used passwords have all been exposed.
When I got the data on the 25th, I tried logging in with a few of the accounts (I didn't do anything bad, please don't arrest me), and every single one worked... It's horrifying...
To make the statistics easier, I wrote a console program that imports the data into a database.
1. Create the table
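    -- NOTE: the column lengths were lost in the source text.
    -- The lengths below are assumptions, not the author's original values.
    CREATE TABLE [dbo].[user12306] (
        [Email]    [varchar](100)  NULL,
        [Password] [varchar](100)  NULL,
        [Realname] [nvarchar](50)  NULL,
        [Idcard]   [char](18)      NULL,  -- 18-character ID number
        [username] [varchar](100)  NULL,
        [Phone]    [char](11)      NULL,  -- 11-digit mobile number
        [Email2]   [varchar](100)  NULL
    )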
2. The console program. I fill a DataTable and insert it in batches with SqlBulkCopy, which keeps the inserts fast.
    using System;
    using System.Configuration; // ConfigurationManager needs a reference to System.Configuration.dll
    using System.Data;
    using System.Data.SqlClient;
    using System.IO;
    using System.Text;

    namespace import12306
    {
        class Program
        {
            static void Main(string[] args)
            {
                // Key names match the App.config entries
                var filePath = ConfigurationManager.AppSettings["FilePath"];
                var connStr = ConfigurationManager.AppSettings["ConnectionStr"];
                var tableName = ConfigurationManager.AppSettings["TableName"];
                var insertSize = int.Parse(ConfigurationManager.AppSettings["Insertsize"]);
                try
                {
                    // Read the file as GB2312 text, one record per line
                    using (var sr = new StreamReader(filePath, Encoding.GetEncoding("gb2312")))
                    using (var sqlBulkCopy = new SqlBulkCopy(connStr, SqlBulkCopyOptions.UseInternalTransaction))
                    {
                        sqlBulkCopy.DestinationTableName = tableName;
                        var newDt = GetNewTable();
                        string lineStr;
                        var line = 0;
                        while ((lineStr = sr.ReadLine()) != null)
                        {
                            // Populate the DataTable; fields are separated by "----"
                            var ms = lineStr.Split(new[] { "----" }, StringSplitOptions.None);
                            if (ms.Length < 7) continue; // skip malformed lines
                            var row = newDt.NewRow();
                            row["Email"] = ms[0];
                            row["Password"] = ms[1];
                            row["Realname"] = ms[2];
                            row["Idcard"] = ms[3];
                            row["username"] = ms[4];
                            row["Phone"] = ms[5];
                            row["Email2"] = ms[6];
                            newDt.Rows.Add(row);
                            line++;
                            if (line % insertSize == 0)
                            {
                                Console.WriteLine("Writing records " + (line - insertSize) + " - " + line + ", " + DateTime.Now);
                                WriteBatch(sqlBulkCopy, newDt);
                                Console.WriteLine("Write complete, " + DateTime.Now);
                            }
                        }
                        // Write the final partial batch, which the loop above never flushes
                        if (newDt.Rows.Count > 0)
                        {
                            WriteBatch(sqlBulkCopy, newDt);
                            Console.WriteLine("Write complete, " + line + " records in total, " + DateTime.Now);
                        }
                    }
                }
                catch (Exception ex)
                {
                    Console.WriteLine(ex);
                }
                Console.ReadLine();
            }

            // Sends the buffered rows to the server, then empties the buffer
            private static void WriteBatch(SqlBulkCopy sqlBulkCopy, DataTable table)
            {
                table.AcceptChanges();
                sqlBulkCopy.WriteToServer(table);
                table.Clear();
            }

            // Builds an empty DataTable whose columns are in the same order as the database table
            private static DataTable GetNewTable()
            {
                var readTxt = new DataTable("Readtxt");
                var columns = new[] { "Email", "Password", "Realname", "Idcard", "username", "Phone", "Email2" };
                foreach (var name in columns)
                {
                    readTxt.Columns.Add(name, typeof(string));
                }
                return readTxt;
            }
        }
    }
Configuration file App.config
    <?xml version="1.0" encoding="utf-8"?>
    <configuration>
      <startup>
        <supportedRuntime version="v4.0" sku=".NETFramework,Version=v4.5" />
      </startup>
      <appSettings>
        <add key="ConnectionStr" value="Data Source=.\bu;Initial Catalog=ac12306;User Id=userid;Password=password;" />
        <add key="FilePath" value="E:\\12306.txt" /><!-- file path -->
        <add key="TableName" value="Usersss" /><!-- database table name -->
        <add key="Insertsize" value="+" /><!-- number of rows inserted per batch; the original number was lost in the source text -->
      </appSettings>
    </configuration>
There isn't much data, so the import runs in just a few seconds... Then take a look at the database, hehe [sly grin]...
"User Distribution"
Earlier analyses claimed this was not a leak from the site itself but the result of credential stuffing ("撞库"). Personally, judging from the data, it does not look like credential stuffing to me. Take the user age distribution, for example:
Going by the birth dates in the ID numbers, users born in the 1980s and 1990s are indeed heavily concentrated, but the other age groups are naturally distributed too. Sites whose data leaked earlier, such as CSDN, each have a very pronounced, concentrated user base, so accounts matched from those leaks would presumably show the same skew.
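The age grouping described above can be sketched roughly as follows. This is a minimal illustration, not the code used for the original chart: it assumes every Idcard value is an 18-character ID number, in which characters 7 to 14 hold the birth date as yyyyMMdd. The names IdCardStats, BirthDecade, and CountByDecade are hypothetical, and the sample values are made-up placeholders.

```csharp
using System;
using System.Collections.Generic;
using System.Globalization;

public static class IdCardStats
{
    // Returns the birth decade (for example 1980) encoded in an 18-character
    // ID number, or null when the value does not have the expected shape.
    public static int? BirthDecade(string idCard)
    {
        if (idCard == null) return null;
        idCard = idCard.Trim();
        if (idCard.Length != 18) return null;

        DateTime birth;
        // Characters 7-14 (1-based) are assumed to hold the birth date as yyyyMMdd.
        if (!DateTime.TryParseExact(idCard.Substring(6, 8), "yyyyMMdd",
                CultureInfo.InvariantCulture, DateTimeStyles.None, out birth))
        {
            return null;
        }
        return birth.Year / 10 * 10;
    }

    // Counts how many ID numbers fall into each birth decade,
    // silently skipping values that cannot be parsed.
    public static SortedDictionary<int, int> CountByDecade(IEnumerable<string> idCards)
    {
        var counts = new SortedDictionary<int, int>();
        foreach (var id in idCards)
        {
            var decade = BirthDecade(id);
            if (decade == null) continue;
            int current;
            counts.TryGetValue(decade.Value, out current);
            counts[decade.Value] = current + 1;
        }
        return counts;
    }

    public static void Main()
    {
        // Made-up placeholder values, not real ID numbers.
        var sample = new[]
        {
            "000000198505150000",
            "000000199212010000",
            "000000199003070000",
            "bad"
        };
        foreach (var pair in CountByDecade(sample))
        {
            Console.WriteLine(pair.Key + "s: " + pair.Value);
        }
    }
}
```

Grouping by decade keeps the output to a handful of buckets. The same substring could instead be grouped by exact year in a database query if finer detail is needed.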
"Rant" Speaking of age, the database contains a baby born in March 2014. Your family is seriously hardcore about grabbing tickets: not even a year old and already has an account. I can't even!
Next, a look at the zodiac signs. You could say... (crossed out), ahem, no judgment here, this is for reference only. Then, which provinces' people love to travel around the most?
Time to get off work. To be continued...
After getting the 12306 data: a non-technical rant