Read about oreilly python for data analysis, The latest news, videos, and discussion topics about oreilly python for data analysis from alibabacloud.com
Peacock City Burton Manor Villa owners anxious to sell a key at any time to see the room 7.584 million Yuan/M2 5 Room 2 Hall 315m2 a total of 3 floors 2014 built Tian Wei-min Chaobai River Peacock City Burlington Manor (Villa) Beijing around-Langfang-Houtan line ['Matching Mature','Quality Tenants','High Safety'] gifted mountain Beautiful ground double Garden 200 draw near Shunyi UK* See at any time 26,863,058 Yuan/m2 4 Room 2 Hall 425m2 total 4 stories built in 2008 Li Tootto Yosemite C Area S
1.1. Pandas Analysis steps
Loading data
COUNT the date of the access_time. SQL similar to the following:
SELECT date_format (access_time, '%H '), COUNT (*) from log GROUP by Date_format (access_time, '%H ');
1.2. Code
Cat pd_ng_log_stat.py#!/usr/bin/env python#-*-Coding:utf-8-*-From Ng_line_parser import NglineparserImport Pandas as PDImport socketImport str
Pandas is the preferred library for subsequent content in this book. The pandas can meet the following requirements:
Data structure with automatic or explicit data alignment by axis. This prevents many common errors caused by data misalignment and data from different data
Python data analysis (Basic)First, install the anaconda:https://www.anaconda.com/download/#windowsIi. NumPy (Basic package of scientific calculation)Three, matplotlib (chart)Iv. SciPy (collection of packages for solving various standard problem domains in scientific calculations)V. Pandas (Treatment of structured data)
specified axisTake and PutRemove a specific element from an arraybroadcasting/BroadcastSpreads along the x-axis and spreads along the y-axis with subtle differencesNp.newaxis () Add new axisAdvanced Ufunc UsageUfunc is the abbreviation for universal function, which is a function that can operate on each element of an array, numpy many of the Ufunc functions are implemented at the C level, so they are computationally fast
Np.add.reduce: Add and operate
Np.add.accumulate: Similar to
variable (local).-python called namespaces-The following functiondef func(): a = [] for i in range(5): a.append(i)-After Func is called, the empty list A is created first, then 5 elements are added, and a is destroyed when the function exits-If we define a as followsa = []def func(): for i in range(5): a.append(i)-Although you can assign a global variable to a function, those variables must be declared as global variables with
Hierarchical Indexes Hierarchical indexing means you can have multiple indexes on an array, for example: a bit like a merged cell in Excel, right?Select a subset of the data based on the index to select a subset of the data from the other layer:Select data in the same way as the index in the layer:Multi-index series conversion to Dataframe hierarchical indexes pl
Http://www.cnblogs.com/batteryhp/p/4868348.htmlChapter I preparatory workStarting today the book-"Data analysis using Python". Both R and Python have to be used, which is the reason for the code book. First, according to the book said to install, Google downloaded Epd_free-7.3-1-win-x86.msi, the translator proposed to
The Basic course has not finished, it came to this, because my usual research is based on data processing. Who says the woman is inferior to the male 650) this.width=650; "src=" Http://img.baidu.com/hi/jx2/j_0011.gif "alt=" J_0011.gif "/>do your own things well done carefully, Hee 650) this.width=650; "src=" Http://img.baidu.com/hi/jx2/j_0003.gif "alt=" J_0003.gif "/>Read the introductory section, download the dat
1, remove the label of the page, such as from Import beautifulrsoup predata=beautifulsoup (data,'html.parser'). Get_text ()2. Remove punctuation, etc., with regular expressions.Import RE#表示将data中的除了大小写字母之外的符号换成空格preData=re.sub (R'[^a-za-z]',' , data)3. Lowercase the words in the text and separate the data with a space
element is the index of the item whose index number is smaller than the previous one. So we see that the value of index 2,3 is 1, and the value of index 1
If you want to use the element following the newly inserted index, you need to use the Bfill method
The replacement index can be extended from series to dataframe, not only to replace the row index, but also to replace the column index or even replace both
Second, delete
① Deleting a series Pandas specificall
Python Data Analysis Prerequisites:1.Anaconda operationFirst, you should set the local data directory as the working directory, so that you can load the local data set into memoryImport Osos.chdir ("d:/bigdata/workspace/testdata/"# Sets the current path to the working path O
python data analysis requirements are not software development requirements , indeed, for a tool, different purposes of the user, the required skills are not the same, such as knife This tool, the butcher used it to kill pigs, the chef used it is cut vegetables, military use it is defend, the guests use it is cut steak, Everyone uses different ways, there are spe
???IndexP.asfreq (' M ', ' Start ') #将年度数据转换为月度的形式, converted to the month of the yearP.asfreq (' M ', ' End ') #将年度数据转换为月度的形式, converted to December of the yearP1=PD. Period (' freq= ', ' A-jun ')P1.asfreq (' m ', ' Start ') #Period (' 2015-07 ', ' m ')P1.asfreq (' m ', ' End ') #Period (' 2016-06 ', ' m ')P2=PD. Period (' 2016-09 ', ' M ')P2.asfreq (' A-jun ') #2016年9月进行频率转换, equivalent to 2017 years in the time frequency ending in JuneRng=pd.period_range (' 2006 ', ' freq= ', ' A-dec ')Ts=ser
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.