Dataframe common operations

Last Update:2020-06-11 Source: Internet

Author: User

Keywords: dataframe, pandas dataframe, dataframe operations

DataFrame is a data structure in pandas, a Python library. It is similar to a table. This article introduces some common DataFrame operations.

First, create a DataFrame, select columns, delete columns

import pandas as pd
lst = [2, 3, 5]  # one row of the df
df = pd.DataFrame(data=[lst, lst], columns=['col1', 'col2', 'col3'])  # build the df from a list of rows
df = pd.DataFrame(data={'col1': [2]*2, 'col2': [3]*2, 'col3': [5]*2})  # build the same df from a dictionary of columns

df[['col1', 'col2']]  # select the first and second columns
df[1:][['col1', 'col2']]  # select the first and second columns, from the second row to the last row

df.drop('col1', axis=1)  # returns a copy without the first column; df itself is unchanged
df.drop(['col1', 'col2'], axis=1)  # returns a copy without the first and second columns
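
The selection and deletion steps above can be sketched as one runnable snippet. It uses three rows instead of two so that the row slice is not trivial; the variable names (`subset`, `tail`, `dropped`) are illustrative and not from the original.

```python
import pandas as pd

row = [2, 3, 5]
df = pd.DataFrame(data=[row, row, row], columns=["col1", "col2", "col3"])

subset = df[["col1", "col2"]]                 # two columns, all rows
tail = df[1:][["col1", "col2"]]               # same columns, second row onward
dropped = df.drop(["col1", "col2"], axis=1)   # new DataFrame; df is not modified

print(subset.shape)
print(tail.shape)
print(list(dropped.columns))
print(list(df.columns))
```

Because `drop` returns a new object by default, the result has to be assigned (for example `df = df.drop(...)`) if the deletion should persist.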

Second, operate on one or more columns
1. Use map to operate on a column

df['col'] = df['col1'].map(lambda x: x**2)  # add a column 'col' holding the square of 'col1'
2. Use apply to operate on one or more columns

df.index = pd.date_range('20190101', periods=len(df))  # replace the original index with dates; periods must equal the number of rows
df['col'] = df.apply(lambda x: x['col1']*x['col2'], axis=1)  # overwrite 'col' with the row-by-row product of 'col1' and 'col2'
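
For simple arithmetic like this, plain column expressions give the same result as `map` and `apply` and usually run faster. The following is a minimal sketch on made-up data; the variable names are illustrative.

```python
import pandas as pd

df = pd.DataFrame({"col1": [2, 4], "col2": [3, 5]})
df.index = pd.date_range("20190101", periods=len(df))

# Element-wise function on one column, and row-wise function on two columns
squared_map = df["col1"].map(lambda x: x ** 2)
product_apply = df.apply(lambda r: r["col1"] * r["col2"], axis=1)

# Vectorized equivalents of the two lines above
squared_vec = df["col1"] ** 2
product_vec = df["col1"] * df["col2"]

print(squared_map.tolist(), squared_vec.tolist())
print(product_apply.tolist(), product_vec.tolist())
```

`map` and `apply` remain useful when the per-element or per-row logic cannot be written as a column expression.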

Third, find the moving average
df['MA'] = df['col'].rolling(window=3, center=False).mean()  # mean of the current row and the two rows before it; the first two rows are NaN
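
A small sketch of how the window behaves at the start of the series, using made-up values. With the default settings the first `window - 1` results are NaN; passing `min_periods=1` is one way to get partial averages instead.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5], dtype="float64")

ma = s.rolling(window=3, center=False).mean()       # needs 3 values per window
ma_partial = s.rolling(window=3, min_periods=1).mean()  # averages whatever is available

print(ma.tolist())
print(ma_partial.tolist())
```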

Fourth, shift a column up or down
df = pd.DataFrame({'id': [1,1,1,2,2,3], 'value': [1,2,3,4,5,6]})
df['value_shift'] = df.groupby('id')['value'].shift(1)  # within each id group, move the value column down 1 row
df['value_shift_1'] = df.groupby('id')['value'].shift(-1)  # within each id group, move the value column up 1 row
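
A grouped shift never carries a value across group boundaries, so the first (or last) row of each group becomes NaN. The sketch below shows this and one common use, the change from the previous row within a group. The column names `prev`, `next` and `change` are hypothetical.

```python
import pandas as pd

df = pd.DataFrame({"id": [1, 1, 1, 2, 2, 3], "value": [1, 2, 3, 4, 5, 6]})

df["prev"] = df.groupby("id")["value"].shift(1)    # previous value within the same id
df["next"] = df.groupby("id")["value"].shift(-1)   # next value within the same id
df["change"] = df["value"] - df["prev"]            # NaN on the first row of each id

print(df)
```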

Fifth, standardize columns
from sklearn import preprocessing
df = pd.DataFrame({'id': [1,1,1,2,2,3], 'value1': [1,2,3,4,5,6], 'value2': [1,3,4,3,7,2]})
value = df[['value1', 'value2']]
# StandardScaler standardizes each column (feature) independently, so the columns can be passed in directly without transposing
scaler = preprocessing.StandardScaler().fit(value)
value_scale = scaler.transform(value)  # NumPy array of shape (6, 2); each column has mean 0 and standard deviation 1
# To standardize each row instead, fit and transform value.transpose(), then transpose the result back
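
Column-wise standardization can also be cross-checked with pandas alone. This is a sketch, not the article's method: it subtracts each column's mean and divides by the population standard deviation (`ddof=0`), which is the convention StandardScaler uses.

```python
import pandas as pd

df = pd.DataFrame({
    "id": [1, 1, 1, 2, 2, 3],
    "value1": [1, 2, 3, 4, 5, 6],
    "value2": [1, 3, 4, 3, 7, 2],
})
value = df[["value1", "value2"]]

# z-score per column: (x - column mean) / column population std
z = (value - value.mean()) / value.std(ddof=0)

col_means = z.mean()
col_stds = z.std(ddof=0)
print(col_means.round(6).tolist())
print(col_stds.round(6).tolist())
```

Note that pandas `std()` defaults to `ddof=1`, so omitting `ddof=0` would give slightly different values from StandardScaler.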

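# Sometimes you need the reshape method of np.array:
import numpy as np
y = df[['value1']].values  # a DataFrame has no reshape method, so convert to an array first; y.shape=(6,1)
y = y.reshape(1, -1)  # y.shape=(1,6)
y = y.reshape(-1, 1)  # y.shape=(6,1)
y = np.repeat(0, len(y))  # generate a one-dimensional array of 6 zeros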
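
The shapes in the comments above can be verified with a short sketch. It assumes a single made-up `value` column and uses `to_numpy()` to get an array from the DataFrame; `np.zeros` is shown as an alternative way to build the zero array.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({"value": [1, 2, 3, 4, 5, 6]})

y = df[["value"]].to_numpy()   # 2-D array, one column
row = y.reshape(1, -1)         # one row; -1 lets NumPy infer the length
col = row.reshape(-1, 1)       # back to one column
zeros = np.zeros(len(col))     # 1-D float array of zeros, same length as col

print(y.shape, row.shape, col.shape, zeros.shape)
```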

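Sixth, assign values to part of a column
df = pd.DataFrame({'id': [1,1,1,2,2,3], 'value': [1,2,3,4,5,6]})
value = [11, 22, 33]
df.loc[df.index[0:3], 'value'] = value  # overwrite the first three rows of the existing column 'value'
df.loc[df.index[0:3], 'value0'] = value  # 'value0' does not exist yet, so it is created; the remaining rows are NaN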
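
The difference between assigning into an existing column and into a new one can be seen in a quick sketch on the same data:

```python
import pandas as pd

df = pd.DataFrame({"id": [1, 1, 1, 2, 2, 3], "value": [1, 2, 3, 4, 5, 6]})
value = [11, 22, 33]

df.loc[df.index[0:3], "value"] = value    # existing column: only three rows change
df.loc[df.index[0:3], "value0"] = value   # new column: rows not assigned are NaN

print(df["value"].tolist())
print(df["value0"].isna().tolist())
```

The list on the right-hand side must have the same length as the row selection, otherwise pandas raises an error.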

Seventh, count how often each element occurs in a list
lst = ['a','a','a','b','c','c','b','e','f','a','a','c']
cnt = pd.Series(lst).value_counts()  # Series of counts indexed by element, sorted from most to least frequent
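
As a sanity check, the counts can be compared with `collections.Counter` from the standard library, which computes the same frequencies without pandas. A minimal sketch:

```python
from collections import Counter

import pandas as pd

lst = ["a", "a", "a", "b", "c", "c", "b", "e", "f", "a", "a", "c"]

cnt = pd.Series(lst).value_counts()   # counts as a Series, most frequent first
counter = Counter(lst)                # counts as a dict-like object

print(cnt.to_dict())
print(dict(counter))
```

`value_counts()` is convenient when the result feeds further pandas operations, for example `cnt[cnt > 1]` to keep only repeated elements.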

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.
