This article describes how the pandas series with the index index is vectorized:1. Index indexed arrays are the same:S1 = PD. Series ([1, 2, 3, 4], index=['a','b','C','D']) S2= PD. Series ([ten, +, +], index=['a','b','C','D'])PrintS1 +s2a11b22C33D44Dtype:int64Add the values corresponding to each index directly2. Index indexed array values are the same, in different order:S1 = PD. Series ([1, 2, 3, 4], index=['a','b','C','D']) S2= PD. Series ([ten, +,
way, and filtering through a Boolean array.However, it is important to note that because the index of the Pandas object is not limited to integers, it is included at the end when using a non-integer as the tile index.>>> fooa 4.5b 7.2c -5.3d 3.6dtype:float64>>> bar0 4.51 7.22 -5.33 3.6dtype:float64>>> foo[:2]a 4.5b 7.2dtype:float64>>> bar[:2]0 4.51 7.2dtype:float64>>> foo[: ' C ']a 4.5b 7.2c -5.3dtype:float64
It's been a lot of red boxes all afternoon.
Python2 and Python3 version conflicts
Pip version IssuePip-v
Updatesudo apt-get update
sudo apt-get install Python-dev
Finally do not know how to install, feeling is one of the following two ways‘‘‘ C++ sudo easy_install -U setuptools ‘‘‘ ‘‘‘ C++ sudo pip install --upgrade setuptools ‘‘‘
(Just beginning to try also not, do not know why suddenly magic can.) If not again, run both sides, see there is an answer is to run on both
. Display indexes, columns, and underlying numpy data:3. The describe () function is a quick statistical summary of the data:4. Transpose the data:5, by axis to sort6. Sort by valueThird, the choiceWhile the standard python/numpy selection and setup expressions can come in handy, we recommend using optimized pandas data access as the code used for the project:. At,. IAT,. Loc,. Iloc and. IX For details see
row name, where the debt column is added, but there is no data, so it is Nan Can be debt, assign a value Take the line, with IX You can also use nested dictionaries to create dataframe, which are actually series dictionaries, which are dictionaries themselves, so they are nested dictionaries. Can be like a numpy matrix, transpose Essential functionality Here's a look at what the pandas prov
The source of this article:Python for Data Anylysis:chapter 5Ten mintues to Pandas:http://pandas.pydata.org/pandas-docs/stable/10min.html#min1. Pandas IntroductionAfter several years of development, pandas has become the most commonly used package in Python processing data. The following is the beginning of the development of
NaNB 2001 3500 NaN 1C 2002 4500 NaN 2D 2003 6000 NaN 3Del data1[' outcome ']The result of deleting a column is:Year Income MoneyA 2000 3000 0B 2001 3500 1C 2002 4500 2D 2003 6000 3Primary index objects in pandas and their corresponding indexed methods and propertiesThere's also a reindex function to rebuild the indexdata={' year ': [2000,2001,2002,2003],' Income ': [3000,3500,4500,6000]}DATA1=PD. DataFrame (data,columns=[' year ', ' income ', ' outco
Python pandas usage Daquan, pythonpandas Daquan
1. Generate a data table
1. Import the pandas database first. Generally, the numpy database is used. Therefore, import the database first:
import numpy as npimport pandas as pd
2. Import CSV or xlsx files:
df = pd.DataFrame(pd.read_csv('name.csv',header=1))df = pd.DataFrame(pd.read_excel('name.xlsx'))
3. Create a da
This time to bring you pandas+dataframe to achieve the choice of row and slice operation, pandas+dataframe to achieve the row and column selection and the attention of the slicing operation, the following is the actual case, take a look.
Select in SQL is selected according to the name of the column, pandas is more flexible, not only can be selected according to
Recent work and Hive SQL to deal with more, occasionally encountered some problems of SQL is not easy to solve, will be downloaded to the file with pandas to deal with, due to the large amount of data, so there are some relevant experience can be shared with you, hope to learn pandas help YOU.Read and write large text dataSometimes we get a lot of text files, full read into the memory, read the process will
This article mainly introduced the Python pandas in the Dataframe type data operation function method, has certain reference value, now shares to everybody, has the need friend to refer to
The Python data analysis tool pandas Dataframe and series as the primary data structures.
This article is mainly about how to operate the Dataframe data and combine an instance to test the operation function.
1) View Dat
', ' C ', ' d ', ' e '])Two discards the item on the specified axisThe data on a row can be discarded by means of a drop , and the parameter is the row indexin [+]: objOUT[64]:1 42 73 54 3Dtype:int64In [All]: New=obj.drop (1)in [+]: NewOUT[66]:2 73 54 3Dtype:int64Three-index, select and filterIn the list and tuple of Python, we can get the information we want by slicing, and we can also get the information by slicing in pandas. In []: Obj[2:4]OUT[6
TurnThe same lesson is reproduced from the great God. The sample code will be incrementally added in the future.PandasPandas is a numpy-based tool that was created to solve the data analysis task. Pandas incorporates a number of libraries and a number of standard data models, providing the tools needed to efficiently manipulate large datasets. Pandas provides a number of functions and methods that enable us
Querying and analyzing data is an important function of pandas, is also the basis of our learning pandas, the following article mainly introduces you about how to use the data analysis of Python pandas query data, the text through the sample code introduced in very detailed, the needs of friends can reference , let's take a look below.
Objective
In the field of
This time to bring you pandas in the Dataframe query what methods, pandas in the Dataframe query of what matters, the following is the actual case, together to see.
Pandas provides us with a variety of slicing methods, which are often confusing if you don't know them well. The following are examples of how these slices are described.
Data introduction
A random s
Pandas common knowledge required for data analysis and mining in PythonObjectivePandas is based on two types of data: series and Dataframe.A series is a one-dimensional data type in which each element has a label. The series is similar to an array of elements tagged in numpy. Where the label can be either a number or a string.A dataframe is a two-dimensional table structure. Pandas's Dataframe can store many different data types, and each axis has its
Pandas: data Analysis Library built on NumPyPANDAS data structure: Series, DataFrameSeries: class one-dimensional array objects with data labels (also considered as dictionaries)Values, indexMissing data detection: Pd.isnull (), Pd.notnull (), instance method for series objectsThe series object itself and its index have a Name property, which is closely related to pandas other key functionsDataFrame: Tabula
index-feature name-Attribute-easy to understand
2. filter the row and column data of dataframe
import pandas as pd,numpy as npfrom pandas import DataFramedf = DataFrame(np.arange(20).reshape((4,5)),column = list('abcde'))
1. df [] df. Select column data
Df.Df [['A', 'B']
2. df. loc [[index], [colunm] use tags to select data
When you do not filter rows, enter "(cannot be blank)" in "[index]", that is, "df
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.