Python資料分析以及可視化

最後更新：2017-06-09 來源：互聯網

上載者：User

創建阿里雲帳戶，並獲得超過 40 款產品的免費試用版；而企業帳戶則可以享有總值 $1200 的免費試用版。立即註冊！

標籤：each dia parallel 種類屬性 sns str 分析 ==

引入網址：https://www.kaggle.com/benhamner/d/uciml/iris/python-data-visualizations/notebook

import matplotlib.pyplot as plt

import seaborn as sns

import pandas as pd

匯入資料：

iris=pd.read_csv(‘E:\\data\\Iris.csv‘)

iris.head()

作長條圖：

plt.hist(iris[‘SepalLengthCm‘],bins=15)

plt.xlabel(‘SepalLengthCm‘)

plt.ylabel(‘quantity‘)

plt.title(‘distribution of SepalLengthCm‘)

plt.show()

作散佈圖：

但是這樣的圖並沒有將其中的不同種類話=花的顏色標明出來。所以我們使用另一種方式，通過Species來分類

小提琴圖

箱型圖：

ax=sns.boxplot(x="Species",y="SepalWidthCm",data=iris)

這裡使用jitter=True使得所有點不全落在中間的這條垂直線上

ax=sns.boxplot(x="Species",y="SepalWidthCm",data=iris)

ax=sns.stripplot(x="Species",y="SepalWidthCm",data=iris,jitter=True,edgecolor="gray")

kdeplot:單變數之間的聯絡，並通過一個核密度評估變數潛在的特徵

顯示兩個變數之間的聯絡：

通過diag_kind=‘kde‘顯示雙變數間的核密度並用其估計其特徵

boxplot_1:通過Species將變數間的各個特徵分離出來

Andrews curves:安德魯曲線使用樣本的屬性作為傅裡葉變換的係數

Radviz:多變數可視化,每一個特徵都在一個平面上顯示出來，樣本通過圈上的點與之像聯絡起來

parallel_coordinates通過平行座標進行多變數可視化：對於資料樣本的每一個特徵，我們通過在座標軸上設定特徵並通過畫線的方式來進行設定

Python資料分析以及可視化

本文章原先以中文撰寫並發佈於 aliyun.com，亦設英文版本，僅作資訊用途。本網站不對文章的準確性，完整性或可靠性或其任何翻譯作出任何明示或暗示的陳述或保證。如對該文章有任何疑慮或投訴，請傳送電郵至 info-contact@alibabacloud.com 並提供相關疑慮或投訴的詳細說明。職員會於 5 個工作天內與您聯絡，一經驗證之後，即會刪除該侵權內容。

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Python資料分析以及可視化

聯繫我們

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support