Seaborn 실습
1. 환경준비
from sklearn.datasets import load_iris
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import warnings
warnings.filterwarnings(action='ignore')
- 사용 데이터: sklearn iris data
iris = load_iris()
iris_df = pd.DataFrame(iris.data, columns=iris.feature_names)
iris_df['target'] = iris.target
iris_df.head()
|
sepal length (cm) |
sepal width (cm) |
petal length (cm) |
petal width (cm) |
target |
0 |
5.1 |
3.5 |
1.4 |
0.2 |
0 |
1 |
4.9 |
3.0 |
1.4 |
0.2 |
0 |
2 |
4.7 |
3.2 |
1.3 |
0.2 |
0 |
3 |
4.6 |
3.1 |
1.5 |
0.2 |
0 |
4 |
5.0 |
3.6 |
1.4 |
0.2 |
0 |
2. seaborn 다양한 차트들
1) 기본 차트들
1. histplot
sns.histplot(data = iris_df, x='sepal length (cm)', bins = 16)
plt.show()
data:image/s3,"s3://crabby-images/a266a/a266a525ef74722d1e327791cde3fa62cafe14b8" alt=""
sns.histplot(data = iris_df, x='sepal length (cm)', bins = 16, hue = 'target')
plt.show()
data:image/s3,"s3://crabby-images/acf41/acf41260baae4dd91846ee22b8f8188e4cd986e9" alt=""
2. densityplot
sns.kdeplot(data = iris_df, x = 'sepal length (cm)')
plt.show()
data:image/s3,"s3://crabby-images/1e62a/1e62a1f23219664e69e6a2bcb3d4ede8ea7b8303" alt=""
sns.kdeplot(data = iris_df, x='sepal length (cm)', hue = 'target')
plt.show()
data:image/s3,"s3://crabby-images/408b2/408b2bd78184b9036523c24ca0b3bba472cc7f35" alt=""
3. boxplot
sns.boxplot(data = iris_df, y = 'sepal length (cm)')
plt.show()
data:image/s3,"s3://crabby-images/875a7/875a7d0128245b794e5fb5706846c0f8cc007bbd" alt=""
sns.boxplot(data = iris_df, y = 'sepal length (cm)', x = 'target')
plt.show()
data:image/s3,"s3://crabby-images/a4430/a4430f2901b97a07d8e33a6d5c28f76acd63e204" alt=""
2) distplot : histplot + density plot
sns.distplot(iris_df['sepal length (cm)'], bins=16, hist_kws=dict(edgecolor='gray'))
plt.show()
data:image/s3,"s3://crabby-images/54602/54602c60f11eb3560b790093e9c488df123715a0" alt=""
sns.histplot(data = iris_df, x = 'sepal width (cm)', bins = 16, hue='target')
plt.show()
data:image/s3,"s3://crabby-images/221c2/221c20efbc07ba31541654dd38fc729a36d9345b" alt=""
sns.kdeplot(data = iris_df, x='sepal width (cm)', hue='target')
plt.show()
data:image/s3,"s3://crabby-images/1e646/1e646b98f8563f3477335ddeebb34236ba9f4370" alt=""
3) jointplot : scatter + histplot(or density plot)
sns.jointplot(x = 'petal length (cm)', y = 'petal width (cm)', data = iris_df)
plt.show()
data:image/s3,"s3://crabby-images/bcecc/bceccc97af9b37dbd0a208e7892f9a9d2c070d30" alt=""
sns.jointplot(x='petal length (cm)', y='petal width (cm)', data = iris_df, hue = 'target')
plt.show()
data:image/s3,"s3://crabby-images/2bdcb/2bdcb83a007f86ef03323527ec6d7f76e9cb60b7" alt=""
4) pairplot : scatter + histogram(or density plot) 확장
sns.pairplot(iris_df, hue = 'target')
plt.show()
data:image/s3,"s3://crabby-images/8c456/8c456ea25efba7fae46b5e0f27b272a9f587b4e0" alt=""
5) countplot : 집계 + bar plot
cnt = iris_df['target'].value_counts()
plt.bar(x = cnt.index, height = cnt.values, color=['r', 'g', 'b'])
plt.show()
data:image/s3,"s3://crabby-images/487c9/487c98df3ef2a3c5a6a2a5ac84c95e62c77e7d19" alt=""
sns.countplot(x = "target", data = iris_df)
plt.show()
data:image/s3,"s3://crabby-images/3fe7e/3fe7e6544ead72913eb28cfa1e2a219e2c78008a" alt=""
6) barplot : 평균비교 bar plot + error bar
sns.barplot(x = "target", y="sepal length (cm)", data = iris_df)
plt.show()
data:image/s3,"s3://crabby-images/66678/666784659e442877f60ccde18d03de7c203861ae" alt=""