[Pandas] 그룹별/범주별 집계 - groupby()

cybergangster·2022년 6월 20일

Pandas

목록 보기

6/20

데이터프레임을 그룹별/범주별로 묶을 수 있다.
한 눈에 볼 수 없지만, 손쉽게 묶어낼 수 있고 빠른 데이터 처리가 가능하다.

df['test'] = ['a','a','b','b','b','c','d','d','b']

test_g=df.groupby('test') # 'test'열을 기준으로 그룹핑하여 test_g에 저장
test_g.size()
--------------
test
a  2
b  4
c  1
d  2
dtype: int64
--------------

test_g=df.groupby('test1','test2') #2개 이상의 열을 기준으로 그룹화
test_g.mean()
test_g.max()
test_g.aggregate([min,max]) # aggregat() == agg() 인듯?
test_g.aggregate(['mean','std'])
test.describes() # 통계 모두 출력
test_g=df.groupby('test1').sum() # 등 필요에 따라 적절하게 사용!

Reference
1) https://seong6496.tistory.com/121
2) https://ponyozzang.tistory.com/291

cybergangster

이전 포스트

[Pandas] astype()으로 dtype/데이터 타입 변경하기

다음 포스트

[Pandas] 그룹별/범주별 집계 - groupby()

Pandas

[Pandas] astype()으로 dtype/데이터 타입 변경하기

[Pandas] pandas란

0개의 댓글

관련 채용 정보