Pandas 피벗 테이블(Pivot table)

pivot table을 사용하면 컬럼에 중복된 값이 있을 때,

컬럼을 인덱스로 만들어 유니크하게 만들 수 있다.

import pandas as pd
import numpy as np

df = pd.read_excel('../data/sales-funnel.xlsx')
# 에러가 날 경우 pip install openpyxl

df

pd.pivot_table(df, index=['Name'])
# index 파라미터에는, 유니크하게 만들고 싶은 컬럼의 이름을 적는다.
# 중복되는게 있을때 디폴트로 수치데이터들의 평균이 출력된다.

# Manager 와 Rep 를 유니크하게 만들고, 수치의 평균을 구하라
pd.pivot_table(df, index=['Manager', 'Rep'])

# Name 컬럼을 유니크하게 만들되, 수치 데이터들은 모두 더하라
pd.pivot_table(df, index=['Name'], aggfunc=np.sum)

# Name 컬럼은 유니크하게 만들되, 수치 데이터들은 모두 더하라
# 필요없는 Account 컬럼은 제외시키고 만든다
pd.pivot_table(df2, index=['Name'], aggfunc=np.sum, values=['Price', 'Quantity'])

# Name 컬럼은 유니크하게 만들되, 수치 데이터들은 더한값, 평균값 두개로 표시
# 필요없는 Account 컬럼은 제외시키고 만든다
pd.pivot_table(df2, index=['Name'], aggfunc=[np.sum, np.mean], values=['Price', 'Quantity'])

'Python > Pandas' 카테고리의 다른 글

Numpy/Pandas 에서 datetime 다루기 (0)	2022.12.02
Pandas 데이터프레임에서 boolean indexing의 활용 (0)	2022.12.01
오픈API Goolglemaps Geocode로 위치정보 받아오기 (0)	2022.12.01
Pandas의 DataFrame.plot() 으로 시각화하기 (0)	2022.12.01
pandas.read_csv() 콤마( , ) 가 포함된 수치형 컬럼 불러오기 (1)	2022.12.01

'Python > Pandas' 카테고리의 다른 글

티스토리툴바