Pivot A Dataframe With Two Columns As The Index
I have data in the following format: Record ID Para Tag 1 A x 1 A y 2 B x 2 B y 1 A z I want to transform the data int
Solution 1:
This is get_dummies
pd.get_dummies(df.set_index(['RecordID','Para'])).sum(level=[0,1]).reset_index()
Out[132]:
RecordIDParaTag_xTag_yTag_z01A11112B110
Solution 2:
pivot_table
pivot_table
works nicely here, and should be fast:
df.pivot_table(
index=['Record ID', 'Para'], columns='Tag', aggfunc='size', fill_value=0
).add_prefix('Tag_').reset_index()
TagRecordIDParaTag_xTag_yTag_z01A11112B110
crosstab
pd.crosstab(
[df['Record ID'], df['Para']], df['Tag']
).add_prefix('Tag_').reset_index()
Tag Record ID Para Tag_x Tag_y Tag_z
0 1 A 1 1 1
1 2 B 1 1 0
Post a Comment for "Pivot A Dataframe With Two Columns As The Index"