Skip to content Skip to sidebar Skip to footer

Pivot A Dataframe With Two Columns As The Index

I have data in the following format: Record ID Para Tag 1 A x 1 A y 2 B x 2 B y 1 A z I want to transform the data int

Solution 1:

This is get_dummies

pd.get_dummies(df.set_index(['RecordID','Para'])).sum(level=[0,1]).reset_index()
Out[132]: 
   RecordIDParaTag_xTag_yTag_z01A11112B110

Solution 2:

pivot_table

pivot_table works nicely here, and should be fast:

df.pivot_table(
    index=['Record ID', 'Para'], columns='Tag', aggfunc='size', fill_value=0
).add_prefix('Tag_').reset_index()

TagRecordIDParaTag_xTag_yTag_z01A11112B110

crosstab

pd.crosstab(
    [df['Record ID'], df['Para']], df['Tag']
).add_prefix('Tag_').reset_index()

Tag  Record ID Para  Tag_x  Tag_y  Tag_z
0            1    A      1      1      1
1            2    B      1      1      0

Post a Comment for "Pivot A Dataframe With Two Columns As The Index"