删除重复行后在数据帧中维护索引



我已经从数据帧中删除了重复项,因此索引被更改,如果我想访问df['Color'][1],它显示错误,那么我该如何维护它

import pandas as pd
boxes = {'Color': ['Green','Green','Green','Blue','Blue','Red','Red','Red'],
'Shape': ['Rectangle','Rectangle','Square','Rectangle','Square','Square','Square','Rectangle']
}
df = pd.DataFrame(boxes, columns = ['Color', 'Shape'])
df = df.drop_duplicates()
print(df)

输出


Color      Shape
0  Green  Rectangle
2  Green     Square
3   Blue  Rectangle
4   Blue     Square
5    Red     Square
7    Red  Rectangle

我想要的

Color      Shape
0  Green  Rectangle
1  Green     Square
2   Blue  Rectangle
3   Blue     Square
4    Red     Square
5    Red  Rectangle

使用ignore_index=True作为drop_duplicates:的参数

>>> df.drop_duplicates(ignore_index=True)
Color      Shape
0  Green  Rectangle
1  Green     Square
2   Blue  Rectangle
3   Blue     Square
4    Red     Square
5    Red  Rectangle

最新更新