Drop all duplicate rows across multiple columns in Python Pandas
The pandas
drop_duplicates
function is great for “uniquifying” a dataframe. However, one of the keyword arguments to pass is take_last=True
or take_last=False
, while I would like to drop all rows which are duplicates across a subset of columns. Is this possible?