I have a Pandas DataFrame with a mix of screen names, tweets, fav’s etc. I want find the max value of ‘favcount’ (which i have already done) and also return the screen name of that ‘tweet’
df = pd.DataFrame() df['timestamp'] = timestamp df['sn'] = sn df['text'] = text df['favcount'] = fav_count print df print '------' print df['favcount'].max()
I cant seem to find anything on this, can anyone help guide me in the right direction?
Answers:
Thank you for visiting the Q&A section on Magenaut. Please note that all the answers may not help you solve the issue immediately. So please treat them as advisements. If you found the post helpful (or not), leave a comment & I’ll get back to you as soon as possible.
Method 1
Use argmax()idxmax() to get the index of the max value. Then you can use loc
df.loc[df['favcount'].idxmax(), 'sn']
Edit: argmax() is now deprecated, switching for idxmax()
Method 2
I think you need idxmax – get index of max value of favcount and then select value in column sn by loc:
df = pd.DataFrame({'favcount':[1,2,3], 'sn':['a','b','c']})
print (df)
favcount sn
0 1 a
1 2 b
2 3 c
print (df.favcount.idxmax())
2
print (df.loc[df.favcount.idxmax()])
favcount 3
sn c
Name: 2, dtype: object
print (df.loc[df.favcount.idxmax(), 'sn'])
c
Method 3
By using same df as above,
# python code
df = pd.DataFrame({'favcount':[1,2,3], 'sn':['a','b','c']})
print (df)
favcount sn
0 1 a
1 2 b
2 3 c
## You can use max()
print(df[df.favcount.max() == df['favcount']])
favcount sn
2 3 c
## If you need specific column you can select it
print(df[df.favcount.max() == df['favcount']].sn)
2 c
All methods was sourced from stackoverflow.com or stackexchange.com, is licensed under cc by-sa 2.5, cc by-sa 3.0 and cc by-sa 4.0