I have a Pandas Dataframe as shown below:
1 2 3 0 a NaN read 1 b l unread 2 c NaN read
I want to remove the NaN values with an empty string so that it looks like so:
1 2 3 0 a "" read 1 b l unread 2 c "" read
Answers:
Thank you for visiting the Q&A section on Magenaut. Please note that all the answers may not help you solve the issue immediately. So please treat them as advisements. If you found the post helpful (or not), leave a comment & I’ll get back to you as soon as possible.
Method 1
df = df.fillna('')
or just
df.fillna('', inplace=True)
This will fill na’s (e.g. NaN’s) with ''.
If you want to fill a single column, you can use:
df.column1 = df.column1.fillna('')
One can use df['column1'] instead of df.column1.
Method 2
import numpy as np df1 = df.replace(np.nan, '', regex=True)
This might help. It will replace all NaNs with an empty string.
Method 3
If you are reading the dataframe from a file (say CSV or Excel) then use :
df.read_csv(path , na_filter=False)
df.read_excel(path , na_filter=False)
This will automatically consider the empty fields as empty strings ''
If you already have the dataframe
df = df.replace(np.nan, '', regex=True)
df = df.fillna('')
Method 4
Use a formatter, if you only want to format it so that it renders nicely when printed. Just use the df.to_string(... formatters to define custom string-formatting, without needlessly modifying your DataFrame or wasting memory:
df = pd.DataFrame({
'A': ['a', 'b', 'c'],
'B': [np.nan, 1, np.nan],
'C': ['read', 'unread', 'read']})
print df.to_string(
formatters={'B': lambda x: '' if pd.isnull(x) else '{:.0f}'.format(x)})
To get:
A B C 0 a read 1 b 1 unread 2 c read
Method 5
Try this,
add inplace=True
import numpy as np df.replace(np.NaN, '', inplace=True)
Method 6
using keep_default_na=False should help you:
df = pd.read_csv(filename, keep_default_na=False)
Method 7
If you are converting DataFrame to JSON, NaN will give error so best solution is in this use case is to replace NaN with None.
Here is how:
df1 = df.where((pd.notnull(df)), None)
Method 8
I tried with one column of string values with nan.
To remove the nan and fill the empty string:
df.columnname.replace(np.nan,'',regex = True)
To remove the nan and fill some values:
df.columnname.replace(np.nan,'value',regex = True)
I tried df.iloc also. but it needs the index of the column. so you need to look into the table again. simply the above method reduced one step.
All methods was sourced from stackoverflow.com or stackexchange.com, is licensed under cc by-sa 2.5, cc by-sa 3.0 and cc by-sa 4.0