Function to replace NaN values in a dataframe with mean of the related column

后端未结

关注

 3  346

EDIT: This question is not a clone of pandas dataframe replace nan values with average of columns because I want to replace the value of each column with th

相关标签:

3条回答

别那么骄傲

2021-01-14 22:12
You can also use fillna
```
df = pd.DataFrame({'A': [1, 2, np.nan], 'B': [2, np.nan, np.nan]})
df.fillna(df.mean(axis=0))
    A   B
0   1.0 2.0
1   2.0 2.0
2   1.5 2.0
```
df.mean(axis=0) computes the mean for every column, and this is passed to the fillna method.

This solution is on my machine, twice as fast as the solution using apply for the data set shown above.
0 讨论(0)
发布评论:

提交评论
- 加载中...
囚心锁ツ

2021-01-14 22:14
You can try something like:
```
[df[col].fillna(df[col].mean(), inplace=True) for col in df.columns]
```
But that is just a way to do it. Your code is a priori almost correct. Your error is that you should call
```
train[value]
```
Instead of :
```
train['value']
```
Everywhere in your code. Because the latter will try to look for a column named as "value" which is rather a variable from a list you are iterating on.
0 讨论(0)
发布评论:

提交评论
- 加载中...
暗喜

2021-01-14 22:26
To fill NaN of each column with its respective mean use:
```
df.apply(lambda x: x.fillna(x.mean())) 
```
0 讨论(0)
发布评论:

提交评论
- 加载中...