Determine the number of NA values in a column

后端未结

关注

 14  1399

I want to count the number of NA values in a data frame column. Say my data frame is called df, and the name of the column I am considering is

相关标签:

14条回答

日久生厌

2020-12-02 04:51
Try this:
```
length(df$col[is.na(df$col)])
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
时光说笑

2020-12-02 04:53
User rrs answer is right but that only tells you the number of NA values in the particular column of the data frame that you are passing to get the number of NA values for the whole data frame try this:
```
apply(<name of dataFrame>, 2<for getting column stats>, function(x) {sum(is.na(x))})
```
This does the trick
0 讨论(0)
发布评论:

提交评论
- 加载中...
北海茫月

2020-12-02 04:54
Similar to hute37's answer but using the purrr package. I think this tidyverse approach is simpler than the answer proposed by AbiK.
```
library(purrr)
map_dbl(df, ~sum(is.na(.)))
```
Note: the tilde (~) creates an anonymous function. And the '.' refers to the input for the anonymous function, in this case the data.frame df.
0 讨论(0)
发布评论:

提交评论
- 加载中...
故里飘歌

2020-12-02 04:55

In the summary() output, the function also counts the NAs so one can use this function if one wants the sum of NAs in several variables.

0 讨论(0)
发布评论:

提交评论
- 加载中...
难免孤独

2020-12-02 04:55
A quick and easy Tidyverse solution to get a NA count for all columns is to use summarise_all() which I think makes a much easier to read solution than using purrr or sapply
```
library(tidyverse)
# Example data
df <- tibble(col1 = c(1, 2, 3, NA), 
             col2 = c(NA, NA, "a", "b"))

df %>% summarise_all(~ sum(is.na(.)))
#> # A tibble: 1 x 2
#>    col1  col2
#>   <int> <int>
#> 1     1     2
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
我寻月下人不归

2020-12-02 04:57
Try the colSums function
```
df <- data.frame(x = c(1,2,NA), y = rep(NA, 3))

colSums(is.na(df))

#x y 
#1 3 
```
0 讨论(0)
发布评论:

提交评论
- 加载中...