What does the “More Columns than Column Names” error mean?

后端未结

关注

 5  1447

春和景丽

I\'m trying to read in a .csv file from the IRS and it doesn\'t appear to be formatted in any weird way.

I\'m using the read.table() function, which I h

相关标签:

5条回答

再見小時候

2021-01-07 21:32

Depending on the data (e.g. tsv extension) it may use tab as separators, so you may try sep = '\t' with read.csv.

0 讨论(0)
发布评论:

提交评论
- 加载中...
谎友^

2021-01-07 21:33

For the Germans:

you have to change your decimal commas into a Full stop in your csv-file (in Excel:File -> Options -> Advanced -> "Decimal seperator") , then the error is solved.

0 讨论(0)
发布评论:

提交评论
- 加载中...
余生分开走

2021-01-07 21:37

you have have strange characters in your heading # % -- or ,

0 讨论(0)
发布评论:

提交评论
- 加载中...
滥情空心

2021-01-07 21:48

This error can get thrown if your data frame has sf geometry columns.

0 讨论(0)
发布评论:

提交评论
- 加载中...
情书的邮戳

2021-01-07 21:55
It uses commas as separators. So you can either set sep="," or just use read.csv:
```
x <- read.csv(file="http://www.irs.gov/file_source/pub/irs-soi/countyinflow1011.csv")
dim(x)
## [1] 113593      9
```
The error is caused by spaces in some of the values, and unmatched quotes. There are no spaces in the header, so read.table thinks that there is one column. Then it thinks it sees multiple columns in some of the rows. For example, the first two lines (header and first row):
```
State_Code_Dest,County_Code_Dest,State_Code_Origin,County_Code_Origin,State_Abbrv,County_Name,Return_Num,Exmpt_Num,Aggr_AGI
00,000,96,000,US,Total Mig - US & For,6973489,12948316,303495582
```
And unmatched quotes, for example on line 1336 (row 1335) which will confuse read.table with the default quote argument (but not read.csv):
```
01,089,24,033,MD,Prince George's County,13,30,1040
```
0 讨论(0)
发布评论:

提交评论
- 加载中...