数据分析和挖掘实战第15章的一段读取.txt文件报错
import pandas as pd
inputfile = 'data/meidi_jd.txt'
outputfile = 'data/meidi_jd_process_1.txt'
data = pd.read_csv(inputfile,encoding = 'utf-8',header =None)
File "pandas/_libs/parsers.pyx", line 965, in pandas._libs.parsers.TextReader._tokenize_rows
File "pandas/_libs/parsers.pyx", line 2208, in pandas._libs.parsers.raise_parser_error
ParserError: Error tokenizing data. C error: Expected 1 fields in line 122, saw 2
修改为:
import pandas as pd
inputfile = 'data/meidi_jd.txt'
outputfile = 'data/meidi_jd_process_1.txt'
data = pd.read_csv(inputfile,encoding = 'utf-8',header =None,sep='\t')
或者:
import pandas as pd
inputfile = 'data/meidi_jd.txt'
outputfile = 'data/meidi_jd_process_1.txt'
data = pd.read_csv(inputfile,encoding = 'utf-8',header =None,delimiter='\t')
来源:oschina
链接:https://my.oschina.net/u/4347910/blog/3972027