Key error when selecting columns in pandas dataframe after read_csv

后端未结

关注

 3  2114

一向

I\'m trying to read in a CSV file into a pandas dataframe and select a column, but keep getting a key error.

The file reads in successfully and I can view the datafr

相关标签:

3条回答

被撕碎了的回忆

2020-12-01 11:07

use sep='\s*,\s*' so that you will take care of spaces in column-names:

transactions = pd.read_csv('transactions.csv', sep=r'\s*,\s*',
                           header=0, encoding='ascii', engine='python')

alternatively you can make sure that you don't have unquoted spaces in your CSV file and use your command (unchanged)

prove:

print(transactions.columns.tolist())

Output:

['product_id', 'customer_id', 'store_id', 'promotion_id', 'month_of_year', 'quarter', 'the_year', 'store_sales', 'store_cost', 'unit_sales', 'fact_count']

0 讨论(0)

轻奢々

2020-12-01 11:20
if you need to select multiple columns from dataframe use 2 pairs of square brackets eg.
```
df[["product_id","customer_id","store_id"]]
```
0 讨论(0)
发布评论:

提交评论
- 加载中...

春和景丽

2020-12-01 11:21

The key error generally comes if the key doesn't match any of the dataframe column name 'exactly':

You could also try:

import csv
import pandas as pd
import re
    with open (filename, "r") as file:
        df = pd.read_csv(file, delimiter = ",")
        df.columns = ((df.columns.str).replace("^ ","")).str.replace(" $","")
        print(df.columns)

0 讨论(0)