Multiple columns with the same name in Pandas

后端未结

关注

 2  349

终归单人心 2021-01-04 13:24

I am creating a dataframe from a CSV file. I have gone through the docs, multiple SO posts, links as I have just started Pandas but didn\'t get it. The CSV file

2条回答

北海茫月 (楼主)

2021-01-04 14:19
the relevant parameter is mangle_dupe_cols

from the docs
```
mangle_dupe_cols : boolean, default True
    Duplicate columns will be specified as 'X.0'...'X.N', rather than 'X'...'X'
```
by default, all of your 'a' columns get named 'a.0'...'a.N' as specified above.

if you used mangle_dupe_cols=False, importing this csv would produce an error.

you can get all of your columns with
```
df.filter(like='a')
```
demonstration
```
from StringIO import StringIO
import pandas as pd

txt = """a, a, a, b, c, d
1, 2, 3, 4, 5, 6
7, 8, 9, 10, 11, 12"""

df = pd.read_csv(StringIO(txt), skipinitialspace=True)
df
```
```
df.filter(like='a')
```
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...