Save pandas dataframe but conserving NA values

后端 未结 2 1004
鱼传尺愫
鱼传尺愫 2020-12-16 19:57

I have this code

import pandas as pd
import numpy as np
import csv
df = pd.DataFrame({\'animal\': \'cat dog cat fish dog cat cat\'.split(),
               \'         


        
相关标签:
2条回答
  • 2020-12-16 20:28

    To get that specific output, you'll have to pass the quotes in explicitly.

    df = pd.DataFrame({'animal': r'"cat" "dog" "cat" "fish" "dog" "cat" "cat"'.split(),
               'size': list(r'"S" "S" "M" "M" "M" "L" "L"'.split()),
               'weight': [8, 10, 11, 1, 20, 12, 12],
               'adult' : [False] * 5 + [True] * 2}); 
    df['weight'] = '%s' %('NA')
    df.to_csv("ejemplo.csv", sep=';', decimal=',',quoting=csv.QUOTE_NONE, index=False)
    
    0 讨论(0)
  • 2020-12-16 20:40

    If you want a string to represent NaN values then pass na_rep to to_csv:

    In [8]:
    df.to_csv(na_rep='NA')
    
    Out[8]:
    ',adult,animal,size,weight\n0,False,cat,S,NA\n1,False,dog,S,NA\n2,False,cat,M,NA\n3,False,fish,M,NA\n4,False,dog,M,NA\n5,True,cat,L,NA\n6,True,cat,L,NA\n'
    

    If you want the NA in quotes then escape the quotes:

    In [3]:
    df = pd.DataFrame({'animal': 'cat dog cat fish dog cat cat'.split(),
                   'size': list('SSMMMLL'),
                   'weight': [8, 10, 11, 1, 20, 12, 12],
                   'adult' : [False] * 5 + [True] * 2})
    df['weight'] = np.NaN
    df.to_csv(na_rep='\'NA\'')
    
    Out[3]:
    ",adult,animal,size,weight\n0,False,cat,S,'NA'\n1,False,dog,S,'NA'\n2,False,cat,M,'NA'\n3,False,fish,M,'NA'\n4,False,dog,M,'NA'\n5,True,cat,L,'NA'\n6,True,cat,L,'NA'\n"
    

    EDIT

    To get the desired output use these params:

    In [27]:
    df.to_csv(na_rep='NA', sep=';', index=False,quoting=3)
    ​
    Out[27]:
    'adult;animal;size;weight\nFalse;cat;S;NA\nFalse;dog;S;NA\nFalse;cat;M;NA\nFalse;fish;M;NA\nFalse;dog;M;NA\nTrue;cat;L;NA\nTrue;cat;L;NA\n'
    
    0 讨论(0)
提交回复
热议问题