Sample datasets in Pandas

前端 未结 4 1245
醉酒成梦
醉酒成梦 2021-01-29 23:43

When using R it\'s handy to load \"practice\" datasets using

data(iris)

or

data(mtcars)

Is there something s

4条回答
  •  借酒劲吻你
    2021-01-30 00:06

    The builtin pandas testing DataFrame is very convenient.

    makeMixedDataFrame():

    In [22]: import pandas as pd
    
    In [23]: pd.util.testing.makeMixedDataFrame()
    Out[23]:
         A    B     C          D
    0  0.0  0.0  foo1 2009-01-01
    1  1.0  1.0  foo2 2009-01-02
    2  2.0  0.0  foo3 2009-01-05
    3  3.0  1.0  foo4 2009-01-06
    4  4.0  0.0  foo5 2009-01-07
    

    other testing DataFrame options:

    makeDataFrame():

    In [24]: pd.util.testing.makeDataFrame().head()
    Out[24]:
                       A         B         C         D
    acKoIvMLwE  0.121895 -0.781388  0.416125 -0.105779
    jc6UQeOO1K -0.542400  2.210908 -0.536521 -1.316355
    GlzjJESv7a  0.921131 -0.927859  0.995377  0.005149
    CMhwowHXdW  1.724349  0.604531 -1.453514 -0.289416
    ATr2ww0ctj  0.156038  0.597015  0.977537 -1.498532
    

    makeMissingDataframe():

    In [27]: pd.util.testing.makeMissingDataframe().head()
    Out[27]:
                       A         B         C         D
    qyXLpmp1Zg -1.034246  1.050093       NaN       NaN
    v7eFDnbQko  0.581576  1.334046 -0.576104 -0.579940
    fGiibeTEjx -1.166468 -1.146750 -0.711950 -0.205822
    Q8ETSRa6uY  0.461845 -2.112087  0.167380 -0.466719
    7XBSChaOyL -1.159962 -1.079996  1.585406 -1.411159
    

    makeTimeDataFrame():

    In [28]: pd.util.testing.makeTimeDataFrame().head()
    Out[28]:
                       A         B         C         D
    2000-01-03 -0.641226  0.912964  0.308781  0.551329
    2000-01-04  0.364452 -0.722959  0.322865  0.426233
    2000-01-05  1.042171  0.005285  0.156562  0.978620
    2000-01-06  0.749606 -0.128987 -0.312927  0.481170
    2000-01-07  0.945844 -0.854273  0.935350  1.165401
    

提交回复
热议问题