Pandas dataframe groupby to calculate population standard deviation

后端未结

关注

 2  1718

野趣味 2021-02-20 15:02

I am trying to use groupby and np.std to calculate a standard deviation, but it seems to be calculating a sample standard deviation (with a degrees of freedom equal to 1).

2条回答

余生分开走 (楼主)

2021-02-20 15:40

You can pass additional args to np.std in the agg function:

In [202]:

df.groupby('A').agg(np.std, ddof=0)

Out[202]:
     B  values
A             
1  0.5     2.5
2  0.5     2.5

In [203]:

df.groupby('A').agg(np.std, ddof=1)

Out[203]:
          B    values
A                    
1  0.707107  3.535534
2  0.707107  3.535534

0 讨论(0)

查看其它2个回答