问题
I have spent hours trying to make statsmodels do my MANOVA without success. Here is the code:
from statsmodels.multivariate.manova import MANOVA
df = data
feats_list = ['col1', 'col2', 'col3' ... 'col4']
var_list = ['col5', 'col6']
endog, exog = np.asarray(df[feats_list]), np.asarray(df[var_list])
manov = MANOVA(endog, exog)
manov.mv_test()
Providing:
---------------------------------------------------------------------------
IndexError Traceback (most recent call last)
<ipython-input-16-c3fc1d1f16f6> in <module>()
1 manov = MANOVA(endog, exog)
----> 2 manov.mv_test()
~\Anaconda3\lib\site-packages\statsmodels\multivariate\manova.py in mv_test(self, hypotheses)
68 name = 'x%d' % (i)
69 L = np.zeros([1, self.exog.shape[1]])
---> 70 L[i] = 1
71 hypotheses.append([name, L, None])
72
IndexError: index 1 is out of bounds for axis 0 with size
1
I tried also to put the hypotheses by myself and I always get a SingularMatrixError so I suppose that I am not using the class correctly.
Thanks in advance for your help.
回答1:
Based on issue 4903 referenced by Josef in the comments above, the following would work
from statsmodels.multivariate.manova import MANOVA
feats_list = ['col1', 'col2', 'col3', 'col4']
var_list = ['col5', 'col6']
df = pd.DataFrame(
np.random.random_sample(size=(100,6)),
columns=feats_list + var_list
)
endog, exog = np.asarray(df[feats_list]), np.asarray(df[var_list])
mod = MANOVA.from_formula('col1 + col2 + col3 + col4 ~ col5 + col6', data=df)
r = mod.mv_test()
print(r)
Multivariate linear model
============================================================
------------------------------------------------------------
Intercept Value Num DF Den DF F Value Pr > F
------------------------------------------------------------
Wilks' lambda 0.3420 4.0000 94.0000 45.2047 0.0000
Pillai's trace 0.6580 4.0000 94.0000 45.2047 0.0000
Hotelling-Lawley trace 1.9236 4.0000 94.0000 45.2047 0.0000
Roy's greatest root 1.9236 4.0000 94.0000 45.2047 0.0000
------------------------------------------------------------
------------------------------------------------------------
col5 Value Num DF Den DF F Value Pr > F
------------------------------------------------------------
Wilks' lambda 0.9297 4.0000 94.0000 1.7775 0.1399
Pillai's trace 0.0703 4.0000 94.0000 1.7775 0.1399
Hotelling-Lawley trace 0.0756 4.0000 94.0000 1.7775 0.1399
Roy's greatest root 0.0756 4.0000 94.0000 1.7775 0.1399
------------------------------------------------------------
------------------------------------------------------------
col6 Value Num DF Den DF F Value Pr > F
------------------------------------------------------------
Wilks' lambda 0.9891 4.0000 94.0000 0.2590 0.9035
Pillai's trace 0.0109 4.0000 94.0000 0.2590 0.9035
Hotelling-Lawley trace 0.0110 4.0000 94.0000 0.2590 0.9035
Roy's greatest root 0.0110 4.0000 94.0000 0.2590 0.9035
============================================================
来源:https://stackoverflow.com/questions/51840604/statsmodels-manova-indexerror-index-1-is-out-of-bounds-for-axis-0-with-size-1