How to split Vector into columns - using PySpark


夕颜 2020-11-22 16:23

Context: I have a DataFrame with two columns, word and vector, where the column type of "vector" is VectorUDT.

An Example:

5 answers

粉色の甜心 (original poster)

2020-11-22 17:20
If you want to split the rawPrediction or probability column produced by a trained PySpark ML model into separate pandas columns, and the DataFrame has already been converted to pandas, you can expand each vector like this:
```
your_pandas_df['probability'].apply(lambda x: pd.Series(x.toArray()))
```
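The one-liner above returns only the expanded values, with integer column labels. A slightly fuller sketch, under some assumptions, might attach them back to the original frame. The helper name `split_vector_column`, the `probability_<i>` naming scheme, and the sample data are all hypothetical; the fallback `DenseVector` class is only a stand-in so the snippet runs without PySpark installed.

```python
import numpy as np
import pandas as pd

try:
    # Real vector type found in VectorUDT columns after toPandas().
    from pyspark.ml.linalg import DenseVector
except ImportError:
    # Hypothetical stand-in so the sketch runs without PySpark installed.
    class DenseVector:
        def __init__(self, values):
            self._values = np.asarray(values, dtype=float)

        def toArray(self):
            return self._values


def split_vector_column(pdf, col, prefix=None):
    """Expand a pandas column of vectors into one numeric column per element."""
    prefix = prefix or col
    # Same technique as the answer: each vector becomes one row of a new frame.
    expanded = pdf[col].apply(lambda v: pd.Series(v.toArray()))
    expanded.columns = [f"{prefix}_{i}" for i in expanded.columns]
    # Drop the original vector column and append the expanded ones.
    return pd.concat([pdf.drop(columns=[col]), expanded], axis=1)


# Illustrative data; in practice this would come from a Spark DataFrame's toPandas().
your_pandas_df = pd.DataFrame({
    "word": ["a", "b"],
    "probability": [DenseVector([0.9, 0.1]), DenseVector([0.2, 0.8])],
})
result = split_vector_column(your_pandas_df, "probability")
```

Here `result` keeps the `word` column and gains `probability_0` and `probability_1`. Note that this works on the driver after collecting the data, so it is probably only suitable when the DataFrame fits in memory.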