Distributed cross correlation matrix computation

前端未结

关注

 2  656

甜味超标 2021-01-12 02:37

How can I calculate pearson cross correlation matrix of large (>10TB) data set, possibly in distributed manner ? Any efficient distributed algorithm suggestion will be ap

2条回答

不知归路 (楼主)

2021-01-12 03:26

Each local data sets can converted into stdv and covariances. Also stdv and covariance and sum make correlation.

This is working example https://github.com/jeesim2/distributed-correlation

0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...