How can I perform data lineage in GCP?

只谈情不闲聊 提交于 2019-12-23 15:15:08

问题


When we realize the data lake with GCP Cloud storage, and data processing with Cloud services such as Dataproc, Dataflow How can we generated data lineage report in GCP. Thanks.


回答1:


Google Cloud Platform doesn't have serverless data lineage offering.

Instead, you may want to install Apache Atlas on Google Cloud Dataproc and use it for data lineage.




回答2:


If data lineage is important for you, you will find yourself wanting an Enterprise Data Cloud.

Cloudera is the main supplier in this space, and will allow you to work on Google Cloud (or anywhere else) with mature data governance.


Though I personally stand behind this message, I do want to mention that I happen to be an employee of Cloudera.



来源:https://stackoverflow.com/questions/55000865/how-can-i-perform-data-lineage-in-gcp

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!