Apache Airflow or Apache Beam for data processing and job scheduling

前端 未结 4 1423
难免孤独
难免孤独 2021-01-30 10:57

I\'m trying to give useful information but I am far from being a data engineer.

I am currently using the python library pandas to execute a long series of transformation

4条回答
  •  再見小時候
    2021-01-30 11:18

    I am doing the same as you with airflow, and I've got very good results. I am not very sure about the following: Beam is machine learning focused and airflow is for anything you want. Finally you can create a hive with kubernetes +airflow.

提交回复
热议问题