Tensorflow transform on beams with flink runner

戏子无情 提交于 2019-12-08 13:08:32

Thanks for the question.

Disclaimer: Portable Flink Runner is still in experimental phase will only work with trivial amount of input data.

Here is how you can run TFX on Flink via Beam.

Prerequisite

Instructions to run a python pipeline: https://beam.apache.org/roadmap/portability/#python-on-flink
Note: We currently only support Flink 1.5.5

Instructions

1) Build Worker Containers:

  • Go to Beam checkout dir
  • Run gradle command: ./gradlew :beam-sdks-python-container:docker

2) Run Beam JobServer for Flink:

  • Go to Beam checkout dir
  • Run gradle command: ./gradlew beam-runners-flink_2.11-job-server:runShadow Note: this command will not finish as it starts the job server and keep it running.

3) Submit a pipeline

--experiments=beam_fn_api --runner PortableRunner --job_endpoint=localhost:8099 --experiments=worker_threads=100 --execution_mode_for_batch=BATCH_FORCED

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!