使用Python SDK创建Cloud Dataflow模板的步骤

时间:2020-12-15 15:33:56

I have created Pipeline in Python using Apache Beam SDK, and Dataflow jobs are running perfectly from command-line.

我使用Apache Beam SDK在Python中创建了Pipeline,并且Dataflow作业在命令行中运行良好。

Now, I'd like to run those jobs from UI. For that i have to create template file for my job. I found steps to create template in Java using maven.

现在,我想从UI运行这些工作。为此,我必须为我的工作创建模板文件。我找到了使用maven在Java中创建模板的步骤。

But how do I do it using the Python SDK?

但是我如何使用Python SDK呢?

1 个解决方案

#1


3  

Templates are available for creation in the Dataflow Python SDK since April of 2017. Here is the documentation.

自2017年4月起,可以在Dataflow Python SDK中创建模板。以下是文档。

To run a template, no SDK is needed (which is the main problem templates try to solve), so you can run them from the UI, REST API, or CL and here is how.

要运行模板,不需要SDK(这是模板试图解决的主要问题),因此您可以从UI,REST API或CL运行它们,这就是方法。

#1


3  

Templates are available for creation in the Dataflow Python SDK since April of 2017. Here is the documentation.

自2017年4月起,可以在Dataflow Python SDK中创建模板。以下是文档。

To run a template, no SDK is needed (which is the main problem templates try to solve), so you can run them from the UI, REST API, or CL and here is how.

要运行模板,不需要SDK(这是模板试图解决的主要问题),因此您可以从UI,REST API或CL运行它们,这就是方法。