What is the simplest way to save a file from Google Dataflow to Google Cloud Storage?

Time: 2021-03-15 15:23:23

I've got my pipeline up and running, but I need to save a number to a file so that I can resume from the previous processing point if something goes wrong (or when I want to deploy a new version of the pipeline and cannot use the update-pipeline feature).

I tried adding the google-cloud-storage libraries, but I get errors at runtime (some classes from google-cloud-core are not found, even though all my dependencies are up to date). There must be a simple way just to save (and later read) a file. I would be grateful for a snippet or a link.

1 solution

#1


0  

If you are stuck in dependency hell and must do this step from within Dataflow, I would recommend looking at side inputs and side outputs, and pairing them with some simple TextIO operations.
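As a rough illustration of that suggestion, here is a minimal sketch that writes a single number with a text sink and later reads it back as a side input. It assumes the Apache Beam Python SDK (the thread does not say which SDK is in use), and the function names, the `gs://` path in the usage note, and the "offset" interpretation of the saved number are all hypothetical. Beam is imported lazily inside the pipeline functions so the small helpers can be used without it.

```python
def format_checkpoint(value: int) -> str:
    """Serialize the checkpoint number as one line of text."""
    return str(int(value))


def parse_checkpoint(line: str) -> int:
    """Parse a checkpoint line read back from the text file."""
    return int(line.strip())


def is_after_checkpoint(offset: int, checkpoint: int) -> bool:
    """Keep only elements that come after the saved checkpoint."""
    return offset > checkpoint


def write_checkpoint(value: int, file_path: str) -> None:
    """Write the checkpoint to exactly one text file at file_path."""
    import apache_beam as beam  # lazy import: assumes apache-beam is installed

    with beam.Pipeline() as p:
        (
            p
            | "CreateCheckpoint" >> beam.Create([format_checkpoint(value)])
            | "WriteCheckpoint" >> beam.io.WriteToText(
                file_path,
                num_shards=1,            # a single output file
                shard_name_template="",  # no "-00000-of-00001" suffix
            )
        )


def resume_from_checkpoint(file_path: str, offsets) -> None:
    """Read the checkpoint as a side input and skip already-processed offsets."""
    import apache_beam as beam  # lazy import: assumes apache-beam is installed

    with beam.Pipeline() as p:
        checkpoint = (
            p
            | "ReadCheckpoint" >> beam.io.ReadFromText(file_path)
            | "ParseCheckpoint" >> beam.Map(parse_checkpoint)
        )
        (
            p
            | "CreateOffsets" >> beam.Create(list(offsets))
            | "SkipProcessed" >> beam.Filter(
                is_after_checkpoint,
                checkpoint=beam.pvalue.AsSingleton(checkpoint),
            )
            | "Print" >> beam.Map(print)  # stand-in for the real processing
        )
```

A call such as `write_checkpoint(42, "gs://my-bucket/checkpoints/last_offset.txt")` followed later by `resume_from_checkpoint("gs://my-bucket/checkpoints/last_offset.txt", range(100))` would be the intended usage; the bucket name is made up, and `gs://` paths need the `apache-beam[gcp]` extra. The read step assumes the checkpoint file already exists and holds exactly one line.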
