I am sending data to PubSub from where I am trying to create a DataFlow job to put data into BigQuery. I have a column in the data for unique that I want to do HLL_COUNT.INIT Is there an equivalent method on the DataFlow side so that I can directly store the HLL version of the column in BigQuery?
我正在向PubSub发送数据,我正在尝试创建一个DataFlow作业以将数据放入BigQuery。我在数据中有一列我想要做的唯一HLL_COUNT.INIT在DataFlow方面是否有一个等效的方法,以便我可以直接在BigQuery中存储该列的HLL版本?
1 个解决方案
#1
1
No, DataFlow doesn't have support for BigQuery HLL sketches format, but it is clearly something that would be useful. I created feature request for it in DataFlow issue tracker: https://issuetracker.google.com/62153424
不,DataFlow不支持BigQuery HLL草图格式,但它显然是有用的。我在DataFlow问题跟踪器中为其创建了功能请求:https://issuetracker.google.com/62153424
#1
1
No, DataFlow doesn't have support for BigQuery HLL sketches format, but it is clearly something that would be useful. I created feature request for it in DataFlow issue tracker: https://issuetracker.google.com/62153424
不,DataFlow不支持BigQuery HLL草图格式,但它显然是有用的。我在DataFlow问题跟踪器中为其创建了功能请求:https://issuetracker.google.com/62153424