是否可以使用Google Dataflow处理Google Analytics数据?

时间:2022-01-15 14:01:46

I would like to use Google Dataflow to process Google Analytics data from many websites and store the results in a Google SQL.

我想使用Google Dataflow处理来自许多网站的Google Analytics数据,并将结果存储在Google SQL中。

But I was doing some research and I couldn't find a way to read the GA data from Google Dataflow directly.

但我正在做一些研究,我找不到直接从Google Dataflow读取GA数据的方法。

There is one way if you have Google Analytics 360 you can enable the bigQuery integration but I will be using the Free version of Google Analytics not the 360.

如果你有谷歌分析360有一种方法,你可以启用bigQuery集成,但我将使用免费版的谷歌分析而不是360。

Has anybody worked with something similar?

有人有类似的东西吗?

Cheers,

干杯,

1 个解决方案

#1


2  

It would really depend on what type of data you're looking to pull. The GA360 integration with BigQuery extracts all the raw data (hit level) from Google Analytics. There is no way to do the same thing with the free version of Google Analytics.

这实际上取决于您希望提取的数据类型。 GA360与BigQuery的集成从Google Analytics中提取所有原始数据(命中级别)。使用免费版Google Analytics无法做同样的事情。

However, you should be able to write your own Apache Beam Source that pulls data from Google Analytics using the Reporting API. Of course you would be restricted by the API limits.

但是,您应该能够编写自己的Apache Beam Source,使用Reporting API从Google Analytics中提取数据。当然,您将受到API限制的限制。

As I mentioned in the beginning of my answer, this all depends on what type of data you're trying to get at. Aggregated/high-level data should be possible but granular/hit-level data won't.

正如我在回答的开头所提到的,这完全取决于您尝试获取的数据类型。应该可以使用聚合/高级数据,但粒度/命中级数据不会。

#1


2  

It would really depend on what type of data you're looking to pull. The GA360 integration with BigQuery extracts all the raw data (hit level) from Google Analytics. There is no way to do the same thing with the free version of Google Analytics.

这实际上取决于您希望提取的数据类型。 GA360与BigQuery的集成从Google Analytics中提取所有原始数据(命中级别)。使用免费版Google Analytics无法做同样的事情。

However, you should be able to write your own Apache Beam Source that pulls data from Google Analytics using the Reporting API. Of course you would be restricted by the API limits.

但是,您应该能够编写自己的Apache Beam Source,使用Reporting API从Google Analytics中提取数据。当然,您将受到API限制的限制。

As I mentioned in the beginning of my answer, this all depends on what type of data you're trying to get at. Aggregated/high-level data should be possible but granular/hit-level data won't.

正如我在回答的开头所提到的,这完全取决于您尝试获取的数据类型。应该可以使用聚合/高级数据,但粒度/命中级数据不会。