I am in process of gathering information about web analytics tools (like Google Web Analytics) for my next assignment, but I am not able to find any good information. I am looking for:
我正在为我的下一个作业收集有关网络分析工具(如Google Web Analytics)的信息,但我无法找到任何有用的信息。我在寻找:
- Key terms used.
- What all mediums are available for data collection and How they works.
- Any reference books, white papers etc (technical and non technical both).
- Any open source implementation (especially in .NET).
使用的关键术语。
所有媒体都可用于数据收集及其工作原理。
任何参考书,白皮书等(技术和非技术两者)。
任何开源实现(特别是在.NET中)。
3 个解决方案
#1
Here are the key terms used:
以下是使用的关键术语:
- Hit (internet)
- Page view
- Visit / Session
- First Visit / First Session
- Visitor / Unique Visitor / Unique User
- Repeat Visitor
- New Visitor
- Impression
- Singletons
- Bounce Rate
- % Exit
- Visibility time
- Session Duration
- Page View Duration / Time on Page
- Page Depth / Page Views per Session
- Frequency / Session per Unique
- Click path
访问/会话
第一次访问/第一次会议
访客/独特访客/唯一身份用户
页面查看持续时间/页面上的时间
每个会话的页面深度/页面视图
每个唯一的频率/会话
Methods used:
- Web server logfile analysis
- Page tagging
Web服务器日志文件分析
Web server logfile analysis
Web服务器日志文件分析
In this method you write script to scrape details out of your log files and then write it to your database. This method will not give you real time statistics. You can read more about web log analysis software here.
在此方法中,您编写脚本以从日志文件中删除详细信息,然后将其写入数据库。此方法不会为您提供实时统计信息。您可以在此处阅读有关Web日志分析软件的更多信息
Page tagging
Add a code of javascript or just an image and then use the code to get all the dtails about the page, referrr, visitor etc.
添加一个javascript代码或只是一个图像,然后使用代码获取有关页面,referrr,访问者等的所有细节。
...these were images included in a web page that showed the number of times the image had been requested, which was an estimate of the number of visits to that page. In the late 1990s this concept evolved to include a small invisible image instead of a visible one, and, by using JavaScript, to pass along with the image request certain information about the page and the visitor. This information can then be processed remotely by a web analytics company, and extensive statistics generated...
...这些是包含在网页中的图像,显示图像被请求的次数,这是对该页面的访问次数的估计。在20世纪90年代后期,这个概念演变为包括一个小的不可见图像而不是可见图像,并且,通过使用JavaScript,传递图像请求关于页面和访问者的某些信息。然后,网络分析公司可以远程处理此信息,并生成大量统计信息......
If you are using analytics in your own website, you can use the code provided by Eytan Levit
如果您在自己的网站中使用分析,则可以使用Eytan Levit提供的代码
Credit wikipedia. More information can be found there.
信用*。可以在那里找到更多信息。
#2
Well,
I'm no expert, but here is some common data you can retrieve to build you own analytics:
我不是专家,但这里有一些您可以检索的常见数据来构建您自己的分析:
string str;
str += "Refferer:" + Request.UrlReferrer.AbsolutePath.ToString() + "<BR>";
str += "Form data:" + Request.Form.ToString() + "<br>";
str += "User Agent:" + Request.ServerVariables["HTTP_USER_AGENT"] + "<br>";
str += "IP Address:" + Request.UserHostAddress.ToString() + "<BR>";
str += "Browser:" + Request.Browser.Browser + " Version: " + Request.Browser.Version + " Platform: " + Request.Browser.Platform + "<BR>";
str += "Is Crawler: " + Request.Browser.Crawler.ToString() + "<BR>";
str += "QueryString" + Request.QueryString.ToString() + "<BR>";
You can also parse the keyword the user has reached your website from like this:
您还可以解析用户到达您网站的关键字,如下所示:
protected string GetKeywordFromReferrer(string url)
{
if (url.Trim() == "")
{
return "no url";
}
string urlEscaped = Uri.UnescapeDataString(url).Replace('+', ' ');
string terms = "";
string site = "";
Match searchQuery = Regex.Match(urlEscaped, @"[\&\?][qp]\=([^\&]*)");
if (searchQuery.Success)
{
terms = searchQuery.Groups[1].Value;
}
else
{
Match siteDomain = Regex.Match(urlEscaped, @"http\:\/\/(.+?)\/");
if (siteDomain.Success)
{
site = siteDomain.Groups[1].Value;
}
}
if (terms != "")
{
return terms;
}
if (site != "")
{
return site;
}
return "Direct Access";
}
Hope this has helped a bit.
希望这有所帮助。
#3
1. Key terms used
As with answer 1
1.使用的关键术语与答案1一样
2. What all mediums are available for data collection and How they works.
Log files from Apache, IIS. HTTP Handlers for ASP.NET, or your actual page. Javascript includes (the objects available to Javascript give you most information you need about the client)
2.所有可用于数据收集的媒介及其工作原理。来自Apache,IIS的日志文件。 ASP.NET处理程序或您的实际页面。 Javascript包括(Javascript可用的对象为您提供有关客户端的大部分信息)
3. Any reference books, white papers etc (technical and non technical both)
The RFC on HTTP is useful, that gives you most of the request headers that are capturable.
3.任何参考书籍,白皮书等(技术和非技术两者)HTTP上的RFC很有用,它可以为您提供大部分可捕获的请求标头。
4.Any open source implementation (especially in .NET).
4.任何开源实现(特别是在.NET中)。
I wrote one that has the parsing part of the analysis done (in my view the hardest part). It needs a bit of tweaking in certain areas as it's 4 years old:
我写了一个完成分析的解析部分(在我看来最困难的部分)。它在4年前需要在某些领域进行一些调整:
- Statmagic (for log files)
Statmagic(用于日志文件)
It's missing a DAL, which is harder than it sounds - the main hurdle is making sure you don't replicate the exact data that each row of the log has, as you then may as well just use the log files. The other part is displaying this aggregated data in a nice format. My goal was to have it stored in SQL Server, and also db4o format to cater for smaller websites.
它缺少一个比它听起来更难的DAL - 主要障碍是确保你不会复制日志的每一行所具有的确切数据,因为你可以只使用日志文件。另一部分是以一种很好的格式显示这个聚合数据。我的目标是将它存储在SQL Server中,以及db4o格式以满足较小的网站。
The 'sad' part of the Statmagic project is Google came along and completely wiped out the competition and any point in me finishing it.
Statmagic项目中的“悲伤”部分是谷歌出现并彻底消灭了竞争对手以及我完成它的任何一点。
#1
Here are the key terms used:
以下是使用的关键术语:
- Hit (internet)
- Page view
- Visit / Session
- First Visit / First Session
- Visitor / Unique Visitor / Unique User
- Repeat Visitor
- New Visitor
- Impression
- Singletons
- Bounce Rate
- % Exit
- Visibility time
- Session Duration
- Page View Duration / Time on Page
- Page Depth / Page Views per Session
- Frequency / Session per Unique
- Click path
访问/会话
第一次访问/第一次会议
访客/独特访客/唯一身份用户
页面查看持续时间/页面上的时间
每个会话的页面深度/页面视图
每个唯一的频率/会话
Methods used:
- Web server logfile analysis
- Page tagging
Web服务器日志文件分析
Web server logfile analysis
Web服务器日志文件分析
In this method you write script to scrape details out of your log files and then write it to your database. This method will not give you real time statistics. You can read more about web log analysis software here.
在此方法中,您编写脚本以从日志文件中删除详细信息,然后将其写入数据库。此方法不会为您提供实时统计信息。您可以在此处阅读有关Web日志分析软件的更多信息
Page tagging
Add a code of javascript or just an image and then use the code to get all the dtails about the page, referrr, visitor etc.
添加一个javascript代码或只是一个图像,然后使用代码获取有关页面,referrr,访问者等的所有细节。
...these were images included in a web page that showed the number of times the image had been requested, which was an estimate of the number of visits to that page. In the late 1990s this concept evolved to include a small invisible image instead of a visible one, and, by using JavaScript, to pass along with the image request certain information about the page and the visitor. This information can then be processed remotely by a web analytics company, and extensive statistics generated...
...这些是包含在网页中的图像,显示图像被请求的次数,这是对该页面的访问次数的估计。在20世纪90年代后期,这个概念演变为包括一个小的不可见图像而不是可见图像,并且,通过使用JavaScript,传递图像请求关于页面和访问者的某些信息。然后,网络分析公司可以远程处理此信息,并生成大量统计信息......
If you are using analytics in your own website, you can use the code provided by Eytan Levit
如果您在自己的网站中使用分析,则可以使用Eytan Levit提供的代码
Credit wikipedia. More information can be found there.
信用*。可以在那里找到更多信息。
#2
Well,
I'm no expert, but here is some common data you can retrieve to build you own analytics:
我不是专家,但这里有一些您可以检索的常见数据来构建您自己的分析:
string str;
str += "Refferer:" + Request.UrlReferrer.AbsolutePath.ToString() + "<BR>";
str += "Form data:" + Request.Form.ToString() + "<br>";
str += "User Agent:" + Request.ServerVariables["HTTP_USER_AGENT"] + "<br>";
str += "IP Address:" + Request.UserHostAddress.ToString() + "<BR>";
str += "Browser:" + Request.Browser.Browser + " Version: " + Request.Browser.Version + " Platform: " + Request.Browser.Platform + "<BR>";
str += "Is Crawler: " + Request.Browser.Crawler.ToString() + "<BR>";
str += "QueryString" + Request.QueryString.ToString() + "<BR>";
You can also parse the keyword the user has reached your website from like this:
您还可以解析用户到达您网站的关键字,如下所示:
protected string GetKeywordFromReferrer(string url)
{
if (url.Trim() == "")
{
return "no url";
}
string urlEscaped = Uri.UnescapeDataString(url).Replace('+', ' ');
string terms = "";
string site = "";
Match searchQuery = Regex.Match(urlEscaped, @"[\&\?][qp]\=([^\&]*)");
if (searchQuery.Success)
{
terms = searchQuery.Groups[1].Value;
}
else
{
Match siteDomain = Regex.Match(urlEscaped, @"http\:\/\/(.+?)\/");
if (siteDomain.Success)
{
site = siteDomain.Groups[1].Value;
}
}
if (terms != "")
{
return terms;
}
if (site != "")
{
return site;
}
return "Direct Access";
}
Hope this has helped a bit.
希望这有所帮助。
#3
1. Key terms used
As with answer 1
1.使用的关键术语与答案1一样
2. What all mediums are available for data collection and How they works.
Log files from Apache, IIS. HTTP Handlers for ASP.NET, or your actual page. Javascript includes (the objects available to Javascript give you most information you need about the client)
2.所有可用于数据收集的媒介及其工作原理。来自Apache,IIS的日志文件。 ASP.NET处理程序或您的实际页面。 Javascript包括(Javascript可用的对象为您提供有关客户端的大部分信息)
3. Any reference books, white papers etc (technical and non technical both)
The RFC on HTTP is useful, that gives you most of the request headers that are capturable.
3.任何参考书籍,白皮书等(技术和非技术两者)HTTP上的RFC很有用,它可以为您提供大部分可捕获的请求标头。
4.Any open source implementation (especially in .NET).
4.任何开源实现(特别是在.NET中)。
I wrote one that has the parsing part of the analysis done (in my view the hardest part). It needs a bit of tweaking in certain areas as it's 4 years old:
我写了一个完成分析的解析部分(在我看来最困难的部分)。它在4年前需要在某些领域进行一些调整:
- Statmagic (for log files)
Statmagic(用于日志文件)
It's missing a DAL, which is harder than it sounds - the main hurdle is making sure you don't replicate the exact data that each row of the log has, as you then may as well just use the log files. The other part is displaying this aggregated data in a nice format. My goal was to have it stored in SQL Server, and also db4o format to cater for smaller websites.
它缺少一个比它听起来更难的DAL - 主要障碍是确保你不会复制日志的每一行所具有的确切数据,因为你可以只使用日志文件。另一部分是以一种很好的格式显示这个聚合数据。我的目标是将它存储在SQL Server中,以及db4o格式以满足较小的网站。
The 'sad' part of the Statmagic project is Google came along and completely wiped out the competition and any point in me finishing it.
Statmagic项目中的“悲伤”部分是谷歌出现并彻底消灭了竞争对手以及我完成它的任何一点。