<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>showmeanalytics.com</title>
	<atom:link href="http://showmeanalytics.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://showmeanalytics.com</link>
	<description>Analytics from the Show Me State</description>
	<lastBuildDate>Wed, 05 May 2010 23:56:40 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Web Analytics and the User Experience Team: WA + UX = WIN</title>
		<link>http://showmeanalytics.com/2010/05/web-analytics-and-the-user-experience-team/</link>
		<comments>http://showmeanalytics.com/2010/05/web-analytics-and-the-user-experience-team/#comments</comments>
		<pubDate>Wed, 05 May 2010 23:51:11 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=125</guid>
		<description><![CDATA[I had a chance to chat with Andrew Janis from Evantage Consulting Monday night at eMetrics, and the subject of web analytics + user experience came up. Sorry to say I missed his talk yesterday, but I did get a chance to read his blog post, Five Web Analytics Metrics for User Experience Professionals, where [...]]]></description>
			<content:encoded><![CDATA[<p>I had a chance to chat with Andrew Janis from Evantage Consulting Monday night at eMetrics, and the subject of web analytics + user experience came up. Sorry to say I missed his talk yesterday, but I did get a chance to read his blog post, <a href="http://userexperience.evantageconsulting.com/2010/04/five-web-analytics-metrics-user-experienc/">Five Web Analytics Metrics for User Experience Professionals</a>, where he lays the groundwork for how UX people can work with analytics data. I’ve been working with our UX team lately, so his post was timely. I’ve heard a lot at this conference about marrying the “what” with the “why,” which is exactly what we do as analysts when we reach out to our UX teams.</p>
<p>How can a UX professional – and indeed, the entire business – benefit from web analytics? There are a number of ways.</p>
<p>Analytics can <strong>help UX focus their efforts</strong> where they can have the <strong>most impact</strong>. When site changes are suggested or when user pain points are uncovered, look at your analytics data to see how many users are affected. This helps the user testing team (and the business owner, too) allocate scarce resources to the areas of the site where they can make the most impact. Changes that affect only a small number of users, or are within a section of the site that is not used very often, should take lower priority over improvements that can affect more users unless there is a specific reason (e.g. the handful of dissatisfied users represent your most profitable customers).</p>
<p><strong>Look for</strong> <strong>user stumbling blocks</strong> that you don’t even know you have. Scenario analysis (a/k/a funnel analysis) shows progression through a process on the site. Finding disproportionately large abandonment on certain steps of the process can point to user issues with the process. Similarly, errors on the site and in forms also point to areas where users are not having a good experience.</p>
<p>Help to <strong>formulate questions</strong> that can be answered through user testing. For example, analytics can identify little-used tools or sections of a website, but it can’t tell us <em>why</em> those sections are underutilized. User testing, however, can tease out whether it’s because those sections are poorly designed and implemented, or if they’re just not part of why people come to the site in the first place. In the former case, improving the design or content is justified. In the latter, the business can avoid spending resources improving features your users don’t care about.</p>
<p>Ensure <strong>test subjects are representative</strong> of actual site users. If a website contains links to information for different audiences (e.g. links for students vs. teachers, or resources for small businesses vs. large businesses), you can often get an idea of the breakdown of those populations by looking at relative usage in each area.  For sites requiring authentication, web analysts also often have access to demographic information in addition to the behavioral data, and can help profile the different segments that should be represented in the user tests.</p>
<p>Last, but certainly not least, web analytics can help <strong>quantify and document how UX adds value</strong> to a project. Before a redesign, UX and WA should work together to determine which metrics show that a visit is or isn’t successful. These metrics can be collected (or pulled historically) for several weeks or months before the redesign, and compared with the same metrics after the redesign to provide concrete evidence of what works and what doesn’t, and how much impact the changes made.</p>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2010/05/web-analytics-and-the-user-experience-team/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>You say I&#8217;m engaged, I say you&#8217;re wasting my time</title>
		<link>http://showmeanalytics.com/2010/02/you-say-im-engaged-i-say-youre-wasting-my-time/</link>
		<comments>http://showmeanalytics.com/2010/02/you-say-im-engaged-i-say-youre-wasting-my-time/#comments</comments>
		<pubDate>Fri, 05 Feb 2010 04:36:02 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[engagement]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=116</guid>
		<description><![CDATA[For content sites, web analysts often look at engagement-related metrics to try to assess whether or not visitors are having a successful visit. After all, there is no transaction like a purchase, to tell us that something &#8220;good&#8221; happened, if not for our visitor then at least for our business. We may look at metrics [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://showmeanalytics.com/wp-content/uploads/2010/02/ring_small.jpg"><img class="alignright size-full wp-image-120" title="ring" src="http://showmeanalytics.com/wp-content/uploads/2010/02/ring_small.jpg" alt="ring" width="250" height="243" /></a>For content sites, web analysts often look at engagement-related metrics to try to assess whether or not visitors are having a successful visit. After all, there is no transaction like a purchase, to tell us that something &#8220;good&#8221; happened, if not for our visitor then at least for our business. We may look at metrics like time on site, content pages viewed per visit, the ratio of navigation page views to content page views, and micro-conversions like viewing a print-ready version of an article, or emailing a link to a colleague.</p>
<p>I&#8217;ve always suspected there is a fine line between engaging people and wasting their time, especially when you&#8217;re dealing with B2B sites. After all, when I&#8217;m looking for information as part of my workday, I don&#8217;t want to spend a lot of time on a site. I don&#8217;t want to view a lot of pages: I want to find the answer to my question right away. Even micro-conversions don&#8217;t necessarily mean that my visit was successful: maybe I&#8217;m printing out pages or emailing myself a link because I don&#8217;t have time to wade through the confusion right now.</p>
<p>I was recently doing an analysis for a site, and was curious about the &#8220;engagement&#8221; level of two important customer segments. I looked at time on site, content pages per visit, bounce rate, navigation to content ratio (i.e. for a ratio of 2, it would mean that on average, for every content page viewed, visitors must view 2 navigation pages), % visits that contained email to friend actions, and % visits that contained at least one print-ready view. I couldn&#8217;t factor  in other potential engagement/loyalty metrics (visit frequency, etc.) because of the large number of shared computers and accounts for this particular site, which is heavily used in a workplace setting. Here&#8217;s what I found.</p>
<ul>
<li>Segment B had half the bounce rate of Segment A (although both were pretty low).</li>
<li>Segment B spent 30% more time on the site.</li>
<li>Segment B viewed 30% more content.</li>
<li>Segment B viewed print-ready pages in 4% more visits, close enough that I&#8217;d consider their usage of this function to be roughly equivalent.</li>
<li>Segment B needed to go through 8% fewer navigational pages in order to find content.</li>
<li>Usage of the email function was similar for both segments, just slightly higher for Segment A.</li>
</ul>
<p>By all measures except the two conversions, I would have considered Segment B to be a good bit more &#8220;engaged&#8221; than Segment A. Even on the print and email conversions they were roughly the same. But my satisfaction surveys told a different story. I used the responses to our &#8220;Were you able to find the information you were looking for?&#8221; question to double-check overall satisfaction scores. What did I find?</p>
<ul>
<li>Segment B scored 2 points <em>lower </em>than Segment A on overall satisfaction (on a 0-100 scale).</li>
<li>Segment B respondents were 7% less likely to answer &#8220;Yes&#8221; to the question about whether they found what they are looking for.</li>
</ul>
<p>I don&#8217;t know if the above differences are statistically significant, but what I absolutely do know is that higher performance on engagement-related metrics did not mean that Segment B customers were happier or more satisfied with the site. If anything, it&#8217;s just the opposite.</p>
<p>The best thing you can do for a content site (and other sites, too, IMHO) is to install continuous surveys. That&#8217;s the only way you will ever be able to assess the quality of your visitors&#8217; experience online. If you can&#8217;t afford one of the for-pay surveys that can be well-customized (from <a href="http://www.foreseeresults.com/">ForeSee Results</a> or <a href="http://www.iperceptions.com/">iPerceptions</a>), there are still a number of customer satisfaction tools, like <a href="http://www.4qsurvey.com/">4Q </a>and <a href="http://www.kampyle.com/">Kampyle</a>, that allow you to ask your customers straight out whether or not their visit was successful.</p>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2010/02/you-say-im-engaged-i-say-youre-wasting-my-time/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>One visit, two user agents</title>
		<link>http://showmeanalytics.com/2009/07/one-visit-two-user-agents/</link>
		<comments>http://showmeanalytics.com/2009/07/one-visit-two-user-agents/#comments</comments>
		<pubDate>Tue, 14 Jul 2009 12:50:58 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[browsers]]></category>
		<category><![CDATA[Logfiles]]></category>
		<category><![CDATA[visits]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=110</guid>
		<description><![CDATA[I found out recently that visitors using Internet Explorer 8 on a site that is not compatible with that browser, can exhibit multiple user agent strings during one visit. This is because of a compatibility view provided in IE8 that makes it look and act mostly (but not exactly) like IE7, for sites that don’t [...]]]></description>
			<content:encoded><![CDATA[<p>I found out recently that visitors using Internet Explorer 8 on a site that is not compatible with that browser, can exhibit multiple user agent strings during one visit. This is because of a <a href="http://blogs.msdn.com/ie/archive/2008/08/27/introducing-compatibility-view.aspx">compatibility view</a> provided in IE8 that makes it look and act mostly (<a href="http://blogs.msdn.com/ie/archive/2009/03/12/site-compatibility-and-ie8.aspx">but not exactly</a>) like IE7, for sites that don’t play nicely with the newer browser.  If you are trying to provide a proper browser breakdown in support of a site redesign, or if you are troubleshooting browser-related data or user problems, the compatibility view will complicate things.</p>
<p>I assume that most web analytics tools identify the IE version by looking for <em>MSIE X.Y</em> in the browser string. However, this is no longer valid for IE8. This is because the IE8 user agent string will include <em>MSIE 7.0</em> when in compatibility mode. The difference between the “real” IE7, and IE8 in compatibility mode is the word <em>Trident</em>, which is included in both variants of IE8:</p>
<p><em>Example of a regular IE8 user agent: </em>Mozilla/4.0 (compatible; <strong>MSIE 8.0</strong>; Windows NT 6.0; <strong>Trident</strong>/4.0; SLCC1; Media Center PC 5.0; .NET CLR 3.5.21022)</p>
<p><em>Example of IE8 in compatibility mode:</em> Mozilla/4.0 (compatible; <strong>MSIE 7.0</strong>; Windows NT 6.0; <strong>Trident</strong>/4.0; SLCC1; Media Center PC 5.0; .NET CLR 3.5.21022)</p>
<p>Literally thousands of web sites are not compatible with IE8. A list of <a href="http://www.microsoft.com/downloads/thankyou.aspx?familyId=b885e621-91b7-432d-8175-a745b87d2588&amp;displayLang=en">more than 3,000 incompatible sites</a> is maintained by Microsoft.  This list can be downloaded by IE8 users so that the browser can automatically switch itself into compatibility view when a site is encountered that has previously been identified by IE8 users as incompatible. Many more sites are not compatible, but are not on the list because they have lower traffic levels.</p>
<p>Because a visitor can have multiple user agents in one visit, this raises a number of questions:</p>
<ul>
<li>Does your analytics tool keep the user agent string from each individual page view, or do they associate one browser with the entire visit?</li>
<li>If browser is associated with the entire visit, which browser is recorded? If they keep the string on the entry page, then IE8 is likely represented correctly in your data, but you won’t know if users are resorting to compatibility mode in order to view your site. If your analytics tool keeps the last browser string encountered in the visit, then your numbers are likely biased toward IE7 unless your tool is properly grouping this traffic as IE8.</li>
<li>If browser is associated with page views instead of the visit, then adding up visits in your browser report would give you more than the total visits for your site. In other words, browser visits would not be “summable” the way they are when one can assume that each visit has only one browser. This is not the end of the world, just something to be aware of because it’s not intuitive.</li>
<li>Does your analytics tool properly group the browsers with both <em>MSIE 7.0</em> and <em>Trident</em> as IE8? If not, do they expose the entire string so you can do the calculations yourself to see if your site has IE8 issues?</li>
<li>If you are doing logfile analysis without cookies, sessionization is probably based on IP + User Agent. For sites where I’ve transitioned from logfiles to tags in the same tool, my experience has been that IP/User Agent sessionization tends to over-count visits: this issue will increase that inflation even more. Bear in mind that many tag-based tools resort to IP/UA when cookies are blocked, so there could be a small inflation effect regardless of the type of data-collection you use.</li>
</ul>
<p>I examined a few of my sites and found the percentage of visits with IE8 to be roughly between 5% and 15%, depending on the site. My B2B sites tend to have lower IE8 penetration, while sites that attract high-tech users will tend to show a higher percentage of the latest browsers.</p>
<p>If your web analytics tool exposes the entire browser string (Google Analytics does not), I recommend you search through your user agent strings looking for <em>Trident</em>, and see for yourself if this is an issue for the sites you analyze. One metric I’m looking at is the percentage of my <em>Trident</em> browser visits that also contain <em>MSIE 7</em>, assuming that sites that are not compatible with IE8 will show a higher percentage of users resorting to compatibility mode. For a site with known IE8 issues I calculated 25% , while another site I randomly chose calculated to 12%. I haven’t examined enough sites yet to know if that means the second site also has IE8 issues, or if it just means it&#8217;s &#8220;normal&#8221; for a certain percentage of IE8 users to surf in compatibility mode. Clearly I have more work to do.</p>
<p><strong>Update</strong>: Last night I received an email from a colleague who had read this post, asking why should they care? It&#8217;s a fair question so I thought I&#8217;d answer it publicly.</p>
<p>First, if you&#8217;re asking then you probably aren&#8217;t in a situation where you need to care. That&#8217;s OK: the lowly browser report isn&#8217;t the most important report in your web analytics tool, not by a long shot.</p>
<p>But I can think of a couple of situations where it&#8217;s important:</p>
<p>1. When deciding whether or not to fund development changes to enable compatibility with certain browsers, &#8220;fewer than 5% of our visits use that browser&#8221; is a lot different than &#8220;nearly 10% of our visits use that browser&#8221;.  The numbers you use for those decisions should be as accurate as practical.</p>
<p>2. Your customer service department may receive emails or phone calls from visitors complaining that they are unable to perform certain tasks on your site (like complete a transaction). When they receive multiple complaints that sound similar but are unable to reproduce the problem in house they may ask you, the analytics ninja, for help defining the scope of the problem. These intermittent issues are difficult to troubleshoot because they&#8217;re often environment-related. One starting point is to examine the user experience through that transaction &#8212; transaction page views per visit is sometimes sufficient, or you may want to look at a funnel chart for the process &#8212; and segment it by different browser versions. If the issue is due to a browser incompatibility, you can sometimes pinpoint it quickly with this type of analysis.</p>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2009/07/one-visit-two-user-agents/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Perverts Make My Job Interesting</title>
		<link>http://showmeanalytics.com/2009/07/perverts-make-my-job-interesting/</link>
		<comments>http://showmeanalytics.com/2009/07/perverts-make-my-job-interesting/#comments</comments>
		<pubDate>Sun, 05 Jul 2009 21:57:58 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Logfiles]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[search keywords]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=103</guid>
		<description><![CDATA[If you are a web analyst, and you have ever had to Google “zoo porn” as part of your job, you would understand why I loathe the idea of targeted advertising based on user searches. The terms I’ve searched as part of my job have gotten me on the net-nanny list of every employer I’ve [...]]]></description>
			<content:encoded><![CDATA[<p>If you are a web analyst, and you have ever had to Google “zoo porn” as part of your job, you would understand why I loathe the idea of targeted advertising based on user searches. The terms I’ve searched as part of my job have gotten me on the net-nanny list of every employer I’ve had since working in this field. It’s the perverts: they really affect my data.</p>
<div id="attachment_106" class="wp-caption aligncenter" style="width: 451px"><a href="http://showmeanalytics.com/wp-content/uploads/2009/07/fark.jpg"><img class="size-full wp-image-106" title="Screenshot: www.fark.com" src="http://showmeanalytics.com/wp-content/uploads/2009/07/fark.jpg" alt="Screenshot: www.fark.com" width="441" height="66" /></a><p class="wp-caption-text">If Fark is to be believed, the Internet is all about porn anyway.</p></div>
<p style="text-align: center;">
<p>For the record, I don’t analyze porn sites for a living. While I admit I have done analysis for at least one adult-oriented site in the past, this is different. This is the effect of sexually-oriented search terms on websites that have little or nothing to do with sex, websites that I would happily show to my mother. But if you analyze a wide enough variety of sites, you will find that fetishes come in a surprising variety of shapes and sizes, and you’ll be surprised where they, um, pop up.</p>
<p>There are three ways that these “thrill-seekers” may affect your data.</p>
<p style="padding-left: 30px;">1. <strong>By causing a one-time traffic spike</strong>. This is more likely to happen for a blog or a news site, when an article mentions something sexual in a fairly innocuous way. For example, this article contains plenty of keywords that may attract traffic that is not part of my target audience (and if you haven’t bounced by now, welcome to the world of web analytics!). This can happen on news or magazine sites that run features on a variety of subjects, and it can often catch the web analyst off guard. For example, consider the more-or-less legitimate &#8212; if somewhat sensational &#8212; news articles that were all the rage a couple months ago, talking about teens sending naked pictures of themselves to each other on cell phones. When you mention “teens” and “sex” and “naked pictures”  in the same article, you’re bound to attract some of <em>that</em> kind of traffic.</p>
<p style="padding-left: 30px;">This usually only becomes an issue when the traffic spike for a single article is large enough to influence aggregate numbers for the entire week or month. Any sudden spike (or dip) in traffic should always be investigated: it may have been due to a simple editorial choice instead of that awesome marketing campaign that your HiPPO designed.</p>
<p style="padding-left: 30px;">2. <strong>By inflating search engine visits long-term</strong>. Perhaps “inflating” isn’t the best term, since the traffic is real, it’s human, and it’s coming from search engines. This situation happens when there are articles or images on your site that are intended for one audience but end up attracting another audience – the kind that’s not likely to become a customer – and it can wreak havoc with your conversion rates. A prime example is a site that publishes medical information intended for a professional medical audience. A thorough enough site will likely contain pictures of certain body parts or descriptions of rare medical procedures, and a glance through some of your top search terms can yield insights into the human psyche that you wish you didn’t know.</p>
<p style="padding-left: 30px;">Always look past the “Top X” keyword report that is spit out of your web analytics package by default. Look for terms that seem over-represented on a site like yours. Pay careful attention to image searches, and ensure that you can separate image search keywords from text search keywords if necessary.</p>
<p style="padding-left: 30px;">3. <strong>By logging visits that never really happened</strong>. This is fairly rare, and you will likely only catch it if a) your analytics are based on server logs instead of JavaScript tags, and b) your site contains one or more unprotected redirect URLs, “pages” that contain a URL as a value in the query string. The symptom is a sudden appearance in your keyword reports of sexually-oriented phrases that have absolutely nothing to do with your site. The cause is a search engine ranking hack, where a site-of-ill-repute manages to get themselves indexed by means of your redirect URLs, using your site’s good reputation to increase their rankings. You can confirm by looking at the entry pages for the offending terms to see if they are the redirect pages.</p>
<p>As with any traffic that is obviously unqualified, you very likely want to segment out the perverts from some of your conversion rate calculations, especially if you are doing optimization efforts on one or more areas of your site. Unqualified traffic volume can be more than enough to skew results and mask changes to real customer behavior. However, I don’t recommend you filter this traffic from your entire data set. If your linking, advertising, or SEO efforts are bringing in the wrong kind of traffic, this is something you really need to know.</p>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2009/07/perverts-make-my-job-interesting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Estimating the effects of cookie-deletion</title>
		<link>http://showmeanalytics.com/2009/04/calculating-the-effects-of-cookie-deletion/</link>
		<comments>http://showmeanalytics.com/2009/04/calculating-the-effects-of-cookie-deletion/#comments</comments>
		<pubDate>Fri, 10 Apr 2009 11:59:43 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[cookie-deletion]]></category>
		<category><![CDATA[cookies]]></category>
		<category><![CDATA[unique visitors]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=75</guid>
		<description><![CDATA[There are differing opinions on how to label the metric historically known as &#8220;Unique Visitors&#8221;. On one side of the fence are those who think it should be relabeled &#8220;Unique Cookies&#8221;, since that is the most popular method used for calculations. On the other side of the fence are others who think the metric is [...]]]></description>
			<content:encoded><![CDATA[<p>There are differing opinions on how to label the metric historically known as &#8220;Unique Visitors&#8221;. On one side of the fence are those who think it should be relabeled &#8220;Unique Cookies&#8221;, since that is the most popular method used for calculations. On the other side of the fence are others who think the metric is a catch-all for the &#8220;best available&#8221; measurement (authenticated visitors, cookies if those aren&#8217;t available, IP/UA combination if neither is available) and should be replaced with a different term if/when a better, standardized way to measure people comes along. What we all agree on, though, is that a Unique Visitor metric measured with cookies is terribly inaccurate.</p>
<p>How bad is it? As usual, it depends. A site where people tend to visit on a daily basis will see more inflation from cookie-deletion than a site that is only visited once per month. In the former example, one person may count toward the monthly total as many as 30 or so times, while in the latter, even a frequent cookie-deleter would only count once.</p>
<p>Let&#8217;s pretend that we know something about the actual people visiting a site, and see if we can determine by how much our web analytics numbers might be affected by cookie-deletion.</p>
<p>In order to make the calculations easy, I have assumed people only visit or delete their cookies on daily, weekly, or monthly boundaries, and I am only considering a one month time frame. However, the same logic could be applied to more granular data. It ultimately boils down to a matrix algebra problem, but I doubt many of us are eager to get into that level of detail.</p>
<p><strong>An example</strong></p>
<p><a href="http://showmeanalytics.com/wp-content/uploads/2009/04/crowd1.jpg"><img src="http://showmeanalytics.com/wp-content/uploads/2009/04/crowd1.jpg" alt="crowd1" title="crowd1" width="200" height="150" class="alignright size-full wp-image-90" /></a>Consider 10,000 people:  not 10,000 &#8220;cookies&#8221; and not 10,000 &#8220;unique visitors,&#8221; but 10,000 real-life, carbon-based beings. Suppose we are able to observe these people in such a way that we know &#8220;the truth&#8221; about their online behavior. Suppose also we have observed that, on average, 10% of our people delete their cookies every day, 15% delete once per week, and the remainder delete their cookies monthly or less frequently.</p>
<p>We have also observed that 20% of these people visit our website every day, 30% visit once per week, and the remainder only visit once in a given month. There&#8217;s no reason for these people to login to our website &#8212; all visits are anonymous &#8212; and we count Unique Visitors using a cookie.</p>
<p><strong>How bad is it?</strong></p>
<p>Our first step is to find out how many different cookies each person will receive over the course of the month, based on the number of times they visit our site and how often they delete their cookies. For simplicity&#8217;s sake, we&#8217;ll assume each month has 30 days and 4 weeks.</p>
<p>Daily deleters will receive a different cookie each time they visit, so daily visitors will log 30 cookies and weekly visitors log 4.  Weekly deleters who visit every day will log 4 different cookies over the course of the month, as will weekly deleters who visit once per week. Everybody else&#8217;s activity will be logged with one cookie. We can summarize as shown below.</p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="160" valign="top"><a name="OLE_LINK1">One month&#8217;s time&#8230;</a></td>
<td width="160" valign="top"><em>Daily   deleters</em></td>
<td width="160" valign="top"><em>Weekly   deleters</em></td>
<td width="160" valign="top"><em>Monthly   deleters</em></td>
</tr>
<tr>
<td width="160" valign="top"><em>Visit   every day </em></td>
<td width="160" valign="top">30 cookies</td>
<td width="160" valign="top">4 cookies</td>
<td width="160" valign="top">1 cookie</td>
</tr>
<tr>
<td width="160" valign="top"><em>Visit   once/week </em></td>
<td width="160" valign="top">4 cookies</td>
<td width="160" valign="top">4 cookies</td>
<td width="160" valign="top">1 cookie</td>
</tr>
<tr>
<td width="160" valign="top"><em>Visit   once/month</em></td>
<td width="160" valign="top">1 cookie</td>
<td width="160" valign="top">1 cookie</td>
<td width="160" valign="top">1 cookie</td>
</tr>
</tbody>
</table>
<p>Using the above factors, we can determine the cookie-contribution from each set of people:</p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="157" valign="top">(Monthly Calculations)</td>
<td width="134" valign="top">Delete daily (10%)</td>
<td width="134" valign="top">Delete weekly (15%)</td>
<td width="134" valign="top">Delete monthly (75%)</td>
<td width="79" valign="top"># Cookies</td>
</tr>
<tr>
<td width="157" valign="top">Visit daily (2000)</td>
<td width="134" valign="top">2000 x 10% x 30 = 6000</td>
<td width="134" valign="top">2000 x 15% x  4 =   1200</td>
<td width="134" valign="top">2000 x 75% x 1 = 1500</td>
<td width="79" valign="top">8700</td>
</tr>
<tr>
<td width="157" valign="top">Visit weekly (3000)</td>
<td width="134" valign="top">3000 x 10% x 4 = 1200</td>
<td width="134" valign="top">3000 x 15% x 4 = 1800</td>
<td width="134" valign="top">2250 x 75% x 1 = 2250</td>
<td width="79" valign="top">5250</td>
</tr>
<tr>
<td width="157" valign="top">Visit monthly (5000)</td>
<td width="134" valign="top">5000 x 10% x 1 = 500</td>
<td width="134" valign="top">5000 x 15% x 1 = 750</td>
<td width="134" valign="top">5000 x 75% x 1 = 3750</td>
<td width="79" valign="top">5000</td>
</tr>
<tr>
<td width="157" valign="top"></td>
<td width="134" valign="top">7700</td>
<td width="134" valign="top">3750</td>
<td width="134" valign="top">7500</td>
<td width="79" valign="top"><strong>18,950</strong></td>
</tr>
</tbody>
</table>
<p>Wow. Our 10,000 people are being represented as 18,950 unique visitors: the Unique Visitors number is inflated by 90%!</p>
<p><strong>Visitor loyalty reports are affected, too</strong></p>
<p>Unique Visitors isn&#8217;t the only number that&#8217;s affected by cookie-deletion. Any visitor-based number is going to be off, so you have a lot of trouble understanding visitor loyalty. You can tell when your efforts to improve loyalty are working, since the numbers will move in the right direction, but the <em>magnitude</em> of change will be misleading.</p>
<p>For the above example, we know that 20% of our people visited daily, 30% visited weekly, and 50% visited monthly, so the number of visits in a month works out to 77,000 (2000 x 30 + 3000 x 4 + 5000). Visits aren&#8217;t affected by cookie-deletion to any great extent (a good argument for visit-based analysis!) &#8211; so our tool will also report 77,000 visits.</p>
<p>This means our <em>people</em> averaged 7.70 visits (77000/10000) over the course of the month, but our web analytics tool will only report 4.06 (77000/18950) visits per visitor because the visitors are inflated. Our average visits per visitor are under-reported by 47%!</p>
<p>If you prefer to view loyalty using a histogram (# visitors who visited once, twice, three times, etc.), then we need to determine to which bin each visitor&#8217;s cookies will be credited.</p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="139" valign="top">(Monthly histogram)</td>
<td width="162" valign="top"><em>Delete daily</em></td>
<td width="162" valign="top"><em>Delete weekly</em></td>
<td width="162" valign="top"><em>Delete monthly</em></td>
</tr>
<tr>
<td width="139" valign="top"><em>Visit daily</em></td>
<td width="162" valign="top" bgcolor="#ffffcc">Each cookie is logged only once</td>
<td width="162" valign="top">Each cookie is seen 7 times in a month</td>
<td width="162" valign="top">Each cookie is seen 30 times (every day)</td>
</tr>
<tr>
<td width="139" valign="top"><em>Visit weekly</em></td>
<td width="162" valign="top" bgcolor="#ffffcc">Each cookie is logged only once</td>
<td width="162" valign="top" bgcolor="#ffffcc">Each cookie is logged only once</td>
<td width="162" valign="top">Each cookie is seen 4 times in a month</td>
</tr>
<tr>
<td width="139" valign="top"><em>Visit monthly</em></td>
<td width="162" valign="top" bgcolor="#ffffcc">Each cookie is logged only once</td>
<td width="162" valign="top" bgcolor="#ffffcc">Each cookie is logged only once</td>
<td width="162" valign="top" bgcolor="#ffffcc">Each cookie is logged only once</td>
</tr>
</tbody>
</table>
<p>The first thing that jumps out of this table is that <em>the majority of the cookies are only encountered once, regardless of how many times someone actually visited</em>. This explains why visitor loyalty graphs, regardless of the tool used, are often overloaded with so many one-time visitors.</p>
<div id="attachment_78" class="wp-caption aligncenter" style="width: 681px"><img class="size-full wp-image-78" title="visitor duration graphs" src="http://showmeanalytics.com/wp-content/uploads/2009/04/duration_graphs.jpg" alt="visitor duration graphs" width="671" height="289" /><p class="wp-caption-text">visitor duration graphs</p></div>
<p>Applying the frequencies in the histogram table to the numbers in the calculations table show us how our visitor retention graph is affected by cookie-deletion.</p>
<div id="attachment_79" class="wp-caption aligncenter" style="width: 538px"><img class="size-full wp-image-79" title="frequency corrections" src="http://showmeanalytics.com/wp-content/uploads/2009/04/frequency_correction.jpg" alt="visitor frequency corrections" width="528" height="77" /><p class="wp-caption-text">visitor frequency corrections</p></div>
<p>Again, wow! While 20% of our people visited the site every day, with cookie-based visitor counting, only 8% appear in this super-loyal segment. The majority of the &#8220;visitors&#8221; that were added to the site via cookie-deletion appear in the 1 visit bin, inflating that number by almost a factor of 3.</p>
<p><strong>Adding Authentication</strong></p>
<p>If 100% of the people to the above site authenticated, and the authenticated visitor identifier were used to count unique visitors, then the number would be pretty accurate (ignoring shared logins, etc.). But most sites don&#8217;t require authentication to see certain pages, so the likelihood of 100% authentication is low except in special cases, like intranets.</p>
<p>For our example above, what if half the visitors authenticated? Half of the 10,000 people would be more-or-less accurately represented, while our cookie-deletion calculations would apply to the remaining 5,000. The unique visitor multiplier factor decreases with increasing percentage of authenticated people.</p>
<div id="attachment_86" class="wp-caption aligncenter" style="width: 493px"><a href="http://showmeanalytics.com/wp-content/uploads/2009/04/uv_graph.jpg"><img class="size-full wp-image-86" title="uv_graph" src="http://showmeanalytics.com/wp-content/uploads/2009/04/uv_graph.jpg" alt="Assumed: 10/15/75 cookie deletion and 20/30/50 visiting frequency (daily/weekly/monthly)." width="483" height="291" /></a><p class="wp-caption-text">Assumed: 10/15/75 cookie deletion and 20/30/50 visiting frequency (daily/weekly/monthly).</p></div>
<p><strong>Is it always that bad?</strong></p>
<p>Not necessarily, it could be worse or it could be better. The above examples assumed that 20% of people visited the website every day, and 30% visited weekly. This was an arbitrary example meant to make calculations easier. In real life, you may have far fewer daily visitors (or more, it just depends on the site). Running the numbers assuming 5% daily and 50% weekly visitors, for example, results in a unique visitor inflation of 1.5 instead of the 1.9 calculated in our example.</p>
<p>I&#8217;ve attached <a href='http://showmeanalytics.com/wp-content/uploads/2009/04/abb_cookie_deletion_200904.xlsx'>my spreadsheet</a> so you can run your own what-ifs.</p>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2009/04/calculating-the-effects-of-cookie-deletion/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Unique Visitors suck. That&#8217;s why we shouldn&#8217;t change the definition.</title>
		<link>http://showmeanalytics.com/2009/03/unique-visitors-suck/</link>
		<comments>http://showmeanalytics.com/2009/03/unique-visitors-suck/#comments</comments>
		<pubDate>Sun, 29 Mar 2009 20:50:44 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Standards]]></category>
		<category><![CDATA[IAB]]></category>
		<category><![CDATA[unique visitors]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=50</guid>
		<description><![CDATA[Last week saw a lot of back-and-forth about the IAB audience reach definitions vs. the WAA definition of Unique Visitors. Jodi McDermott tried to explain the difference between the WAA&#8217;s UV definition and a completely different definition of the term put out there by the IAB. Her post was met by some rather harsh criticism [...]]]></description>
			<content:encoded><![CDATA[<p>Last week saw a lot of back-and-forth about the IAB audience reach definitions vs. the WAA definition of Unique Visitors. Jodi McDermott <a href="http://www.mediapost.com/publications/?fa=Articles.showArticle&amp;art_aid=102512">tried to explain the difference between the WAA&#8217;s UV definition and a completely different definition of the term put out there by the IAB</a>. Her post was met by some rather <a href="http://blog.webanalyticsdemystified.com/weblog/2009/03/unique-visitors-only-come-in-one-size.html">harsh criticism</a> by Eric Peterson, who was then <a href="http://aprilwilson.net/blog2/2009/03/26/eric-t-peterson-im-mad-at-you/">sternly lectured</a> by April Wilson (who is a WAA Board director) and then apologized to Jodi down in the comments of his post. <a href="http://tech.groups.yahoo.com/group/webanalytics/message/21908">I weighed in on the subject</a> in the Yahoo Group, then <a href="http://tech.groups.yahoo.com/group/webanalytics/message/21910">Eric replied</a> to me.</p>
<p>In a nutshell: Eric thinks the WAA definition sucks, the WAA should adopt the IAB definitions, all of the web analytics tools should immediately change their Unique Visitors metric to Unique Cookies, and the WAA Standards Committee is being <em>asinine</em> (his word, not mine) because they don&#8217;t change direction and adopt the IAB metrics.</p>
<p>The WAA Standards Committee did not seek out a urinary Olympiad with either the IAB or with Eric Peterson, but there are a lot of points out there regarding the IAB&#8217;s Audience Reach Guidelines and the WAA Standards that are not being addressed through Eric&#8217;s <a href="http://en.wikipedia.org/wiki/Bully_pulpit">bully pulpit</a>. As long-time co-chair of the WAA Standards Committee &#8212; first with Jason Burby, and more recently with <a href="http://synerinsights.blogspot.com/">Judith Pascual</a>, both from ZAAZ &#8212; I would like to offer some more insights. Some of this is my own perspective as a practitioner, but most of it has been discussed <em>ad nauseam</em> in our committee meetings over the past three years and represents the committee&#8217;s position on the subject.</p>
<p><strong>Redefining a metric that has been in use for a decade, and is unlikely to change in a lot of tools, will only increase confusion</strong>.</p>
<p>Unique visitors suck, and a better metric should be available. But as long as we are using the same terminology as the sucky metric we&#8217;ve always used, we&#8217;re not fixing anything.  This is really my major beef with the IAB specification, as I applaud their efforts to invent a metric that does a better job measuring <em>people</em>.</p>
<p>Unique visitors has been a technology-based metric for over a decade. It seems unlikely that vendors are going to change either their  technology or its terminology en masse. This means that as an analyst, you will still be stuck trying to figure out what it means when your vendor reports unique visitors, except you&#8217;ll have a lot more variations thrown into the mix.</p>
<p>A different name for an improved metric (unique people? unique individuals? reach?) would provide little doubt that the tool in question is reporting something different, since the new metric wouldn&#8217;t even exist in non-compliant tools.</p>
<p><strong>Data from two different IAB-compliant tools still won&#8217;t be comparable</strong>.</p>
<p>Why? Because the IAB spec doesn&#8217;t tell you <em>how</em> to correct the numbers, just that you&#8217;re supposed to do it. A &#8220;scientific&#8221; method must be used, and it must be based on information obtained directly from people, and beyond that you are relying on the mathematical and/or statistical acumen of the people reporting the numbers. The way the spec is written, they can correct the data however they want as long as they can make an argument that it&#8217;s people-based. And you are likely to be comparing panel-based corrections with cookie-deletion-algorithm-based corrections with registration-based corrections as you go from site to site.</p>
<p><strong>Should you correct detail data? Or aggregate data?</strong></p>
<p>This is not made clear in the IAB&#8217;s spec, either, so it&#8217;s yet another way that numbers reported by two different tools or companies could be different.</p>
<p>Web analytics data is collected on a record-by-record basis. Someone makes a request on a website that is instrumented for analytics, and a record of that request, along with any other information your tool collects (cookies, user ID, referrer, user agent, custom data, etc.), is made. This record is put into one or more tables in the database that makes up the guts of your web analytics tool. The majority of what happens when you &#8220;do&#8221; web analytics is based on queries of that database.</p>
<p>Should correction for actual people take place at the detail level, on those records in the database? Or is there a fudge factor that is applied to the monthly (weekly, quarterly, whatever) data you report from the tool?</p>
<p>And would you use the same terminology for a metric corrected at the detail level as for a metric corrected in aggregate?</p>
<p><strong>Privacy is important to our customers, so it is important to us.</strong></p>
<div class="mceTemp">
<dl id="attachment_51" class="wp-caption alignright" style="width: 160px;">
<dt class="wp-caption-dt"><img class="size-full wp-image-51" title="gorilla" src="http://showmeanalytics.com/wp-content/uploads/2009/03/gorilla.jpg" alt="gorilla" width="150" height="225" /></dt>
</dl>
</div>
<p>Privacy is the 800 pound gorilla sitting right in the middle of this discussion, and if we&#8217;re not careful he&#8217;s going to start flinging poo. In creating a standard metric, we cannot force sites to identify the people that visit their site, if those people do not want to be identified.</p>
<p>We have to balance our need for accurate metrics with the privacy considerations of our visitors and ease of use of our websites. Many Internet users simply do not want to be identified, either for privacy reasons or for convenience, and for many sites there really isn&#8217;t any benefit to the user of logging in.</p>
<p>There are even legal issues for sites used by children under 13 (Children&#8217;s Online Privacy Protection Act, or COPPA) and for sites that deal with medical information, like insurance or medical advice sites (Health Insurance Portability and Accountability Act, or HIPPA). These laws and others like them don&#8217;t prohibit login &#8212; indeed, sometimes it&#8217;s necessary &#8212; but they do throw another business consideration into the mix, and often it&#8217;s easier for the business to go out of their way to ensure they *can&#8217;t* identify individuals, so they don&#8217;t have to deal with any potential Personally Identifiable Information (PII).</p>
<p>I have read suggestions about using  unique device identifiers instead of cookies to measure Unique Visitors so that no PII would be required. In fact this is what is used for some types of mobile analytics, as mobile devices have a unique identifier. From a privacy standpoint, this could spell disaster for our industry. From a user standpoint, a unique device identifier  is the equivalent of a 3rd party cookie that is tied to you and only you: one that you can never delete. Your online behavior can be tracked from site to site to site. Claims about the &#8220;anonymity&#8221; of a unique device identifier are going to be irrelevant to privacy advocates, since <a href="http://epic.org/privacy/search_engine/">a user&#8217;s search data can be and has been used to tie &#8220;anonymous&#8221; information back to individuals</a>.</p>
<p><strong>The audience for the IAB&#8217;s standard is unclear, and attempts to clarify it have failed</strong>.</p>
<p>Eric&#8217;s posts about the IAB spec have put a spotlight on the confusion over just who the IAB specification was written for, and who would be expected to comply. Unfortunately, the IAB (Joe Laszlo and his colleague Sherril Mane) gave me a different answer than the answer Eric wrote about in his post:  I was told that the definitions most likely <em>would</em> be applicable to web analytics vendors.</p>
<p>So who were IAB requirements written for? Your guess is as good as mine.</p>
<p>On the same phone call, Joe and Sherril also told Jodi and me that the IAB didn&#8217;t intend to change anything about their standard, regardless of any input they might receive. The document was &#8220;thoroughly vetted&#8221; by their stakeholders, and that was the final say. Which brings me to my next point&#8230;</p>
<p><strong>If you are reading this post, odds are the IAB doesn&#8217;t represent you</strong>.</p>
<p>Does that make them bad? No. Does that make them irrelevant? No. But they represent a different industry and a different set of stakeholders than the WAA, and your interests as a web analyst are not even a tiny part of their consideration.</p>
<p>Companies pay anywhere from $5,000 (for non-voting associate membership) to upwards of $300,000 per year <a title="to be members" href="http://www.iab.net/member_center/1518/1546">to be members of the IAB</a>. These are the interests they represent. You may or may not work for someone who &#8220;has a say,&#8221; and there are no individual members.</p>
<p>The WAA, on the other hand, is comprised of practitioners, consultants, and vendors in the Web Analytics field. <em>This is who we represent</em>. A <a href="http://www.webanalyticsassociation.com/waawebcastseries/membersonly/">recent survey by the WAA Research Committee</a> (login required) reported that over half of the members responding have individual, not corporate membership. Many of us individual members, myself included, pay the ~$200 WAA membership fee out of our own pockets.</p>
<p>Why does this matter? Because we on the Standards Committee work in the web analytics field, we strive to consider how the work we do affects real practitioners analyzing different types of websites. Over the years, we have had significant participation by practitioners working &#8220;in the trenches&#8221; at companies as diverse as Disney, Saks Fifth Avenue, IBM, Reed Elsevier, FedEx, The Motley Fool, ANSI, Adobe, CNN, Ford, Intuit, AOL, Yahoo!, and more. Many of these people are individual members and many of them have been with us for the better part of our tenure as a committee, even as they moved from one company to another and were able to bring in experience from multiple perspectives.</p>
<p>We don&#8217;t write standards only for sites that buy or sell advertising. A definition that applies to all sites must actually work for all site types. This includes e-commerce sites, portal sites, subscription-based sites, pure content sites, lead-generation sites, and self-service sites in addition to ad-based sites. And standards that we write must benefit the analyst more than they confuse him or her.</p>
<p>Because we don&#8217;t currently have a good way of measuring people that is applicable to all sites, or to even a majority of sites, we have not defined one. Instead, we have attempted to capture the most common practices of today.</p>
<p><strong>Unique Visitors (per the WAA definition) and Unique Cookies are not the same thing.</strong></p>
<p>Eric has suggested more than once that the easy solution to this issue is for web analytics tools to relabel their Unique Visitors numbers as Unique Cookies .  I quote: &#8220;Web analytics practitioners (and theoretically the vendors) will learn to use &#8216;Unique Cookies&#8217; since that is a technically correct and 100% accurate description of the data.&#8221;</p>
<p>But it isn&#8217;t accurate. Using cookies is indeed the <em>most prominent </em>method of identifying Unique Visitors, but it isn&#8217;t the only one. Calling that a 100% accurate description of the data is a huge error. Analysts who care about more accurate unique visitor counts are very likely not using cookies to count unique visitors/users. They are probably using registration data from purchases, or user IDs from logins. This is allowed, and preferred, by the WAA standard.</p>
<p>Even if you are using cookies to count your Unique Visitors, they are probably not being used for 100% of the count. Some of your visitors block the JavaScript that sets the cookie. Your WA tool more likely than not has a hierarchy of calculations they use to estimate the &#8220;anonymous&#8221; (or &#8220;guessed&#8221;) unique visitors.</p>
<p>If your tool analyzes logfiles, and your site does not drop tracking cookies itself, then the Unique Visitors metric in your analytics tool counts the number of different IP address/User Agent (browser + operating system) combinations it finds. This isn&#8217;t even close to counting Unique Cookies.</p>
<p>Changing the name of a bad metric in the tools, to a metric that doesn&#8217;t describe what the tool does, doesn&#8217;t fix a thing.</p>
<p><strong>What is a &#8220;standard,&#8221; anyway?</strong></p>
<p><a href="http://tech.groups.yahoo.com/group/webanalytics/message/21910">Eric&#8217;s reply to me on the Yahoo Group</a> defines what is probably the crux of the issue:</p>
<p>&#8220;Perhaps we disagree on what a &#8217;standard&#8217; is. I think a standard should describe the way things ** should be ** not the way things (unfortunately) are.&#8221;</p>
<p>Yes, we definitely disagree on that point, and perhaps that is what makes this discussion so difficult. I would call the way things<em> should</em> <em>be </em>a <strong><em>requirement</em></strong>, not a <strong><em>standard</em></strong>. It&#8217;s no doubt true that <em>standard</em> can be interpreted either way.</p>
<p>The dozens of people &#8211; practitioners, consultants, and vendors &#8212; who have been meeting every other week for the last three plus years to help establish the WAA standards have chosen to use the standards to describe reality as we know it, so that&#8217;s the way I think of a standard. We think that, at this point, it is more important for analysts to <em>understand what they are measuring</em> than to <em>change what they are supposed to measure</em>, and that has been the focus of our committe&#8217;s efforts to date. This doesn&#8217;t mean we will never define the future as we think it should be, only that the industry is young, and we believe this is the appropriate first step.</p>
<p><strong>And I really couldn&#8217;t let this one slide.</strong></p>
<p><img class="alignleft size-full wp-image-52" title="orly_owl_small" src="http://showmeanalytics.com/wp-content/uploads/2009/03/orly_owl_small.jpg" alt="orly_owl_small" width="219" height="200" />Finally, in one of his posts Eric says &#8220;&#8230;<em>considering the <strong>fact</strong> [emphasis mine] that the WAA&#8217;s definition is wrong</em>&#8230;&#8221;</p>
<p>I know I should probably ignore that statement. The issues here are a matter of interpretation: the WAA is not wrong, and Eric is not wrong (as long as he stays away from black-and-white, I&#8217;m-right- and-you-are-wrong arguments), and &#8212; despite my and the committee&#8217;s disagreement &#8212; the IAB is not necessarily wrong. We represent disparate opinions about what terms and definitions will most benefit those in our respective industries.  But I can&#8217;t help but point out:</p>
<p>The WAA definition of Unique Visitors describes how <em>nearly every single analytics tool that exists</em><strong> </strong>calculates Unique Visitors. The WAA definition of Unique Visitors is also essentially <em>in agreement with the JICWEBS definition</em> of Unique Users. (For the US-centric among us, JICWEBS is the Joint Industry Committee for Web Standards, a standard-setting body from the UK and Ireland. It includes ABCe and the IAB UK.)</p>
<p>On the other hand, the IAB definition takes an admittedly flawed metric and <em>completely redefines</em> it.  It represents the way almost<strong><em> </em></strong><em>no one</em><em> </em>calculates unique visitors or unique users.</p>
<p>And yet, it&#8217;s a <strong><em>fact</em></strong> that the WAA is <em>wrong</em>? Oh really?</p>
<h6>Photo credits:</p>
<p>O RLY Owl: <a href="http://www.hjo3.net/orly/gallery1.htm">http://www.hjo3.net/orly/gallery1.htm</a></p>
<p>Gorilla: User noladoc30 on stock.xchng (<a href="http://www.sxc.hu/photo/1112963">http://www.sxc.hu/photo/1112963</a>)</h6>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2009/03/unique-visitors-suck/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>More Unix Nerdiness: xargs and tar</title>
		<link>http://showmeanalytics.com/2009/01/more-unix-nerdiness-xargs-and-tar/</link>
		<comments>http://showmeanalytics.com/2009/01/more-unix-nerdiness-xargs-and-tar/#comments</comments>
		<pubDate>Sat, 24 Jan 2009 16:44:54 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Unix]]></category>
		<category><![CDATA[ls]]></category>
		<category><![CDATA[mv]]></category>
		<category><![CDATA[tar]]></category>
		<category><![CDATA[xargs]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=39</guid>
		<description><![CDATA[I didn&#8217;t want to do two Unix posts in a row, but something came up last week that made me dust off the xargs command. Even if you do a lot of logfile parsing, you probably won&#8217;t use xargs very much. But when you need it, it can save a ton of time.
Commands used in [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-45" title="smallnerdglasses" src="http://showmeanalytics.com/wp-content/uploads/2009/01/smallnerdglasses.jpg" alt="smallnerdglasses" width="200" height="133" />I didn&#8217;t want to do two Unix posts in a row, but something came up last week that made me dust off the xargs command. Even if you do a lot of logfile parsing, you probably won&#8217;t use xargs very much. But when you need it, it can save a ton of time.</p>
<p>Commands used in this post:</p>
<p><strong>xargs </strong>- allows you to use a list generated by one command as arguments for another command</p>
<p><strong>mv </strong>- moves or renames a file</p>
<p><strong>ls </strong>- lists files</p>
<p><strong>tar </strong>- an archive utility that lets you store and extract multiple files; although it can be used to write files to tape, one very useful property is that it can also write output to a regular file, which is extremely handy if you need to transfer a bunch of files to your PC (WinZip can handle tarfiles)</p>
<p>Let&#8217;s say you have a lot of files in your directory named test1.txt, test2.txt, test3.txt, and so on, and you want to append .old to all of them. You could run a mv (move) command on each individual file, or you could rename all of them at once using xargs as follows:</p>
<p><span style="color: #800080;">ls -1 test* | xargs ABC mv ABC ABC.old</span></p>
<p>•	ls -1 (that&#8217;s dash-&#8221;one&#8221;, not a lowercase &#8220;L&#8221;) lists the files so that there is only one file per line</p>
<p>•	Each file name is temporarily held in a variable called ABC, and the rest is just substitution (mv test1.txt test1.txt.old, etc.).</p>
<p>I ran into a situation last week with a batch job that spits out thousands of reports, each into its own subdirectory. After all the reports had run, an issue was found that affected several hundred of them, which were then rerun. Because the reports had already been archived and moved to their final location (not difficult, but time-consuming due to sheer volume), we only wanted to re-archive the reports that had been rerun that day. This was relatively easy to do thanks to xargs. Here&#8217;s the command I used:</p>
<p><span style="color: #800080;">ls -l */*.out | grep &#8220;Jan 21&#8243; | sed ‘s/^.*Jan 21 ..:.. //&#8217;  | xargs tar cvf reportreruns.tar</span></p>
<p>•	ls -l (this time it&#8217;s a lowercase &#8220;L&#8221;, not a &#8220;one&#8221;) does a long listing, which includes permissions, size, owner, date stamp, etc. The first few lines for the listing look something like this:</p>
<p>-rw-r&#8211;r&#8211;   1 owner  group   1234567  Jan 18  22:07 subdir1/report.out<br />
-rw-r&#8211;r&#8211;   1 owner  group   1234567  Jan 21  05:46 subdir2/report.out<br />
-rw-r&#8211;r&#8211;   1 owner  group   1234567  Jan 19  09:31 subdir3/report.out<br />
-rw-r&#8211;r&#8211;   1 owner  group   1234567  Jan 21  05:57 subdir4/report.out</p>
<p>•	The grep command (see prior post for more information) pulls out only those lines that include the string &#8220;Jan 21&#8243;.</p>
<p>•	The sed command (see prior post for more information) strips off everything up to  the subdirectory and file. Quite literally, it&#8217;s substituting everything from the beginning of the line (^) to &#8220;Jan 21&#8243;, a space, two characters (. is a wildcard for a single character), a colon, two more characters (that just took care of the time stamp), and another space &#8211; and replaces it with nothing.</p>
<p>•	The xarg command takes all of those files and passes them to the tar command. Notice I didn&#8217;t use the variable (ABC) this time: xargs will automatically put the arguments at the end of the following command, so you only need to use a variable if the arguments appear elsewhere in the command.</p>
<p>•	The cvf modifier to tar tells it to create (&#8221;c&#8221;) the archive, see verbose (&#8221;v&#8221;) messages (it&#8217;ll output a message as each file is added: it&#8217;s optional), and write the archive to a file (&#8221;f&#8221;) named reportreruns.tar. The tar command needs to be followed by a list of files to put into the archive, but xargs has already taken care of that.</p>
<address><em>Photo courtesy of <a title="stock.xchng" href="http://www.sxc.hu">stock.xchng</a>.</em><br />
</address>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2009/01/more-unix-nerdiness-xargs-and-tar/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Unix commands for logfile parsing</title>
		<link>http://showmeanalytics.com/2009/01/unix-commands-for-logfile-parsing/</link>
		<comments>http://showmeanalytics.com/2009/01/unix-commands-for-logfile-parsing/#comments</comments>
		<pubDate>Mon, 12 Jan 2009 04:18:41 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Unix]]></category>
		<category><![CDATA[awk]]></category>
		<category><![CDATA[cut]]></category>
		<category><![CDATA[grep]]></category>
		<category><![CDATA[Logfiles]]></category>
		<category><![CDATA[server logs]]></category>
		<category><![CDATA[sort]]></category>
		<category><![CDATA[uniq]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=19</guid>
		<description><![CDATA[Although many web analysts use a JavaScript-tagged solution, some of us still do log analysis on one or more sites. Even when JS data is used, sometimes you have a troubleshooting situation that requires you to go back to your logs. If you have access to a Unix environment, commands like grep, cut, and awk [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal">Although many web analysts use a JavaScript-tagged solution, some of us still do log analysis on one or more sites. Even when JS data is used, sometimes you have a troubleshooting situation that requires you to go back to your logs. If you have access to a Unix environment, commands like grep, cut, and awk are invaluable for prowling through large files. You can also download these commands to use in a PC/DOS environment, although I’ve found the DOS version to be a little more awkward to use.</p>
<p class="MsoNormal">Here is an introduction to some of my favorite commands:</p>
<p class="MsoNormal"><strong>grep</strong> – used to find lines that contain a certain string or regular expression (note that regular expressions are not fully supported in the default grep command for some Unix systems)</p>
<p class="MsoNormal"><strong>cut</strong> – used to pull out specific columns from a file based on a specified delimiter;  most logs are space-delimited</p>
<p class="MsoNormal"><strong>awk</strong> – a programming language that can parse through text files; short pieces of code can be used on the command line</p>
<p class="MsoNormal"><strong>sort</strong> – sort the output; the –n modifier sorts numerically; can use a –t modifier to sort on something other than the beginning of the line</p>
<p class="MsoNormal"><strong>uniq</strong> – eliminate duplicate lines; the –c modifier shows a count of how many times each line appears</p>
<p class="MsoNormal">In order to make “cut” work, you need to know which fields contain your data of interest. If you use the <a href="http://httpd.apache.org/docs/1.3/logs.html">“combined” log format</a>, the following table lists the fields where data is located. Cutting out cookie data can be a bit more difficult: we’re using cut with a space delimiter, but spaces can be contained in the user agent field so pulling out cookie values takes a little more work.</p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="73" valign="top">
<p class="MsoNormal"><strong>Field #</strong></p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal"><strong>Information</strong></p>
</td>
</tr>
<tr>
<td style="padding: 0in 5.4pt; width: 54.9pt;" width="73" valign="top">
<p class="MsoNormal">1</p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal">IP address</p>
</td>
</tr>
<tr>
<td style="padding: 0in 5.4pt; width: 54.9pt;" width="73" valign="top">
<p class="MsoNormal">3</p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal">Auth (userid) field; note   it’s not always populated</p>
</td>
</tr>
<tr>
<td style="padding: 0in 5.4pt; width: 54.9pt;" width="73" valign="top">
<p class="MsoNormal">4</p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal">Timestamp</p>
</td>
</tr>
<tr>
<td style="padding: 0in 5.4pt; width: 54.9pt;" width="73" valign="top">
<p class="MsoNormal">7</p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal">Request URL</p>
</td>
</tr>
<tr>
<td style="padding: 0in 5.4pt; width: 54.9pt;" width="73" valign="top">
<p class="MsoNormal">9</p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal">Status code</p>
</td>
</tr>
<tr>
<td style="padding: 0in 5.4pt; width: 54.9pt;" width="73" valign="top">
<p class="MsoNormal">11</p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal">Referrer</p>
</td>
</tr>
<tr>
<td style="padding: 0in 5.4pt; width: 54.9pt;" width="73" valign="top">
<p class="MsoNormal">12-</p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal">User agent (the dash means   go through the end of the line; UA can contain spaces and thus spans several   columns)</p>
</td>
</tr>
<tr>
<td style="padding: 0in 5.4pt; width: 54.9pt;" width="73" valign="top">
<p class="MsoNormal">(varies)</p>
</td>
<td style="padding: 0in 5.4pt; width: 238.5pt;" width="318" valign="top">
<p class="MsoNormal">Cookies</p>
</td>
</tr>
</tbody>
</table>
<p class="MsoNormal">In Unix environments, you are allowed to view your results page by page on the screen, or to save them to a file. To page through the results on screen, pipe the command through “more” as shown:</p>
<p class="MsoNormal"><span style="color: #800080;"><em>command</em> | more</span></p>
<p class="MsoNormal">To save the results to a file, redirect your output to a file of your choice using the greater than symbol:</p>
<p class="MsoNormal"><span style="color: #800080;"><em>command</em> &gt; <em>outputfile</em></span></p>
<p class="MsoNormal">To open a logfile, use gunzip –c (the –c will only gunzip it to the screen instead of uncompressing and saving your file) if your file is ends in a .gz, which indicates it is compressed. Use the “cat” command if the logfile is not compressed. To take a peek at one of your logfiles, you would do the following:</p>
<p class="MsoNormal"><span style="color: #800080;">gunzip –c <em>file.gz</em> | more</span> or                     <span style="color: #800080;">cat <em>file</em> | more</span></p>
<p class="MsoNormal">The remainder of our examples assume we are examining a compressed logfile.</p>
<p class="MsoNormal"><strong>To pull out all records from one IP address (1.2.3.4, for example):</strong></p>
<p class="MsoNormal"><span style="color: #800080;">gunzip –c <em>file.gz</em> | grep “1.2.3.4” &gt; <em>outputfile</em></span></p>
<p class="MsoNormal"><strong>To pull out all records from any IP address that begins with 12: </strong></p>
<p class="MsoNormal"><span style="color: #800080;">gunzip –c <em>file.gz</em> | grep “^12.” &gt; <em>outputfile</em></span></p>
<p class="MsoNormal">Notes:</p>
<ul>
<li>A caret (^) is the how you specify the beginning of a line with a regular expression</li>
<li>The backslash () tells the regular expression you are looking for an actual period instead of a wildcard</li>
</ul>
<p class="MsoNormal"><strong>To look at the requests made by one IP address (1.2.3.4, for example):</strong></p>
<p class="MsoNormal"><span style="color: #800080;">gunzip –c <em>file.gz</em> | grep “^1.2.3.4 “  | cut –d’ ‘ –f7 &gt; <em>outputfile</em></span></p>
<p class="MsoNormal"><strong>Pull out “page” requests only (status code = 200, and not an image, css, or javascript file):</strong></p>
<p class="MsoNormal"><span style="color: #800080;">gunzip –c <em>file.gz</em> | grep “ 200 “ | grep –v “.jpg “ | grep –v “.gif “ | grep –v “.png “ | grep –v “.css “ | grep –v “.ico “ | grep –v “.js “ &gt; <em>outputfile</em></span></p>
<p class="MsoNormal">Notes:</p>
<ul style="margin-top: 0in;" type="disc">
<li class="MsoNormal">You can exclude any other      file extensions you wish by piping another grep –v into your command; ending      the grep string ends with a space ensures you will only eliminate lines      where those extensions are the request, and not embedded in a query string      value.</li>
<li class="MsoNormal">If you do a lot of logfile      parsing, you may wish to put all the grep –v commands into a script so you      don’t have to type all the commands every time you want to limit your      output to pages.</li>
</ul>
<p class="MsoNormal"><strong>Make a list of the most popular referrer fields for the /index.html page:</strong></p>
<p class="MsoNormal"><span style="color: #800080;">gunzip –c <em>file.gz</em> | grep “GET /index.html” | cut –d’ ‘ –f11 | sort | uniq –c | sort –nr &gt; <em>outputfile</em></span></p>
<p class="MsoNormal">Notes:</p>
<ul style="margin-top: 0in;" type="disc">
<li class="MsoNormal">The output will be a      sorted list of lines with a number and a URL; the number is how many times      the referrer occurred, and the URL is the referrer</li>
<li class="MsoNormal">The uniq command must be      executed on sorted input, which is why we sort the output first</li>
<li class="MsoNormal">The second sort command      lists the output by most to least popular referrer; -n is numeric and –r is      reverse order</li>
</ul>
<p class="MsoNormal"><strong>Pull out all the records from userid “angie”, and sort them by timestamp:</strong></p>
<p class="MsoNormal"><span style="color: #800080;">gunzip –c <em>file.gz</em> | grep “ angie “ | sort –t’ ‘ +3 &gt; <em>outputfile</em></span></p>
<p class="MsoNormal">Note:</p>
<p class="MsoNormal">The sort command is modified as follows: -t’ ‘ says the input is space-delimited, while the +3 says to sort on the fourth column (defaults to first column, but we need to move it over three columns)</p>
<p class="MsoNormal"><strong>Find all requests that are more than 1000 characters long:</strong></p>
<p class="MsoNormal">Very long requests are often a sign that something is wrong: they can indicate a problem with your website’s code or they can be indicative of someone trying to hack into your website (especially if the requests contain any SQL code words).</p>
<p class="MsoNormal"><span style="color: #800080;">gunzip –c <em>file.gz</em> | awk ‘length &gt; 1000’ &gt; <em>outputfile</em></span></p>
<p class="MsoNormal">Stay tuned for more posts with additional commands.</p>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2009/01/unix-commands-for-logfile-parsing/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>How to get the most from your professional services dollars</title>
		<link>http://showmeanalytics.com/2009/01/how-to-get-the-most-from-your-professional-services-dollars/</link>
		<comments>http://showmeanalytics.com/2009/01/how-to-get-the-most-from-your-professional-services-dollars/#comments</comments>
		<pubDate>Fri, 02 Jan 2009 10:31:49 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Services]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[professional services]]></category>
		<category><![CDATA[vendor]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=11</guid>
		<description><![CDATA[So, you have decided to invest somewhere between four and six digits in a special analysis from your web analytics vendor. Having been on the performance end of that deal for several years, I would like to offer some insights that can make the engagement more valuable to you.
1. If the data-pulling part of the [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal">So, you have decided to invest somewhere between four and six digits in a special analysis from your web analytics vendor. Having been on the performance end of that deal for several years, I would like to offer some insights that can make the engagement more valuable to you.</p>
<p class="MsoNormal"><strong>1. If the data-pulling part of the analysis can be done in the tool, ask for instructions.</strong></p>
<p class="MsoNormal">One thing you should know up front: If the answer to your business problem is something that can be found in the tool, you will save a lot of money by doing it yourself. However, not everyone has the time or the expertise to pull and/or interpret the numbers themselves. If you know (or suspect) that this is the case, ask your consultant to include step by step instructions on how to repeat the analysis on your own at a later date, and to document why she reached the conclusions she did. This may increase your estimate by a few hours to cover documentation costs, but you will save in the long run by being able to repeat the analysis as desired at a later date, to establish trends or to see how making changes to your site affects the outcomes. You will also have gained some valuable insight into how an “expert” approaches your particular business problem.</p>
<p class="MsoNormal"><strong>2. Tell your consultant what business problem you’re trying to solve.</strong></p>
<p class="MsoNormal">Make sure you clearly communicate the specific business problem you are trying to solve. If you know of one methodology that will get the answer you are looking for, don’t be afraid to share. But it is usually best not to over-specify the methodology to be used, even if you think you know exactly what you want. Your analysis tool has a maze of tables and relationships and data under the covers, and there is often more than one way to tease out the same business insight. Allowing the consultant the flexibility to suggest other methodologies could result in significant cost savings, or in added insight for the same amount of hours. For example, asking your consultants for “all paths” that lead to purchase should set off some alarm bells with your consultant. A request for a list of “all” anything – unless the data is intended to be moved into another database for further analysis – is usually a sign that the business question hasn’t been well-formed. A good consultant will try to guide you to the real reason behind your question instead of providing you with a data dump: Were you trying to find out whether users were more likely to search or to navigate before making a purchase? Or were you trying to figure out whether the new tools you added in your product view area led to a higher likelihood to purchase?</p>
<p class="MsoNormal"><strong>3. Beware of “canned” analysis.</strong></p>
<p class="MsoNormal">I am not talking about your consultant re-using queries and methodologies from prior engagements – that’s smart business, and reduces the time it takes to get answers and the likelihood of errors – but about “analysis” menus from which you can order as though you are in a restaurant. These are seldom tailored to your site, and are based on a one-size-fits-all methodology. These analyses are not all bad – they can serve as a training tool for your own in-house analysts if they include instructions on how to replicate the analysis yourself – but you should expect to pay accordingly.</p>
<p class="MsoNormal"><strong>4. Meet with the consultant </strong><em><strong>before </strong></em><strong>you ask for a quote.</strong></p>
<p class="MsoNormal">When you first approach your vendor for services and speak with the consultant, insist on a web conference where you can call up your actual website and point to specific areas of concern. Taking time to do this up front helps to avoid misunderstandings, helps your consultant understand any company-specific terminology you use when referring to activity on your site, and should result in a more accurate estimate and more satisfactory analysis.</p>
<p class="MsoNormal"><strong>5. Clearly define — in writing — what the analysis engagement is about.</strong></p>
<p class="MsoNormal">Finally, don’t be afraid to ask for a “solutions design” document to specify exactly what you’re paying for. Don’t expect the design document to include all of the details: it is hard to run a business when a consultant executes most of the engagement up front, only to have the client decide not to spend the money after all. However, the document should include enough detail about the methodology for the consultant to know how much work is involved, and to minimize any disconnects between what you asked for and what she delivers.</p>
<p class="MsoNormal">When you contract with your vendor for professional services, you are paying for special access to your site’s dataset, a breadth of knowledge that allows the consultant to apply lessons learned on a variety of sites, and for the business acumen to understand something about your particular business and what makes your site different from everyone else’s. Make the best of the opportunity!</p>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2009/01/how-to-get-the-most-from-your-professional-services-dollars/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>If you pay your visitors’ salaries, then it pays to optimize</title>
		<link>http://showmeanalytics.com/2008/12/if-you-pay-your-visitors%e2%80%99-salaries-then-it-pays-to-optimize/</link>
		<comments>http://showmeanalytics.com/2008/12/if-you-pay-your-visitors%e2%80%99-salaries-then-it-pays-to-optimize/#comments</comments>
		<pubDate>Sun, 28 Dec 2008 06:01:24 +0000</pubDate>
		<dc:creator>angie</dc:creator>
				<category><![CDATA[Intranet]]></category>
		<category><![CDATA[corporate]]></category>
		<category><![CDATA[employees]]></category>
		<category><![CDATA[time on site]]></category>

		<guid isPermaLink="false">http://showmeanalytics.com/?p=5</guid>
		<description><![CDATA[If you run an intranet or other web-based tool for your employees, you are in the unique position of paying people to use your website. If you are wasting their time, you are wasting your money!

Many larger companies have self-service websites where employees research their benefit offerings, reconcile their expense accounts, and book their own [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal"><em>If you run an intranet or other web-based tool for your employees, you are in the unique position of paying people to use your website. If you are wasting their time, you are wasting your money!</em></p>
<p class="MsoNormal">
<p><img class="alignright size-full wp-image-7" title="timemoney" src="http://showmeanalytics.com/wp-content/uploads/2009/01/timemoney.png" alt="timemoney" width="150" height="186" />Many larger companies have self-service websites where employees research their benefit offerings, reconcile their expense accounts, and book their own travel. And even smaller companies are embracing the intranet: a place where employees can find company policies, view documentation, open trouble tickets, and/or find employee contact details. These sites have one thing in common that requires a different approach from your external websites: You are paying the salaries of your site’s visitors, and their time spent equals your company’s money.</p>
<p class="MsoNormal">It’s often hard to justify advanced analytics or A/B testing on an intranet. If the site is instrumented at all, it’s often the case that someone “slapped some tags” on the site, and examination of the data ends at whatever standard reports are generated. However, if the company is large enough, then a good financial case can often be made for performing the same types of optimization efforts you practice on your external site. The larger the company, and the more expensive the workforce, the more compelling the case becomes.</p>
<p class="MsoNormal">Let’s consider a company with 5,000 employees — “unique visitors” in your web analytics tool — who use your intranet. Assume that on average, these employees visit twice per day, with an average visit duration of 1 minute. On a monthly basis, intranet usage represents over 3300 labor hours – not an insignificant amount. To put it in perspective, that’s roughly the equivalent of 20 full-time employees doing nothing but surfing the intranet!</p>
<p class="MsoNormal">
<div id="attachment_6" class="wp-caption aligncenter" style="width: 482px"><img class="size-full wp-image-6" title="Hours per Month" src="http://showmeanalytics.com/wp-content/uploads/2009/01/hoursmonth.png" alt="Hours per Month Calculation" width="472" height="51" /><p class="wp-caption-text">Hours per Month Calculation</p></div>
<p>The dollar value put on those labor hours varies depending on company size, workforce skill and experience level, overhead costs, etc. If you don’t know the “wrap rate” (fully burdened rate that includes overhead, administration, etc. – usually 2-3 times an average hourly pay) used to estimate projects at your company, then $100/hour is good rule of thumb. It’s not unusual for very large companies to run higher than that, or for small/medium companies to run much lower, but it’s a nice round number that makes calculations easy.</p>
<p class="MsoNormal">Now let’s set a goal to shave 10% off the total visit hours each month by improving search or navigation on the site. For our example above, that’s a savings of 330 hours, or $33,000 per month. Potential savings of nearly $400,000 per year helps makes a compelling ROI argument indeed!</p>
<p class="MsoNormal"><em>Note: For this analysis, it is especially important to know whether your analytics tool eliminates “bounced” (single-page) visits from your average visit duration calculation. That’s because the intranet is set as the home page for the default browser install within many companies, leading to a very high percentage of visits where only one page is viewed. I recommend you eliminate these visits from your calculations, since they don’t represent an optimization opportunity since people are just passing through on their way to another site.</em></p>
<p><span style="color: gray;">Photo courtesy stock.xchng</span></p>
<p class="MsoNormal">
<p class="MsoNormal"><cite><a href="http://www.linkedin.com/in/webmetrics">Thomas Bosilevac</a></cite> Says:<br />
<span style="font-size: 10pt;"><a href="../?p=3#comment-2">December 28th, 2008 at 12:02 am</a> </span></p>
<p>Great article! I manage the analytics service for the Intranet sites of a large multinational myself. Yes, just Intranet sites. You are right, in some cases this may only warrant the need to “slap a tag on it” and it does not get as fun as multi-variant testing, heat maps and campaign analysis. For organizations who spend millions (probably at least the fortune 1000) on their Intranet strategy, however, I can easily argue that a bit more care MUST be made, especially considering most organizations cannot use free tools such as Google Analytics. So let’s keep preaching for better Intranet stats!<br />
Your article does a great job describing the start of Intranet valuation. How many people and how long do they spend. I place great caution though on the example to decrease this holistically. In a time where “shaving 10%” never meant more, this is a dangerous single metric to make into a KPI. The Intranet, if used wisely, can be an extremely valuable and cost-saving asset to the company. Instead of shaving off, perhaps increasing usage of such apps as client communication templates, call Center Deflection, etc is in the best interest to the bottom line.<br />
While search may be an area where “time to find result” warrants a decrease, we have found many areas that we are now promoting due to the cost savings vs. “traditional” models involving too much people, paper and process.<br />
Sorry for the rant, I was very excited to see a blog category specifially for Intranet analytics I had to comment away!</p>
<p class="MsoNormal">
<p class="MsoNormal"><cite>angiebb</cite> Says:<br />
<span style="font-size: 10pt;"><a href="../?p=3#comment-3">December 28th, 2008 at 3:34 pm</a> </span></p>
<p>Thomas, thanks for you insights. It’s great that your organization does analytics on your intranet sites. That’s one area where many companies fall down, and terribly.</p>
<p>I completely agree with you that there are times when increasing time spent is actually better, and your examples are good. When I wrote the post, I was imagining a company that does little to no analytics on their intranet, and TOS is a quick and dirty way to get someone’s attention: enough to know that this stuff definitely IS worth analyzing, and it might not take much in terms of improvement to completely recoup whatever investment is made in analytics capabilities (people and/or tools).</p>
<p>I appreciate your comments!</p>
<h3>Trackbacks</h3>
<ol type="1">
<li class="MsoNormal"><a href="http://bosilytics.wordpress.com/2008/12/30/times-are-rough-what-are-your-troops-doing/">Times      are rough &#8211; what are your troops doing? « Hear &#8211; Listen &#8211; Communicate</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://showmeanalytics.com/2008/12/if-you-pay-your-visitors%e2%80%99-salaries-then-it-pays-to-optimize/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
