<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>searchengineland.com &#187; SEM Tools: URL Shorteners</title>
	<atom:link href="http://searchengineland.com/library/sem-tools/sem-tools-url-shorteners/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: Must Read News About Search Marketing &#38; Search Engines</description>
	<lastBuildDate>Mon, 23 Nov 2009 05:34:29 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Is Twitter Sending You 500% To 1600% More Traffic Than You Might Think?</title>
		<link>http://searchengineland.com/is-twitter-sending-you-500-to-1600-more-traffic-than-you-might-think-22696</link>
		<comments>http://searchengineland.com/is-twitter-sending-you-500-to-1600-more-traffic-than-you-might-think-22696#comments</comments>
		<pubDate>Sat, 18 Jul 2009 18:48:36 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[SEM Tools: URL Shorteners]]></category>
		<category><![CDATA[SEM Tools: Web Analytics]]></category>
		<category><![CDATA[Top News]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=22696</guid>
		<description><![CDATA[Earlier I posted that Google Analytics and other JavaScript-based tracking  tools might be undercounting visits from Twitter. I&#8217;ve done some more digging,  which supports the case. In my test, Twitter seems to have sent 500% to 1600% more traffic than log files or hosted stats packages like Google Analytics might show.
How  Twitter [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fis-twitter-sending-you-500-to-1600-more-traffic-than-you-might-think-22696"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fis-twitter-sending-you-500-to-1600-more-traffic-than-you-might-think-22696" height="61" width="51" /></a></div><p>Earlier I posted that Google Analytics and other JavaScript-based tracking  tools might be undercounting visits from Twitter. I&#8217;ve done some more digging,  which supports the case. In my test, Twitter seems to have sent 500% to 1600% more traffic than log files or hosted stats packages like Google Analytics might show.</p>
<p><a href="../../how-twitter-might-send-far-more-traffic-than-you-think-21482">How  Twitter Might Send Far More Traffic Than You Think</a> is my earlier article  that explains how I&#8217;d often seen big gaps in how many people apparently clicked  on a tweeted link as measured by Bit.ly versus how many page views that Google  Analytics was showing.</p>
<p>To test this further, I tweeted a particular <a href="http://daggle.com/mothers-cookies-closes-the-sadness-for-products-i-no-longer-have-389?utm_source=twitter&amp;utm_medium=micro-blog&amp;utm_campaign=twitter">page</a> on my personal blog along with tracking code designed to especially help ensure  it appeared in Google Analytics. I&#8217;m going to toss out a bunch of numbers as  part of this analysis. If they get confusing, skip to the end for the  conclusion.</p>
<p><strong>The Numbers Bit</strong></p>
<p>For July 7, Bit.ly <a href="http://bit.ly/info/cHXSP">reported</a> that the  page had registered 58 clicks. Were there 58 corresponding page views? No.  Google Analytics only reported 17 page views from 11 unique users. That meant a  gap of 41 views.</p>
<p>Was the gap due to clicks from non-human robots that don&#8217;t process Google  Analytics JavaScript tracking code? Visits from people using mobile browsers  that didn&#8217;t get tracked, because they might not process the code? To explore  further, I went to the raw log files, the records that the server itself keeps.  These shows any request made for the page, regardless of any JavaScript  issues.</p>
<p>I found that there had been 57 total requests &#8212; practically the same as  Bit.ly reported. However, 14 of these were for the page without the tracking  codes I&#8217;d used when tweeting the page through Bit.ly.</p>
<p>In other words, this is the URL I put out through Bit.ly, which was reported  to receive 58 visits:</p>
<blockquote><p><a href="http://mothers-cookies-closes-the-sadness-for-products-i-no-longer-have-389/?utm_source=twitter&amp;utm_medium=micro-blog&amp;utm_campaign=twitter">http:///mothers-cookies-closes-the-sadness-for-products-i-no-longer-have-389<strong>?utm_source=twitter&amp;utm_medium=micro-blog&amp;utm_campaign=twitter</strong></a><strong> </strong></p></blockquote>
<p>See the part in bold? Those are tracking codes or parameters. From the log  files, I found that URL above (with the codes) had 43 visits (not 58) and the  same page without tracking codes like this received 14 further visits:</p>
<blockquote><p><a href="http://mothers-cookies-closes-the-sadness-for-products-i-no-longer-have-389/">http:///mothers-cookies-closes-the-sadness-for-products-i-no-longer-have-389</a></p></blockquote>
<p>Those 14 visits without tracking codes almost all came from robots (Google:  5; Yanga: 4; Microsoft: 3). Two other visits seemed to be from humans. These  robotic visits all likely had nothing to do with my tweet. The requests were  from spiders doing their regular crawls of the web, it seems. The few human  visits to the page without tracking codes were probably people who came to my  article for reasons unconnected with the tweet.</p>
<p>What about those other 43 visits to the page that did have the tracking code?  Well, 11 visits were from what appeared to be robots (OneRiot: 1; PycURL: 2;  Ginxbot: 2; WebShot: 1; Google: 2; Tweetmeme: 1; Python-urllib: 1; LongURL API:  1).</p>
<p>That left 32 visits that appear to be from humans. That&#8217;s almost between the  58 views Bit.ly reported and the 17 page views Google Analytics reflected. Why  still such a gap with GA?</p>
<p>One leading argument has been that some Twitter applications on mobile  devices load pages within the application, rather than using an external  browser, and so aren&#8217;t getting registered by Google Analytics. Also, some mobile  browsers might not process JavaScript. I could see at least four iPhone-based  requests like this. But there were plenty of other requests that appear to be  from full-fledged desktop-based browsers. Why weren&#8217;t they showing up?</p>
<p>One clue is that of the 34 requests, only 5 of them contained &#8220;referrer&#8221;  data, information that some browsers pass on that indicate how they found the  page in the first place. For Google Analytics (or ANY analytics program) to  properly indicate how much traffic a particular site is driving, it needs as  much referrer data as it can get.</p>
<p>Of those referrers, only 2 of them were from the twitter.com domain (1 more  was from my own blog&#8217;s domain, 1 from iconfactory.com/twitterific, probably  indicating a Twitterific users, and one from powertwitter.me, probably  indicating a Twitter-visit via a Firefox plug-in).</p>
<p>In short, based on referrer traffic alone, ANY analysis program would have  reported that at best, Twitter sent my page only 2 visits. Yet, both Google  Analytics and Bit.ly reported that it received far more than that.</p>
<p>Remember, Google Analytics said the page had received 17 views in all, 11  from unique users. How many of those 11 unique users came to the page via  Twitter? Google Analytics said 9. One more came directly, it said; another  person did a search to find it (mother&#8217;s cookie site:daggle.com was the search,  which was me locating the article. Oddly, this request does NOT appear in the  raw log files).</p>
<p><strong>The Big Conclusion</strong></p>
<p>All those earlier numbers hurt your head? Here are the most meaningful ones.  Thanks for hanging in there!</p>
<p>Based only on referrers, at best, Google or any analytics program would have  said Twitter sent 2 visits. But because I used tracking codes, I was able to  overcome the lack of referring data and see that Twitter (itself or via  applications or web sites using Twitter data) sent 9 visits. That means  analytics packages might be undercounting Twitter visits by nearly 500%.</p>
<p>Meanwhile, Bit.ly was showing those 58 clicks to the page. Let&#8217;s say it  wasn&#8217;t filtering out some of the robots. I can still see that there are 32  visits that the log files recorded, all with the tracking codes that never  existed until I tweeted the link with them. So those are all Twitter-derived  visits. That means an undercount by a standard analytics tool depending on  referrer data by 1600%.</p>
<p><strong>And The Analytics Companies Say?</strong></p>
<p>I sent my logs to both Bit.ly and Google, along with a draft of this  article, for any reaction.</p>
<p>Google said they&#8217;re aware that activity on mobile devices can cause issues  with tracking and that they&#8217;re looking for ways to improve their product.</p>
<p>Bit.ly said they filter out robotic clicks such as Ginxbot, Google, and  Python-urllib, through PycURL. When I asked further about the gap, they emailed  back:</p>
<p>It looks like three types of events make up the delta.</p>
<blockquote><p>First, browser plug-ins and automated url-lengthener applications, which make  requests to the bit.ly URL, but don&#8217;t follow the redirect to the destination  site.</p>
<p>One example is the &#8220;eventBox&#8221; at http://thecosmicmachine.com. Here&#8217;s how it  appears in the logs:</p>
<p>(eventBox) : &#8211; - [07/Jul/2009:20:41:31 -0400] &#8220;GET /cHXSP HTTP/1.1&#8243; 301 410  &#8220;-&#8221; &#8220;EventBox567 CFNetwork/438.12 Darwin/9.7.0 (i386) (iMac9%2C1)&#8221; 301</p>
<p>Second, small bots that make their way through our screening system:</p>
<p>(slicehost): &#8211; - [07/Jul/2009:21:05:43 -0400] &#8220;GET /cHXSP HTTP/1.1&#8243; 301 410  &#8220;-&#8221; &#8220;-&#8221; 301</p>
<p>Third, browsers which don&#8217;t support JavaScript, as well as browsers with  JavaScript settings turned off and browsers running JavaScript-blocking  extensions like noscript.</p></blockquote>
<p><strong>And Some Related Reading</strong></p>
<p>Last week, Fred Wilson posted <a href="http://www.avc.com/a_vc/2009/07/does-this-blog-get-more-traffic-from-google-or-twitter.html">Does  This Blog Get More Traffic From Google or Twitter?</a>, finding that for his  personal blog, Twitter traffic has risen past Google search traffic. Fred  suspected that the Twitter traffic was even more than being shown, due to  undercounting. I think he&#8217;s right. While I think Google search traffic still  remains a major traffic driver for many sites, those who have lots of Twitter  followers or have a story go &#8220;hot&#8221; through retweets certainly may discover  Twitter is a new major traffic resource &#8211;and one that&#8217;s likely  undercounted.</p>
<p>Over at the Zebu Blog, <a href="http://zebugroup.com/blog/2009/06/link-tracking-lies-damn-lies-statistics/">Link  Tracking &#8211; (lies, damn lies &amp;) Statistics?</a> also looks at the issue,  questioning whether Bit.ly is overcounting. In a follow up <a href="http://zebugroup.com/blog/2009/06/link-tracking-lies-damn-lies-statistics/#comment-508">comment</a>,  Mayank Sharma did his own small scale experiment and found:</p>
<blockquote><p>We created a bit.ly url for this post, and posted it on Twitter. The next  instant we saw, that bit.ly’s count was already 4. This only means that some  twitter crawler/indexer received the tweet and de-referenced the url mentioned  in it. After that I hovered my mouse over the link shown in Twitterfox. Sure  enough bit.ly’s count increased by one. We did this repeatedly from multiple  desktop’s of several friends and the count just kept on increasing. Not one of  these folks during this time had actually clicked on the link.</p></blockquote>
<p>I agree &#8212; Bit.ly seems to be overestimating views. But Google Analytics  seems to be underestimating them, perhaps severely based on my small scale log  analysis program. Using tracking codes occasionally is one way to get a reality  check.</p>
<p>Finally, if you want to add tracking parameters for URLs you tweet, consider  the <a href="https://addons.mozilla.org/en-US/firefox/addon/11492">Snip-n-Tag</a> add-on for Firefox. I&#8217;ve been using it, and it makes adding these to URLs super  easy.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/is-twitter-sending-you-500-to-1600-more-traffic-than-you-might-think-22696/feed</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>How Twitter Might Send Far More Traffic Than You Think</title>
		<link>http://searchengineland.com/how-twitter-might-send-far-more-traffic-than-you-think-21482</link>
		<comments>http://searchengineland.com/how-twitter-might-send-far-more-traffic-than-you-think-21482#comments</comments>
		<pubDate>Wed, 24 Jun 2009 20:43:24 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[SEM Tools: URL Shorteners]]></category>
		<category><![CDATA[SEM Tools: Web Analytics]]></category>
		<category><![CDATA[Top News]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=21482</guid>
		<description><![CDATA[Over the past year, I&#8217;ve seen many people report that Twitter can send tons  of traffic to a web site. Certainly I&#8217;ve seen first-hand how Twitter has become  one of the top non-search referrer sources for Search Engine Land and for some other sites I  oversee. But as much as you think [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fhow-twitter-might-send-far-more-traffic-than-you-think-21482"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fhow-twitter-might-send-far-more-traffic-than-you-think-21482" height="61" width="51" /></a></div><p>Over the past year, I&#8217;ve seen many people report that Twitter can send tons  of traffic to a web site. Certainly I&#8217;ve seen first-hand how Twitter has become  one of the top non-search referrer sources for <a href="http://searchengineland.com/">Search Engine Land</a> and for some other sites I  oversee. But as much as you think Twitter is driving traffic, it might be  sending even more that you&#8217;re unable to measure.</p>
<p>One key culprit may be that a large number people view web pages using  mobile Twitter applications such as <a href="http://www.atebits.com/tweetie-iphone/">Tweetie</a>. Click on a link from  a tweet in some of these applications, and they load web pages using their own  browsers. Those browser may not run JavaScript. And if JavaScript doesn&#8217;t run,  then some hosted metrics tools like <a href="http://www.google.com/analytics/">Google Analytics</a> never realize that your  page was viewed.</p>
<p>Consider this recent example. I wrote an article on my personal blog about  the collapse of the Clear airport security program last night and how it seems  likely that alternatives to it will also go away (<a href="http://daggle.com/tsa-stays-silent-on-registered-traveler-meltdown-758">TSA  Stays Silent As Its Registered Traveler Program Melts Down</a>).</p>
<p>I tweeted that article using <a href="http://bit.ly/">Bit.ly</a>, a <a href="../../analysis-which-url-shortening-service-should-you-use-17204">URL  shortener</a> that allows me to track how many people clicked on the link I  tweeted:</p>
<p><a title="Bit.ly Stats by search-engine-land, on Flickr" href="http://www.flickr.com/photos/searchengineland/3657420677/"><img src="http://farm3.static.flickr.com/2468/3657420677_bf337e2652.jpg" border="0" alt="Bit.ly Stats" width="500" height="85" /></a></p>
<p>The screenshot shows that Bit.ly <a href="http://bit.ly/info/r3BfL">had seen</a> 343 total clicks to that URL  through its system (when I did the screenshot), 339 of them coming from the shortened version I created. So  Google Analytics ought to show me roughly that same number of people coming to the page from  Twitter, right?</p>
<p>Nope. Google Analytics reports that Twitter sent 63 visits:</p>
<p><a title="Google Analytics Stats by search-engine-land, on Flickr" href="http://www.flickr.com/photos/searchengineland/3657420755/"><img src="http://farm4.static.flickr.com/3618/3657420755_0a1ebfd0e2.jpg" border="0" alt="Google Analytics Stats" width="500" height="374" /></a></p>
<p>I know some visits from today have yet to be reflected in Google Analytics.  That&#8217;s why you see the line drop. But I also know that those visits, based on my  Bit.ly data, should be about the same as yesterday. In short, at best, Google  Analytics is going to report I had about 130 visitors to the article from  Twitter. But Bit.ly&#8217;s saying I had about 340 visits. Where are the missing 200  or so people? How come Bit.ly&#8217;s showing more than three times the visits overall  to the page than Google Analytics does. I&#8217;ve seen this time-and-time again for various stories, sometimes with gaps of several thousand visits.</p>
<p>This is where the JavaScript stuff I&#8217;ve mentioned comes in. It seems to be the  best candidate for explaining the discrepancy. Apparently, a lot of people on  Twitter use mobile applications, or view from mobile devices, that aren&#8217;t processing the Google Analytics tracking code. There are also a few other reasons, as Bit.ly&#8217;s general manager Andrew  Cohen explained to me in email:</p>
<blockquote><p>Good question. You&#8217;re right about embedded browsers that don&#8217;t run  JavaScript. Clients that decode bit.ly links but don&#8217;t click through to the  underlying content are another challenge. If you were to download a browser  plug-in to automatically expand short urls, for instance, it would look a lot  like a human user. We&#8217;d see a GET request from an API, return the underlying  long URL, and increment the click count &#8230; but we wouldn&#8217;t be able to see the  actual CTR [clickthrough rate].</p>
<p>Absent JavaScript on the page, it can be tough to distinguish between a  decode event and an actual click-through. That&#8217;s why we think that bit.ly  complements rather than replaces JavaScript-based packages like Google Analytics  or Chartbeat. One precaution that we&#8217;ve taken, though, is to filter out HEAD  requests:</p>
<p><a href="http://blog.bit.ly/post/89178273/talking-heads">http://blog.bit.ly/post/89178273/talking-heads</a></p>
<p>We&#8217;re also working on some predictive modeling techniques to screen out more  bots and scrapers from our click totals.</p>
<p>You might be interested in this <a href="http://www.technologyreview.com/business/22122/">article</a> from the MIT  Technology Review &#8212; http://bit.ly/qrjrf &#8212; which shows that these kinds of  metrics issues are an industry-wide challenge.</p></blockquote>
<p>I plan to look at the issue more in the future, but I wanted to get a  post out there now with my initial findings and see what others have to say  (please comment below!).</p>
<p>Somewhat related, also consider reading this post from Bit.ly: <a href="http://blog.bit.ly/post/94233211/registered-applications-and-better-click-referrer">Registered  Applications And Better Click Referrer Data</a>.</p>
<p>Just as some referrers might  not be seen if JavaScript isn&#8217;t loaded, some of the ways people are following  tweeted links outside of Twitter aren&#8217;t registering, because the applications  don&#8217;t send any referrer information. So, you&#8217;ll see these as &#8220;direct&#8221; visits as if someone typed in a URL, rather than clicking on a link. But by working with vendors, Bit.ly is  trying to improve the picture (other tracking providers may also be doing the  same, too).</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/how-twitter-might-send-far-more-traffic-than-you-think-21482/feed</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
	</channel>
</rss>
