<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>searchengineland.com &#187; Stats: Freshness</title>
	<atom:link href="http://searchengineland.com/library/stats/stats-freshness/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: Must Read News About Search Marketing &#38; Search Engines</description>
	<lastBuildDate>Sat, 21 Nov 2009 03:30:01 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Google&#8217;s Last Visited Time Stamp Gets Down To The Minute</title>
		<link>http://searchengineland.com/googles-last-visited-time-stamp-gets-down-to-the-minute-11839</link>
		<comments>http://searchengineland.com/googles-last-visited-time-stamp-gets-down-to-the-minute-11839#comments</comments>
		<pubDate>Wed, 01 Aug 2007 12:41:05 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: Web Search]]></category>
		<category><![CDATA[Search Features: Dates]]></category>
		<category><![CDATA[Stats: Freshness]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/googles-last-visited-time-stamp-gets-down-to-the-minute-11839.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogles-last-visited-time-stamp-gets-down-to-the-minute-11839"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogles-last-visited-time-stamp-gets-down-to-the-minute-11839" height="61" width="51" /></a></div><p><a href="http://www.seroundtable.com/archives/014350.html">Google Cache Showing Last Retrieve Dates in Minutes</a> at Search Engine Roundtable shows how Google is now showing when it last visited some pages on a per minute or per hour basis, rather than in the traditional per day style.</p>
<p>For example, this is how Google has normally shown last visit dates for some pages:</p>
<p><a href="http://www.flickr.com/photos/rustybrick/974205556/" title="Photo Sharing"><img src="http://farm2.static.flickr.com/1070/974205556_cc223e8c7f_o.gif" width="319" height="65" alt="Google Cache Date by Minute" /></a></p>
<p>Now some pages are getting a time stamp showing they were last visited within minutes or hours, rather than days:</p>
<p><a href="http://www.flickr.com/photos/rustybrick/974205516/" title="Photo Sharing"><img src="http://farm2.static.flickr.com/1206/974205516_25bd959c68_o.gif" width="352" height="64" alt="Google Cache Date by Minute" /></a></p>
<p><span id="more-11839"></span>
You should be able to reproduce this yourself by going to <a href="http://66.249.89.147/">http://66.249.89.147/</A> and searching for some news sites that Google is known to spider on a frequent basis, such as the BBC, CNN, and so on. Here&#8217;s an example for the BBC: <a href="http://66.249.89.147/search?q=site%3Abbc.co.uk">site:bbc.co.uk</a>.
<strong>
Postscript From Danny:</strong> My <a href="http://searchengineland.com/070227-154718.php">Squeezing The Search Loaf: Finding Search Engine Freshness &#038; Crawl Dates</a> article from last February covers how the various search engines show dates for when they last visited a page, whether it be on the search results page, on cached pages or through webmaster tools. At that time, Google was being inconsistent in how it showed dates for some pages and never provided an official, on-the-record explanation. Now it seems likely they&#8217;re trying to be more consistent and, in fact, may announce how they hit some pages within minutes. As Barry said, we&#8217;re checking on this.</p>
<p><Strong>Postscript From Barry:</strong> Here is a response from Google:</p>
<blockquote><p>We are always working on innovative ways to improve our index and provide users with relevant information. What you are seeing is an experiment that we are currently testing. We run between 50-200 search experiments at any given time and based on user feedback, we may or may not develop new features and products.  For more information on our live experiments, please visit http://googleblog.blogspot.com/2006/04/this-is-test-this-is-only-test.html.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/googles-last-visited-time-stamp-gets-down-to-the-minute-11839/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Finding Search Engine Freshness &amp; Crawl Dates</title>
		<link>http://searchengineland.com/squeezing-the-search-loaf-finding-search-engine-freshness-crawl-dates-10619</link>
		<comments>http://searchengineland.com/squeezing-the-search-loaf-finding-search-engine-freshness-crawl-dates-10619#comments</comments>
		<pubDate>Tue, 27 Feb 2007 19:47:18 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Ask: SEO]]></category>
		<category><![CDATA[Ask: Web Search]]></category>
		<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Microsoft: Bing]]></category>
		<category><![CDATA[Microsoft: Bing SEO]]></category>
		<category><![CDATA[SEO: Blocking Spiders]]></category>
		<category><![CDATA[SEO: General]]></category>
		<category><![CDATA[SEO: Titles & Descriptions]]></category>
		<category><![CDATA[Search Features: Dates]]></category>
		<category><![CDATA[Stats: Freshness]]></category>
		<category><![CDATA[Yahoo: SEO]]></category>
		<category><![CDATA[Yahoo: Site Explorer]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/squeezing-the-search-loaf-finding-search-engine-freshness-crawl-dates-10619.php</guid>
		<description><![CDATA[A reader emailed me today noticing that Google was showing a date next to his
listing, which made me think this was a good time to revisit how, when
and where search engines show crawl dates for pages. These dates are a useful
way for site owners to understand how often they are being revisited or for
anyone to [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fsqueezing-the-search-loaf-finding-search-engine-freshness-crawl-dates-10619"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fsqueezing-the-search-loaf-finding-search-engine-freshness-crawl-dates-10619" height="61" width="51" /></a></div><p>A reader emailed me today noticing that Google was showing a date next to his<br />
listing, which made me think this was a good time to revisit how, when<br />
and where search engines show crawl dates for pages. These dates are a useful<br />
way for site owners to understand how often they are being revisited or for<br />
anyone to &quot;squeeze the loaf&quot; of a search engine to see how fresh it is. Here&#8217;s a<br />
search engine-by-search engine rundown on date display. I&#8217;ll also cover how<br />
we&#8217;ve sadly lost crawl dates being embedded next to listings, over the years.<br />
But that&#8217;s not all! Read now and you&#8217;ll even get a free at-a-glance table<br />
explaining how dates are displayed. Read now &#8212; web server operators are<br />
standing by!</p>
<p><span id="more-10619"></span></p>
<p><b>Google</b></p>
<p>When you do a search, some pages may show a date below the description of a<br />
listing, as illustrated below:</p>
<p><a href="http://www.flickr.com/photos/dannysullivan/404957359/" title="Photo Sharing"><br />
<img src="http://farm1.static.flickr.com/134/404957359_1aa1d4fb0d_o.jpg" width="464" height="283" alt="Crawl Dates At Google" border="0" /></a></p>
<p>I thought Google had long done this for certain pages that it revisits on a<br />
super-frequent basis. And when I did a search for<br />
<a href="http://www.google.com/search?hl=en&#038;q=cars">cars</a> today, I saw a date<br />
like this coming up for the cars.com listing as shown above. An hour later, the date was<br />
gone. I then tried that search again using a particular Google data center,<br />
rather than whatever data center was assigned to my browser randomly. Doing the<br />
<a href="http://64.233.161.107/search?hl=en&#038;q=cars&#038;btnG=Google+Search">same</a><br />
search at that data center gave me dates again.</p>
<p>I&#8217;m checking with Google on how long dates have been showing and why they may<br />
come and go as I saw today. I&#8217;ll postscript what I&#8217;m told at the end of this<br />
story.  </p>
<p>The example above shows that only some pages have dates. In contrast, the<br />
<a href="http://www.google.com/help/features.html#cached">Google Cache</a> can<br />
give you dates for nearly any web page.</p>
<p>The Google Cache allows you to view a copy of a page that is stored on<br />
Google&#8217;s servers, rather from the website directly. (Don&#8217;t like Google caching this<br />
for your site? Learn how to prevent it<br />
<a href="http://googleblog.blogspot.com/2007/02/robots-exclusion-protocol.html"><br />
here</a> and<br />
<a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35306"><br />
here</a>. Don&#8217;t see a cached link option? Then the site owner is blocking<br />
caching). </p>
<p>Going back to our search for<br />
<a href="http://www.google.com/search?hl=en&#038;q=cars">cars</a> and the screenshot<br />
above, you&#8217;ll see that the disney.go.com listing doesn&#8217;t have a date next to it.<br />
To find the date the page was visited, you have to click on the link that says &quot;Cached&quot; under the description of that<br />
listing. That makes the cached page load like<br />
<a href="http://209.85.135.104/search?q=cache:a9XNRIJY7JsJ:disney.go.com/disneypictures/cars/+cars&#038;hl=en&#038;ct=clnk&#038;cd=2"><br />
this</a>. At the top of that page, you&#8217;ll see this:</p>
<blockquote>
<p>This is Google&#8217;s<br />
<a href="http://www.google.com/intl/en/help/features.html#cached"><br />
cache</a> of<br />
<a href="http://disney.go.com/disneypictures/cars/">http://disney.go.com/disneypictures/cars/</a> as retrieved on<br />
<b>22 Feb<br />
2007 14:34:08 GMT</b>.</p>
</blockquote>
<p>See the date and time, which I&#8217;ve put in bold? That&#8217;s when the page was last visited by Google.</p>
<p>FYI, before<br />
September 2006, that date reflected the last time Google found the page to have<br />
changed, not when it was last visited. In other words, if Google visited the<br />
page in January 2005, then revisited it throughout the year but the page never<br />
changed, the cached date would keep saying January 2005.</p>
<p>Since September 2006,<br />
that&#8217;s been different. The date was altered to reflect the last time Google<br />
visited the page &#8212; a good change to make. Google explains more about this on<br />
the Google Webmaster Central blog<br />
<a href="http://googlewebmastercentral.blogspot.com/2006/09/better-details-about-when-googlebot.html"><br />
here</a>, and Google&#8217;s Matt Cutts also did a video about it<br />
<a href="http://www.mattcutts.com/blog/video-crawl-dates-in-the-google-cache/"><br />
here</a>.</p>
<p>The options above allow anyone to see the freshness of any pages within<br />
Google, one page at a time (as long as they are cached). What if you want to get industrial strength<br />
and view the freshness of all your own pages at once?<br />
Unfortunately, the <a href="http://www.google.com/webmasters/">Google Webmaster<br />
Central</a> tools don&#8217;t let you see the last time all your pages were spidered.<br />
But that&#8217;s something they&#8217;re considering for the future. The tools will,<br />
however, show you any problems Google had in reaching any of your pages and the<br />
last time a crawl error happened for those pages. Using the &quot;Crawl rate&quot; option<br />
found under the Diagnostics tab, you can also see a general graph of crawling<br />
activity to your site.</p>
<p>There is one other type of date that you might see associated with<br />
listings that has nothing to do when the page was crawled. Look here:</p>
<p><a href="http://www.flickr.com/photos/dannysullivan/404957713/" title="Photo Sharing"><br />
<img src="http://farm1.static.flickr.com/183/404957713_436e6bac7c.jpg" width="500" height="74" alt="Google Personalized Search Last Visit Date" border="0" /></a></p>
<p>See the &quot;3 visits &#8211; Feb 14&quot; part? That&#8217;s coming from<br />
<a href="http://www.google.com/support/bin/topic.py?topic=1593">Google<br />
Personalized Search</a> and shows that I&#8217;ve clicked on that listing 3 times,<br />
with the last visit being on Feb. 14. My<br />
<a href="http://searchengineland.com/070202-224617.php">Google Ramps Up<br />
Personalized Search</a> article from earlier this month explains more about how<br />
Google Personalized Search works and can be disabled, if you don&#8217;t like it on,<br />
as now happens much more often.</p>
<p><b>Microsoft Live Search</b></p>
<p><a href="http://www.live.com/">Microsoft Live Search</a> operates like<br />
Google. Some pages show dates next to them, as I&#8217;ve highlighted below:</p>
<p><a href="http://www.flickr.com/photos/dannysullivan/404957514/" title="Photo Sharing"><br />
<img src="http://farm1.static.flickr.com/164/404957514_a66ae9378a.jpg" width="500" height="304" alt="Crawl Date At Microsoft Windows Live" border="0" /></a></p>
<p>As with Google, this seems to happen with pages that are being spidered<br />
frequently, but I&#8217;ll check on this. Does a page lack a date? Then click on the<br />
&quot;cached page&quot; link. When the cached page loads, you&#8217;ll see something like this<br />
at the top of it:</p>
<blockquote>
<p>This is a version of<br />
<a href="http://www.pixar.com/theater/trailers/cars/index.html"><br />
http://www.pixar.com/theater/trailers/cars/index.html</a> as it looked when<br />
our crawler examined the site on <b>2/16/2007</b>. The page you see below is the<br />
version in our index that was used to rank this page in the results to your<br />
recent query.</p>
</blockquote>
<p>The date (which I&#8217;ve but in bold above) tells you when the page was last spidered.</p>
<p>Don&#8217;t see a cached page<br />
option? The site owner is probably blocking caching. Are you a site owner that wants to<br />
block caching? Visit the<br />
<a href="http://help.live.com/help.aspx?project=wl_webmasters">help area</a> at<br />
Live and<br />
search for &quot;cache&quot; to find more info. I&#8217;d point you to the right place, but it<br />
remains impossible to link to particular pages in Microsoft&#8217;s absurd help<br />
system.</p>
<p><em></p>
<p>[<strong>Postscript</strong>: Microsoft sent this information: "We only show the last-crawl date when it is within a few days. This is a decision to draw attention to the freshest content without highlighting older content. Crawl dates for other documents can be found by looking at the cached page."]</p>
<p></em></p>
<p><b>Ask.com</b></p>
<p>At <a href="http://www.ask.com/">Ask.com</a>, you can only get  dates by looking at the cached pages,<br />
similar to how that works at Google and Microsoft. Click on the &quot;Cached&quot; link that you&#8217;ll see<br />
next to the URL of a listing, as highlighted below:</p>
<p><a href="http://www.flickr.com/photos/dannysullivan/404957539/" title="Photo Sharing"><br />
<img src="http://farm1.static.flickr.com/150/404957539_cdad4bf8ff_o.jpg" width="459" height="80" alt="Crawl Date At Ask.com" border="0" /></a></p>
<p>At the top of the page, you&#8217;ll see something like this with the date and time<br />
(shown in bold below) that the page was last visited:</p>
<blockquote>
<p>Below is a cache or saved snapshot of&nbsp;<br />
<a href="http://www.cars.com/"><br />
http://www.cars.com/</a>&nbsp; as we found it on <b>February 19, 2007 1:24:56 AM</b>. </p>
</blockquote>
<p><b>Yahoo</b></p>
<p>At <a href="http://www.yahoo.com/">Yahoo</a>, you can only get  dates one way, through using<br />
<a href="http://siteexplorer.search.yahoo.com/">Yahoo Site Explorer</a>. You&#8217;ll<br />
have to create an account for your web site, then authenticate your account,<br />
then you&#8217;ll be shown last crawl dates as I&#8217;ve highlighted in the first listing<br />
below:</p>
<p><a href="http://www.flickr.com/photos/dannysullivan/404957439/" title="Photo Sharing"><br />
<img src="http://farm1.static.flickr.com/169/404957439_0f90f804d2.jpg" width="500" height="238" alt="Crawl Date At Yahoo Site Explorer" border="0" /></a></p>
<p>More than any other search engine, Yahoo makes it easy for a site owner to<br />
see the freshness of many  pages all at once. However, the huge disadvantage from a<br />
searcher perspective is that you can&#8217;t spot check the freshness of any page you<br />
randomly select. </p>
<p><b>The Date &amp; Freshness Table</b></p>
<p>I love nothing more than doing tables, so let&#8217;s put everything above into a<br />
nice one:</p>
<table border="1" cellpadding="3" cellspacing="0" style="border-collapse: collapse" bordercolor="#111111" width="500" height="99" bordercolorlight="#000000" bordercolordark="#000000" bgcolor="#FFFFFF">
<tr>
<td width="165" align="center" height="24"><b><font size="2">Feature</font></b></td>
<td width="83" align="center" height="24"><b><font size="2">Ask</font></b></td>
<td width="83" align="center" height="24"><b><font size="2">Google</font></b></td>
<td width="83" align="center" height="24"><b><font size="2">Microsoft</font></b></td>
<td width="83" align="center" height="24"><b><font size="2">Yahoo</font></b></td>
</tr>
<tr>
<td width="165" align="center" height="25"><font size="2">Dates Next<br />
To Listings?</font></td>
<td width="83" align="center" height="25"><font size="2">No</font></td>
<td width="83" align="center" height="25"><font size="2">Some</font></td>
<td width="83" align="center" height="25"><font size="2">Some</font></td>
<td width="83" align="center" height="25"><font size="2">No</font></td>
</tr>
<tr>
<td width="165" align="center" height="25"><font size="2">Dates On<br />
Cached Pages?</font></td>
<td width="83" align="center" height="25"><font size="2">Yes</font></td>
<td width="83" align="center" height="25"><font size="2">Yes</font></td>
<td width="83" align="center" height="25"><font size="2">Yes</font></td>
<td width="83" align="center" height="25"><font size="2">No</font></td>
</tr>
<tr>
<td width="165" align="center" height="25"><font size="2">Dates In <br />
Webmaster Tools?</font></td>
<td width="83" align="center" height="25"><font size="2">No <br />
Tools</font></td>
<td width="83" align="center" height="25"><font size="2">For Errors &amp; Home<br />
Page</font></td>
<td width="83" align="center" height="25"><font size="2">No<br />
Tools</font></td>
<td width="83" align="center" height="25"><font size="2">Yes</font></td>
</tr>
</table>
<p>Ideally, I&#8217;d like to see that top row &#8212; &quot;Dates Next To Listings?&quot; &#8212; be<br />
completely &quot;Yes.&quot; Some site owners block caching, which makes it hard to measure<br />
freshness. Putting the dates right next to the listings makes it easy for anyone<br />
who cares to see at a glance if a search engine is stale or fresh.</p>
<p>In fact, I have to laugh. I&#8217;ve been asking for this for years. On the old<br />
features chart I used to maintain about dates, I<br />
<a href="http://searchenginewatch.com/showPage.html?page=2155971#datedisplay"><br />
wrote</a> in 2001:</p>
<blockquote>
<p>Along with the page description, some search engines show the date when a web<br />
page was created or modified. As noted above, these dates may not always be<br />
reliable. However, they do provide a useful clue as to how fresh or stale a<br />
search engine&#8217;s listings are. Thus, search engines that show a date deserve<br />
praise for doing so. </p>
</blockquote>
<p>That was from 2001! Nearly six years later, it&#8217;s still the case that dates<br />
aren&#8217;t being shown. In fact, it&#8217;s a reversal. Back in 2001, the major search<br />
engines of AltaVista, HotBot (Inktomi) and Northern Light all showed dates for<br />
all listings right within search results. Fast forward to today, and none of the<br />
major search engines do.</p>
<p>The reason is simple enough. Over time, the search engines either couldn&#8217;t<br />
maintain freshness or didn&#8217;t want to show they were sometimes stale. So dates<br />
either went away or never got added. C&#8217;mon gang &#8212; time to bring them back right<br />
into the search results. If they aren&#8217;t there by default, make it an option<br />
people can enable.</p>
<p><b>Verifying Freshness</b></p>
<p>In the meantime, there&#8217;s a favorite tactic for those search watchers who want<br />
to track freshness. Google&#8217;s Matt Cutts<br />
<a href="http://www.mattcutts.com/blog/measuring-freshness/">once wrote</a><br />
about this back in 2005, describing exactly a technique I and others have long<br />
used. You simply find a page that you know carries a date that&#8217;s constantly<br />
updated. Look at the cached page and see what the time and date says on it. </p>
<p>But Yahoo doesn&#8217;t show a date on cached pages! No, it doesn&#8217;t, but you&#8217;re not<br />
looking for the date that the search engine inserts. You want the date on the<br />
page itself. For example,<br />
<a href="http://216.109.125.130/search/cache?p=cnn&#038;ei=UTF-8&#038;fr=FP-tab-web-t&#038;x=wrt&#038;subscr=WSJ&#038;u=cnn.com/&#038;w=cnn&#038;d=ACErxhIeOWzO&#038;icp=1&#038;.intl=us"><br />
here&#8217;s</a> the cached page over at Yahoo for CNN:</p>
<p><a href="http://www.flickr.com/photos/dannysullivan/404957666/" title="Photo Sharing"><br />
<img src="http://farm1.static.flickr.com/183/404957666_87d0d81785.jpg" width="500" height="366" alt="Finding Dates On Cached Pages" border="0" /></a></p>
<p>See the part I highlighted in red, that says:</p>
<blockquote>
<p><b>UPDATED:</b> 3:53&nbsp;a.m.&nbsp;EST,&nbsp;February 26, 2007</p>
</blockquote>
<p>That&#8217;s the date that CNN had on its own page when the Yahoo spider last<br />
visited. When I looked, the date and time was 3:10 pm EST on February 27 &#8212; so<br />
the page is only 12 hours old. Not bad in this case, but I wouldn&#8217;t expect a<br />
major news site to be much out of date.</p>
<p><b>Return Of The Freshness Guarantee?</b></p>
<p>Finally, I&#8217;ll leave you with this trip down memory lane. Back in June 1999,<br />
AltaVista once offered a freshness guarantee that was quickly broken. As I<br />
<a href="http://searchenginewatch.com/showPage.html?page=2167591">wrote</a> at<br />
the time:</p>
<blockquote>
<p><i>&quot;AltaVista search is able to<br />
make its Freshness Guarantee: no search site will have fresher results than<br />
AltaVista.&quot;</i></p>
<p>AltaVista unveiled its first<br />
&quot;Freshness Guarantee&quot; back when it relaunched in June, promising that its<br />
entire index would be refreshed at least once per month. That guarantee was<br />
almost immediately broken, as even AltaVista President Rod Schrock admitted<br />
when we talked recently. &quot;We turned our attention to this new system,&quot; Schrock<br />
said.</p>
<p>OK, fair enough &#8212; they wanted<br />
to build something even better. But this new guarantee has already been<br />
broken, as described above. If claims like these are going to be made, then<br />
they should actually be met. And not to meet them in the midst of a huge media<br />
blitz is an incredible blunder.</p>
</blockquote>
<p>Freshness is one important component to what makes a good search engine. It&#8217;s<br />
not the only thing. Having fresh results means nothing if the results aren&#8217;t<br />
relevant. And some pages don&#8217;t need to be spidered that often. But putting dates<br />
next to listings is an easy form of search &quot;food&quot; labeling that can give<br />
reassurance about a major search engines. Surely it&#8217;s time for dates to make a<br />
comeback.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/squeezing-the-search-loaf-finding-search-engine-freshness-crawl-dates-10619/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
