Apr 19, 2007 at 4:34pm ET by Danny Sullivan
In my story about Google’s new content removal tools, I highlighted the fact that anyone can potentially wipe out the cached copy of a page, even if they aren’t the page’s author. To illustrate this, I was going to use as an example how the official George W. Bush page’s cached copy could get nixed. To my amazement, I found the cached copy was gone already. Had the White House decided to wipe out the cache, perhaps as damage control after the page started ranking for "failure" again recently? And why were Yahoo, Live.com and Ask still showing cached versions of that page? Was the White House — gasp! – cloaking only Google with a noarchive command?
Not at all! But it was a real mystery. As it turns out, some pages that Google revisits on a super-fast basis — pages that have dates next to them in the listings — might no longer have cached pages showing even when they should. Google says this is due to a bug they’re now checking on.
To illustrate this, look below:
Notice how any pages from the whitehouse.gov site with dates next to them do NOT have cached copies? And pages with dates DO have cached copies?
I see the same thing with other sites (site:searchengineland.com, for example) but I can also see it NOT happening in some cases (site:cnn.com, for example).
As I said, Google’s now aware of the issue and looking into it. And why do some pages get dates? Squeezing The Search Loaf: Finding Search Engine Freshness & Crawl Dates from me earlier this year explains that in much more depth.
Share, Bookmark & Discuss This Article
More:
Keep Updated: News Via Email | News Via RSS Feed | News Via Twitter
See more stories like this in the Members Library! Check out the Google: SEO, Search Features: General sections of the Members Library where this story is filed. Members also get access to exclusive video content, a members-only weekly & monthly newsletter, plus more. Check out all the benefits!
TOP STORIES
SEARCH NEWS BRIEFS
FEATURES & ANALYSIS
RECENT COMMENTS
Stay on top of all the search news with our daily summary, the SearchCap newsletter. View a sample ›
Search Engine Land produces SMX, the Search Marketing Expo conference series. SMX events deliver the most comprehensive educational and networking experiences - whether you're just starting in search marketing or you're a seasoned expert.
SMX Web Site » | SMX Difference » | SMX News »
Join us at an upcoming SMX event:
Learn more about search marketing with our free online webcasts and webinars from our sister site, Search Marketing Now. Upcoming online events include:
Featured sites from our Blogroll
Become a premium member today and receive:
TeeHee,
You have discovered something very interesting, I have a theory on this and I was wondering if Google has said anything at all else about missing cached pages (other than it is a bug and they are aware of it)
Special thanks to Vanessa Fox and Evan Roseman for digging into this. Danny, you’re correct that the url removal tool has nothing to do with this and you’re correct that it’s occurring for some documents that we are refreshing very quickly. I don’t think it’s that hard to show the cached links, but I’m not sure of the timeframe (e.g. few days vs. a few weeks).