<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Search Engine Land &#187; Legal: Crawling &amp; Indexing</title>
	<atom:link href="http://searchengineland.com/library/legal/legal-crawling-indexing/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: News On Search Engines, Search Engine Optimization (SEO) &#38; Search Engine Marketing (SEM)</description>
	<lastBuildDate>Fri, 25 May 2012 23:34:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Banned Holiday Deal Sites Return To Bing</title>
		<link>http://searchengineland.com/banned-holiday-deal-sites-return-to-bing-104479</link>
		<comments>http://searchengineland.com/banned-holiday-deal-sites-return-to-bing-104479#comments</comments>
		<pubDate>Mon, 12 Dec 2011 23:22:11 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: Antitrust]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Microsoft: Bing]]></category>
		<category><![CDATA[Microsoft: Bing SEO]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=104479</guid>
		<description><![CDATA[Holiday deal sites that Bing banned from its search listings just before the busy shopping days of Black Friday and Cyber Monday have now been allowed to return. They include a site run by the group that created the entire Cyber Monday concept. Banned: Not Your Usual Suspects We reported previously how the sites had [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-103297" style="margin-top: 4px; margin-bottom: 4px; margin-left: 14px; margin-right: 14px;" title="bing-deals-ban-featured" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/bing-deals-ban-featured.jpg" alt="" width="247" height="117" />Holiday deal sites that Bing banned from its search listings just before the busy shopping days of Black Friday and Cyber Monday have now been allowed to return. They include a site run by the group that created the entire Cyber Monday concept.</p>
<h2>Banned: Not Your Usual Suspects</h2>
<p>We <a href="http://searchengineland.com/bing-bans-holiday-deals-sites-102856">reported previously</a> how the sites had gone missing, something Bing described as keeping with long-standing policies against &#8220;thin&#8221; content but which came <a href="http://marketingland.com/holiday-deals-sites-confirm-bing-dropped-them-just-before-black-friday-cyber-monday-332">out-of-the-blue</a> to some site owners.</p>
<p>The banned sites included <a href="http://www.cybermonday.com/">CyberMonday.com</a>, which is run by Shop.org, the group that created the entire Cyber Monday concept.</p>
<p>Also banned were <a href="http://www.blackfriday2011.com/">BlackFriday2011.com</a> and <a href="http://www.cybermonday2011.com/">CyberMonday2011.com</a>, which provided content to Yahoo Deals. Yahoo is one of Bing&#8217;s strategic partners. The Yahoo site itself, ironically, wasn&#8217;t banned.</p>
<p>Several of these sites had top listings were not deemed to be spam by Google, which has been on a campaign to remove content not deemed to have enough substance to it. Indeed, some of them had top listings with Google.</p>
<p>Now, despite being deemed to thin, several of these sites have returned to Bing. CyberMonday.com is back, as is BlackFriday2011.com. CyberMonday2011.com remains banned.</p>
<h2>Algorithm Now Deems Them No Longer Questionable</h2>
<p>Why the change? Bing sent this statement:</p>
<blockquote>Black Friday and Cyber Monday are notorious times for spammers, and during this time Bing&#8217;s spam classification algorithm picked up this spam pattern and heightened its criteria.</p>
<p>Bing took proactive action to protect our users by removing questionable domains.</p>
<p>In an effort to protect our users some questionable domains may have been demoted or removed that some may consider legitimate sites.</p>
<p>We have since revised our algorithm which has led to some previously blocked sites returning to the index.</blockquote>
<h2>Really An Algorithm Change?</h2>
<p>A search engine&#8217;s algorithm, an automated computer process, is like a recipe that measures many <a href="http://searchengineland.com/seotable">different factors</a> to determine what content should rank well, as well as what to ban (watch our video <a href="http://searchengineland.com/guide/what-is-seo">here</a> about this).</p>
<p>This algorithm change at Bing didn&#8217;t seem to weigh many factors. Instead, the change simply seemed to be that Bing banned any site that had the words &#8220;black friday&#8221; or &#8220;cyber monday&#8221; in their domain name, rather than an analysis of the content itself, to decide if it was too &#8220;thin&#8221; to retain.</p>
<p>Bing, however, did reconfirm to us that an algorithm was involved.</p>
<h2>Bing Denies Favoring Itself</h2>
<p>Several of those dropped have wondered if it was all part of an attempt by Bing simply to drive more traffic to a special <a href="http://www.bing.com/shopping/black-friday-and-cyber-monday/r/329?crea=blkfricmleft&amp;publ=ia&amp;qpvt=cyber+monday&amp;FORM=HURE">section</a> of Bing Shopping that was ranking (and still does) in the first page of listing for searches at Bing for <a href="http://www.bing.com/search?q=black+friday">black friday</a> and <a href="http://www.bing.com/search?q=cyber+monday">cyber monday</a>, like this:</p>
<p><img class="aligncenter size-full wp-image-104488" title="find bing deals" src="http://searchengineland.com/figz/wp-content/seloads/2011/12/find-bing-deals.jpg" alt="" width="426" height="138" /></p>
<p>Bing denied the move was designed to help itself:</p>
<blockquote>Bing did not replace algorithmic results with its own sites.</p>
<p>Since its inception Bing has offered verticalized content like shopping and travel if it helps address a user&#8217;s intent.</p>
<p>These Instant Answers are available for a variety of topics and scenarios, such as shopping, checking stock prices, or stats for athletes.</blockquote>
<h2>More Information</h2>
<p>For more background on the removals, including how it might help Google with accusations that it has banned shopping sites simply to help itself, see our past articles below:</p>
<ul>
<li><a href="http://searchengineland.com/bing-bans-holiday-deals-sites-102856">Bing Bans Holiday Deals Sites, Including One By Group That Created Cyber Monday</a></li>
<li><a href="http://marketingland.com/holiday-deals-sites-confirm-bing-dropped-them-just-before-black-friday-cyber-monday-332">Holiday Deals Sites Confirm: Bing Dropped Them Just Before Black Friday &amp; Cyber Monday</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/banned-holiday-deal-sites-return-to-bing-104479/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bing Bans Holiday Deals Sites, Including One By Group That Created Cyber Monday</title>
		<link>http://searchengineland.com/bing-bans-holiday-deals-sites-102856</link>
		<comments>http://searchengineland.com/bing-bans-holiday-deals-sites-102856#comments</comments>
		<pubDate>Wed, 30 Nov 2011 14:00:18 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: Antitrust]]></category>
		<category><![CDATA[Google: Panda Update]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Microsoft: Bing]]></category>
		<category><![CDATA[Microsoft: Bing SEO]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=102856</guid>
		<description><![CDATA[Ironically, the trade group that created the Cyber Monday concept had its own CyberMonday.com site lost in cyberspace this week. Lost, that is, if you tried to use Bing to find it. The site, along with some Black Friday deals sites, have been deliberately dropped from Bing as being too &#8220;thin&#8221; in content. Bing&#8217;s Version [...]]]></description>
			<content:encoded><![CDATA[<p>Ironically, the trade group that created the Cyber Monday concept had its own CyberMonday.com site lost in cyberspace this week. Lost, that is, if you tried to use Bing to find it. The site, along with some Black Friday deals sites, have been deliberately dropped from Bing as being too &#8220;thin&#8221; in content.</p>
<h2>Bing&#8217;s Version Of The Google Panda Update</h2>
<p>If &#8220;thin is bad&#8221; sounds familiar, that&#8217;s because Google kicked off this trend earlier this year. The <a href="http://searchengineland.com/google-forecloses-on-content-farms-with-farmer-algorithm-update-66071">Google Panda Update</a> was a change in how Google ranked web pages, designed to penalize pages deemed to be content-light.</p>
<p>Google&#8217;s change didn&#8217;t remove web pages. It just meant that pages deemed thin had less chance of ranking well.</p>
<p>In contrast, Bing&#8217;s change is more dramatic. It has completely removed some sites deemed thin from its search engine.</p>
<h2>CyberMonday.com: Loved By Google, Banned By Bing</h2>
<p>Consider the case of <a href="http://www.cybermonday.com/">CyberMonday.com</a>. The site is run by <a href="http://www.shop.org/">Shop.org</a>, the digital division of the <a href="http://www.nrf.com/">National Retail Federation</a>. Microsoft &#8212; which owns Bing &#8212; is one of the Shop.org <a href="http://www.shop.org/web/guest/about/membercompanies">member companies</a>, along with many other major brands.</p>
<p>Shop.org <a href="http://www.shop.org/cybermonday#cyber_made_up">coined the term</a> Cyber Monday <a href="http://www.shop.org/c/journal_articles/view_article_content?groupId=1&amp;articleId=623&amp;version=1.0">in November 2005</a>, as even covered by <a href="http://www.businessweek.com/bwdaily/dnflash/nov2005/nf20051129_9946_db016.htm">BusinessWeek</a> by then. The following year, Shop.org <a href="http://www.shop.org/c/journal_articles/view_article_content?groupId=1&amp;articleId=605&amp;version=1.0">created</a> the CyberMonday.com web site to go along with the campaign.</p>
<p>On Google, CyberMonday.com currently ranks in the top results for a search on <a href="http://www.google.com/search?q=cyber+monday">cyber monday</a>:</p>
<p><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/cybermonday-google-rank1.png"><img class="aligncenter size-large wp-image-102862" title="cyber monday search on google" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/cybermonday-google-rank1-600x546.png" alt="" width="540" height="491" /></a></p>
<p>On Bing, it <a href="http://www.bing.com/search?q=cyber+monday">doesn&#8217;t rank</a> in the top results. Noteworthy, Bing&#8217;s own special <a href="http://www.bing.com/shopping/black-friday-and-cyber-monday/r/329">page</a> to promote Black Friday and Cyber Monday deals available through Bing Shopping gets top billing:</p>
<p><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/cybermonday-bing-rank.png"><img class="aligncenter size-large wp-image-102868" title="cyber monday search on bing" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/cybermonday-bing-rank-600x544.png" alt="" width="540" height="490" /></a></p>
<h2>Bing Kills Thin Sites, Doesn&#8217;t Just Bury Them</h2>
<p>CyberMonday.com doesn&#8217;t rank because it has no pages listed within Bing at all:</p>
<p style="text-align: center;"><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/bing-ban-cyber.png"><img class="aligncenter size-large wp-image-102870" title="site:cybermonday.com bringing up no results on bing" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/bing-ban-cyber-600x130.png" alt="" width="540" height="117" /></a></p>
<p>CyberMonday.com isn&#8217;t the only site impacted this way. A number of Black Friday-oriented sites have also gone missing, including:</p>
<ul>
<li>blackfriday.info</li>
<li>theblackfriday.com</li>
<li>black-friday.net</li>
<li>2011blackfridayads.com</li>
<li>blackfriday.com</li>
<li>blackfriday2011.com</li>
</ul>
<h2>Bing: Dropping Thin Sites Isn&#8217;t New</h2>
<p>Why were they dropped? Bing told me:</p>
<blockquote>One of our main goals at Bing is to deliver quality results for our users. Consistent with our <a href="http://www.bing.com/community/site_blogs/b/webmaster/default.aspx">guidance to site owners</a>, websites that seem to rely mostly on affiliate content or that offer only thin content often don’t deliver the value searchers are looking for and may be demoted or removed from our index. This is something we continually refine and look at closely throughout the year.</blockquote>
<p>Is this new? Does it impact more than Black Friday and Cyber Monday sites? I was told:</p>
<blockquote>It&#8217;s nothing new, and follows guidance we&#8217;ve given on our webmaster site. We don&#8217;t have any specifics to share.</blockquote>
<h2>But It Sure Feels New</h2>
<p>Despite what Bing says, this sure feels new. Bing&#8217;s statement says this is all consistent with its guidelines, with the link it emailed as part of that statement supposedly pointing to that guidance. But I found nothing there that gave the impression this was a long-standing policy.</p>
<p>The link leads to the Bing Webmaster Center blog, not to its publisher guidelines. Going through the posts myself, looking for any news that thin content gets you banned, the best I could find was <a href="http://www.bing.com/community/site_blogs/b/webmaster/archive/2011/08/19/18-things-you-need-to-know-about-seo.aspx">this post</a> in August that said:</p>
<blockquote>The worst approach leaves you to seem &#8220;thin&#8221; in your approach.  This essentially means you aren&#8217;t really providing content of value.  An example of this would be a website aggregating content from multiple sources on one page.  Such an approach amounts to little more than a links page related to the query entered.</blockquote>
<p>Thin&#8217;s not good, clearly, but that article doesn&#8217;t say that thin pages will be banned.</p>
<p>Heading over to actual <a href="http://onlinehelp.microsoft.com/en-us/bing/gg132923.aspx">publisher guidance area</a>, the <a href="http://onlinehelp.microsoft.com/en-us/bing/hh204434.aspx">Guidelines For Successful Indexing</a> doesn&#8217;t list thin content as a reason a site might be removed from Bing. Neither does the page about <a href="http://onlinehelp.microsoft.com/en-us/bing/hh204494.aspx">requesting reinclusion</a>.</p>
<p>Instead, I had to go over to another <a href="http://onlinehelp.microsoft.com/en-us/bing/ff808535.aspx">section</a> at Bing designed for searchers &#8212; not for publishers &#8212; to find this <a href="http://onlinehelp.microsoft.com/en-us/bing/ff808447.aspx">definition</a> of spam that seems to include thin pages as something that might be removed:</p>
<blockquote>Some pages captured in our index turn out to be pages of little or no value to users and may also have characteristics that artificially manipulate the way search and advertising systems work in order to distort their relevance relative to pages that offer more relevant information. Some of these pages include only advertisements and/or links to other websites that contain mostly ads, and no or only superficial content relevant to the subject of the search. To improve the search experience for consumers and deliver more relevant content, we might remove such pages from the index altogether, or adjust our algorithms to prioritize more useful and relevant pages in result sets.</blockquote>
<p>I&#8217;m not sure how long this has been there; it&#8217;s certainly not in an area publishers would easily spot. It doesn&#8217;t seem to guarantee that sites that are thin will be completely blown out of Bing, in the way that these sites have been.</p>
<h2>And Feels Targeted To Holiday Deal Sites</h2>
<p>Despite what Bing says, this change does seem new &#8212; and specifically aimed at sites featuring holiday shopping deal offers.</p>
<p>The person who tipped us to these missing sites last week certainly thought this was a recent change (with the Thanksgiving break, it took until now to get a statement back from Bing).</p>
<p><a href="http://www.webmasterworld.com/msn_microsoft_search/4389293.htm">Over at WebmasterWorld</a>, <a href="http://www.webmasterworld.com/profilev4.cgi?action=view&amp;member=robdwoods">Rob Woods</a> noted the same in a discussion:</p>
<blockquote>Bing just nuked all but one of the top black friday ad sites from the index. Not just penalized but completely removed all the top ranking domains entirely (except one).</p>
<p>Bing also just nuked any site with &#8220;Cyber Monday&#8221; in the URL including the .com which is the official site of the National Retail Federation which created Cyber Monday in the first place. Having launched their own black friday / cyber monday affiliate site it looks like they are trying to eliminate any competition from the SERPs. Virtually any site with either keyword in the domain has been completely removed from Bing&#8217;s index.</blockquote>
<h2>And Is The Opposite Of What Bing Said About Panda</h2>
<p>Another oddity is that earlier this year, when asked about Google&#8217;s Panda effort, Bing suggested that it wasn&#8217;t relevant for Google to be applying sitewide penalties. <a href="http://www.stonetemple.com/articles/interview-stefan-weitz.shtml">In an interview with Eric Enge</a>, Bing Director Stefan Weitz said:</p>
<blockquote>Google&#8217;s Panda Update was an interesting event. I saw reports recently on DemandMedia showing they were down 40% on their traffic. What this speaks to is the necessity to look at page level quality. I think one of the things that started the work on Panda was the JC Penney paid link issue which called into question the quality of PageRank.</p>
<p>Google initially responded by blocking the entire JC Penney domain for a few days. We thought that hurt the users because we did the same thing in a test. We blocked all JC Penney internally and asked our human ranking systems &#8220;does this result for the search phrase &#8220;comforters&#8221; look better or worse after this change?&#8221; Everyone said it looked worse because they expected to see JC Penney there.</p>
<p>What it told us was there are different ways to classify quality of pages. We have page level classifiers that look at every page we index that attempt to discern a quality score. It looks at things like reading levels, number of ads versus content, length of words, length of page, all those standard things, and some not so standard things as well.</blockquote>
<p>In short, Bing believes it&#8217;s important that each page be individually assessed for its relevancy and further says it has the technology to do this. However, in the case of these sites, none of this is happening. They were just completely wiped out.</p>
<h2>Are These Sites That Bad?</h2>
<p>Of course, I&#8217;m not suggesting that Bing should keep crappy sites in its search engine. It&#8217;s just pretty strange to discover that the site run by the group that started the whole Cyber Monday idea was among those to get the axe, a group supported by Microsoft itself, a site that includes the Microsoft Store among the listings.</p>
<p>Sure, the Blekko search engine that&#8217;s been <a href="http://searchengineland.com/blekko-slashes-more-spam-with-zorro-update-82620">campaigning</a> against thin content doesn&#8217;t list any pages from it, either. But Google &#8211; which also has guidelines against thin content &#8212; deems it worthy of a top ranking.</p>
<p>Similarly, these three sites are all deemed good enough the top results for <a href="http://www.google.com/search?q=black+friday">black friday</a> on Google:</p>
<p style="text-align: center;"><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/google-ranking-black-friday-sites.png"><img class="aligncenter size-large wp-image-102871" title="google ranking black friday sites" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/google-ranking-black-friday-sites-600x850.png" alt="" width="540" height="765" /></a></p>
<p style="text-align: left;">The same sites are completely banned by Bing. They have no pages at all listed, and so they certainly don&#8217;t rank for <a href="http://www.bing.com/search?q=black+friday">black friday</a>:</p>
<p style="text-align: center;"><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/bing-black-friday-search.png"><img class="aligncenter size-large wp-image-102872" title="bing black friday search" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/bing-black-friday-search-600x855.png" alt="" width="540" height="770" /></a></p>
<p style="text-align: left;">Those two arrows? One points to the aforementioned page run by Bing itself offering Cyber Monday and Black Friday specials through Bing Shopping.</p>
<p style="text-align: left;">The other one, higher up, highlights how <a href="http://bfads.net/">BFAds.net</a> managed to survive the culling, even though I&#8217;m hard pressed to see the difference between this <a href="http://bfads.net/Cyber-Monday-Sales-Already-Starting">post</a> on that site versus what Cyber Monday&#8217;s home page offered.</p>
<p style="text-align: left;">Can you spot the difference?</p>
<p style="text-align: center;"><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/comparison-of-two-sites.png"><img class="aligncenter size-large wp-image-102873" title="comparison of two sites" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/comparison-of-two-sites-600x341.png" alt="" width="540" height="307" /></a></p>
<p style="text-align: left;">BFAds.net is on the left; CyberMonday.com is on the right.</p>
<h2 style="text-align: left;">Thin Is Bad; Thinner Than Thin For Yahoo Is OK?</h2>
<p style="text-align: left;">In another inconsistency, consider this search for <a href="http://www.bing.com/search?q=black+friday+ads">black friday ads</a> at Bing:</p>
<p style="text-align: center;"><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/black-friday-ads-search-at-bing.png"><img class="aligncenter size-large wp-image-102874" title="black friday ads search at bing" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/black-friday-ads-search-at-bing-600x714.png" alt="" width="540" height="643" /></a></p>
<p style="text-align: left;">The arrow points to a <a href="http://deals.yahoo.com/black-friday-cyber-monday">page</a> at Yahoo Deals with Black Friday ads:</p>
<p style="text-align: center;"><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/yahoo-deals.png"><img class="aligncenter size-large wp-image-102875" title="yahoo deals" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/yahoo-deals-600x984.png" alt="" width="540" height="886" /></a></p>
<p style="text-align: left;">See those ads that the arrow points to? Those come from <a href="http://www.blackfriday2011.com/">BlackFriday2011.com</a>. Click on any of them, and you go to a page at that site &#8212; which itself is banned by Bing.</p>
<p style="text-align: left;">Now that&#8217;s irony. BlackFriday2011.com is deemed to thin for Bing, but a page that points to all that thin content? That&#8217;s apparently thick enough to keep.</p>
<h2 style="text-align: left;">Blowback On The Antitrust Front</h2>
<p style="text-align: left;">Google is one company that will getting a nice holiday bump from the removals. Google has been <a href="http://searchengineland.com/googleopoly-the-definitive-guide-to-antitrust-investigations-against-google-82906">under fire</a> that it manipulates its results to promoting its own content and keep competitors out.</p>
<p style="text-align: left;">Microsoft <a href="http://searchengineland.com/admitting-role-in-google-anti-trust-complaints-microsoft-complains-of-google-lock-in-37009">has been</a> one of the companies funding efforts to fight Google on this front, such as through <a href="http://www.fairsearch.org/">FairSearch</a>. But now, Bing appears to be doing exactly what Google&#8217;s accused of. Sites have been dropped that potentially compete with the special Bing Shopping page that&#8217;s ranking well.</p>
<p style="text-align: left;">Like Google, Bing has a dedicated shopping search engine. <a href="http://searchengineland.com/does-the-fairsearch-white-paper-on-google-being-anticompetitive-hold-up-96567">Like Google, searchers are automatically suggested</a> to use this for relevant queries, such as this unit that&#8217;s inserted into a search for digital cameras, showing shopping results:</p>
<p style="text-align: left;"><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/shop-unit.png"><img class="aligncenter size-full wp-image-102954" title="shop unit" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/shop-unit.png" alt="" width="563" height="186" /></a></p>
<p style="text-align: left;">In contrast, this unit for the special Bing holiday shopping page isn&#8217;t showing shopping results:</p>
<p style="text-align: left;"><a href="http://searchengineland.com/figz/wp-content/seloads/2011/11/specials.jpg"><img class="aligncenter size-full wp-image-102955" title="specials" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/specials.jpg" alt="" width="434" height="132" /></a></p>
<p style="text-align: left;">Instead, it&#8217;s designed to drive people to a particular page in the shopping area. It also carries a special tracking code. Both things are unusual, red flags that this is something being done to promote Bing above and beyond what its regular relevancy algorithms would do.</p>
<p style="text-align: left;">As for the removals, <a href="http://www.foundem.co.uk/">Foundem</a> &#8212; another member of FairSearch &#8212; has lead the charges that it and others have been banned from Google not for quality issues but instead for anti-competitive reasons.</p>
<p style="text-align: left;">Google&#8217;s Panda Update, Foundem recently argued in a white <a href="http://www.foundem.co.uk/Google_Conflict_of_Interest.pdf">paper</a> (PDF), was just a continuation of this process:</p>
<blockquote>Panda marks a significant escalation in Google’s undeclared war on its vertical-search rivals. So far, few have made a connection between Panda and the various antitrust Investigations into Google. But Panda isn&#8217;t just relevant to these investigations; it is central to them.  Despite being widely touted as an attack on content-farms—which are almost the polar opposite of vertical search services—Panda also marks an aggressive escalation of Google&#8217;s vertical-search targeted, “lack of original content” penalties&#8230;.</p>
<p>With Panda, Google is now targeting many established vertical search brands, as well as emerging ones. Still mindful that it cannot openly penalise well-known competitors, Panda&#8217;s algorithmic demotions are more subtle than their predecessors: although affected sites do not completely disappear from Google&#8217;s search results, they are systematically demoted to a point beyond the reach of most users, and so receive little or no traffic from this vital channel.</blockquote>
<p>If Google&#8217;s Panda Update is bad for merely demoting sites for lacking original content, it&#8217;s hard to see how Bing completely removing sites for exactly the same reason is going to help those arguing this aspect of the antitrust investigations against Google.</p>
<p><strong>Postscript:</strong> I&#8217;m waiting to hear back from Shop.org. They&#8217;re checking on the situation, having heard about it only today, apparently after this article appeared. My guess is that they didn&#8217;t even notice they were dropped from Bing, probably because Google sends so much traffic to them that not being in Bing didn&#8217;t register.</p>
<p><strong>Postscript 2: </strong>See our follow-up post at Marketing Land, <a href="http://marketingland.com/holiday-deals-sites-confirm-bing-dropped-them-just-before-black-friday-cyber-monday-332">Holiday Deals Sites Confirm: Bing Dropped Them Just Before Black Friday &amp; Cyber Monday</a>.</p>
<p><strong>Postscript 3:</strong> See our further follow-up story, <a href="http://searchengineland.com/banned-holiday-deal-sites-return-to-bing-104479">Banned Holiday Deal Sites Return To Bing</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/bing-bans-holiday-deals-sites-102856/feed</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Proposed UK Law Would Immunize Search Engines Against Copyright Claims</title>
		<link>http://searchengineland.com/proposed-uk-law-would-immunize-search-engines-against-copyright-claims-33336</link>
		<comments>http://searchengineland.com/proposed-uk-law-would-immunize-search-engines-against-copyright-claims-33336#comments</comments>
		<pubDate>Tue, 12 Jan 2010 14:50:04 +0000</pubDate>
		<dc:creator>Greg Sterling</dc:creator>
				<category><![CDATA[Google: News]]></category>
		<category><![CDATA[Google: Outside US]]></category>
		<category><![CDATA[Google: Web Search]]></category>
		<category><![CDATA[Legal: Copyright]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=33336</guid>
		<description><![CDATA[There&#8217;s been enormous debate in the US over the years about whether Google and other search engines violate copyright laws by indexing content of various sorts. The Google book scanning litigation was a copyright lawsuit. And the newspaper industry has repeatedly accused Google of building its news site on the back of their copyrighted material. [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s been enormous debate in the US over the years about whether Google and other search engines violate copyright laws by indexing content of various sorts. The Google book scanning litigation was a copyright lawsuit. And the newspaper industry has repeatedly accused Google of building its news site on the back of their copyrighted material. News organizations AP and <a href="http://www.reuters.com/article/idUSN0728115420070408">AFP</a> both sued Google several years ago for copyright violations. (The deal that settled the AP case is <a href="http://searchengineland.com/wheres-ap-in-google-news-33164">now up for renewal</a>.)</p>
<p>Now, according to <a href="http://paidcontent.org/article/419-digital-economy-bill-google-could-be-granted-copyright-immunity/">PaidContent</a>, a proposed amendment to a pending UK law (&#8220;<a href="http://services.parliament.uk/bills/2009-10/digitaleconomy.html">Digital Economy Bill</a>&#8220;) would permit search engines to index any or all of the content on a &#8220;publicly accessible website&#8221; through a &#8220;presumed . . . standing and non-exclusive license.&#8221; Accordingly, indexing of third party content, however extensive, would not be liable for copyright infringement. This amendment (whose adoption is uncertain) is from self-described <a href="http://lordlucas.blogspot.com/">Libertarian-Conservative Lord Lucas</a>. But it&#8217;s a fairly radical provision.</p>
<p>It&#8217;s unclear how using robots.txt would play here in terms of burdening the rights conferred by the amendment, which are much broader than &#8220;<a href="http://www.copyright.gov/fls/fl102.html">fair use</a>&#8221; in US copyright law.</p>
<p>If the law and the Lucas amendment were to be adopted, imagine a scenario where a raft of new &#8220;search engines&#8221; appeared in the UK and simply cloned large portions of &#8220;publicly accessible&#8221; content sites or copied articles wholesale. This might create havoc for publishers, legitimate search engines and end users. It&#8217;s unlikely to come to that but such scenarios are clearly implicated by the wide-ranging immunity of the amendment.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/proposed-uk-law-would-immunize-search-engines-against-copyright-claims-33336/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Head-To-Head: ACAP Versus Robots.txt For Controlling Search Engines</title>
		<link>http://searchengineland.com/head-to-head-acap-versus-robots-txt-for-controlling-search-engines-30816</link>
		<comments>http://searchengineland.com/head-to-head-acap-versus-robots-txt-for-controlling-search-engines-30816#comments</comments>
		<pubDate>Tue, 01 Dec 2009 13:08:20 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: News]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[SEO: Blocking Spiders]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=30816</guid>
		<description><![CDATA[In the battle between search engines and some mainstream news publishers, ACAP has been lurking for several years. ACAP &#8212; the Automated Content Access Protocol &#8212; has constantly been positioned by some news executives as a cornerstone to reestablishing the control they feel has been lost over their content. However, the reality is that publishers [...]]]></description>
			<content:encoded><![CDATA[<p>In the battle between search engines and some mainstream news publishers, <a href="http://www.the-acap.org/">ACAP</a> has been lurking for several years.  ACAP &#8212; the Automated Content Access Protocol &#8212; has constantly been positioned  by some news executives as a cornerstone to reestablishing the control they feel has  been lost over their content. However, the reality is that publishers have more control even without ACAP than is commonly believed by some. In addition, ACAP  currently provides no &#8220;DRM&#8221; or licensing mechanisms over news content. But the  system does offer some ideas well worth considering. Below, a look at how it  measures up against the current systems for controlling search engines.</p>
<p>ACAP started development in 2006 and formally launched a year later with  version 1.0 (see <a href="../../acap-launches-robotstxt-20-for-blocking-search-engines-12802"> ACAP Launches, Robots.txt 2.0 For Blocking Search Engines?</a>). This year, in  October, ACAP 1.1 was released and has been installed by over 1,250 publishers  worldwide, says the organization, which is backed by the European Publishers  Council, the World Association of Newspapers and the International Publishers  Association.</p>
<p>If that sounds pretty impressive, hang on. I&#8217;ll provide a reality check in a  moment. But first, let&#8217;s pump ACAP up a bit more. Remember back in July, when  the <a href="http://www.epceurope.org/hamburgdeclaration/">Hamburg Declaration</a> was signed by about 150 European publishers? The short declaration basically said  that intellectual property protection needs to be increased on the internet, in  order to protect high-quality journalism.</p>
<p><strong>ACAP: Save Our Content!</strong></p>
<p>Enter ACAP, as a lynchpin to achieving the Hamburg Declaration&#8217;s dream. From  the official <a href="http://www.epceurope.org/presscentre/archive/International_publishers_demand_new_intellectual_property_rights.shtml"> release</a> put out by the European Publishers Council, which organized the  declaration:</p>
<blockquote>We need search engines to recognize ACAP as a step towards acknowledging that  content providers have the right to decide what happens to their content and on  what terms. The European Commission and other legislators call on our industry  constantly to come up with solutions &#8211; here we have one and we call upon the  regulators to back it up.</blockquote>
<p>That quote is from Gavin O&#8217;Reilly, president of the World Association of  Newspapers and News Publishers, Group CEO of Independent News &amp; Media and  chairman of ACAP.</p>
<p>To me, it reads like it&#8217;s the Wild West out on the internet.  That search engines are doing whatever they want with content, with publishers  having no control over what happens. ACAP would bring rules to the search  engines, and those rules would have the force of law if some governmental bodies  would force them on the search engines.</p>
<p><strong>The Wild West Is Actually Tame</strong></p>
<p>The reality is that the search engines do follow rules, ones they&#8217;ve created  and enhanced over the past 15 years based on feedback from the entire web  community (rather than from a select group of largely disgruntled news  publishers). Moreover, in all this time there have been relatively few lawsuits  over how search engines interact with news content. Only one really stands out  in my mind, the case won by Belgian newspapers over being included in Google News.</p>
<p>It was an unnecessary lawsuit. The papers could have stayed out of Google News using  existing controls. In fact, despite &#8220;winning&#8221; the lawsuit, the papers eventually  sought reinclusion in Google News using existing standards (see <a href="../../belgian-papers-back-in-google-begin-using-standards-for-blocking-11128"> Belgian Papers Back In Google; Begin Using Standards For Blocking</a>).</p>
<p><strong>Meet REP: On The Beat For 15 Years</strong></p>
<p>What are those existing standards? Collectively, they&#8217;re called the &#8220;Robots  Exclusion Protocol&#8221; or REP for short. REP is made up of:</p>
<ul>
<li><strong>Robots.txt:</strong> created in 1994 as a way to block content on a    server-wide basis using a single file (the robots.txt file)</li>
<li><strong>Meta Robots Tag:</strong> created in 1996 as a system to block on a    page-by-page basis (see   <a href="../../070305-204850.php">Meta Robots Tag 101:    Blocking Spiders, Cached Pages &amp; More</a> for more about it)</li>
</ul>
<p>The two standards both live at <a href="http://www.robotstxt.org/"> robotstxt.org</a>, but they&#8217;ve never been updated there, nor is there any type  of official group or organization behind REP. Instead, search engines have either <a href="http://searchengineland.com/yahoo-google-microsoft-clarify-robotstxt-support-14125"> unilaterally or collectively expanded</a> what REP can do over the years. They  serve as the de facto bosses of REP, Google in particular. If Google makes a  change, other search engines often mimic it.</p>
<p>I used &#8220;robots.txt&#8221; in the headline of this article mainly because that&#8217;s often used by those who live and  breathe this stuff as a common name for both parts of REP. But I&#8217;ll be sticking with REP for the rest of this article.</p>
<p><strong>Some ACAP In Action</strong></p>
<p>Enough of the preamble and background. Let&#8217;s roll our sleeves up and see how  the two system compare, starting with something easy. How can you block ALL  your pages from ALL search engines using REP? You&#8217;d make a two line robots.txt  file like this:</p>
<blockquote>
<pre>User-agent: *
Disallow: /</pre>
</blockquote>
<p>How would you do it in ACAP? Again, just two lines:</p>
<blockquote>
<pre>ACAP-crawler: *
ACAP-disallow-crawl: /</pre>
</blockquote>
<p>Sounds easy enough to use ACAP, right? Well, no. ACAP, in its quest to  provide as much granularity to publishers as possible, offers what I found to be  a dizzying array of choices. REP explains its parts on two pages. ACAP&#8217;s  implementation guide alone (I&#8217;ll get to links on this later on) is 37 pages  long.</p>
<p>But all that granularity is what publishers need to reassert control, right?  Time for that reality check. Remember those 1,250 publishers? Google News has  something like over 20,000 news publishers that it lists, so relatively few are  using ACAP. ACAP also positions itself as (I&#8217;ve bolded some key parts):</p>
<blockquote>an <strong>open  industry standard</strong> to enable the <strong>providers of all types of content</strong> (including, but not limited to, publishers) to communicate permissions  information (relating to access to and use of that content) in a form that can  be readily recognized and interpreted by a search engine (or any other  intermediary or aggregation service), so that the operator of the service is  enabled systematically <strong>to comply with the individual publisher’s policies</strong>.</blockquote>
<p>Well, anyone with a web site is a publisher, and there are millions of web  sites out there. Hundreds of millions, probably. Virtually no publishers use  ACAP.</p>
<p><strong>Even ACAP Backers Don&#8217;t Use ACAP Option</strong>s</p>
<p>Of course, there&#8217;s no incentive to use ACAP. After all, none of the major  search engines support it, so why would most of these people do so. OK, then let&#8217;s  look at some people with a real incentive to show the control that ACAP offers.  Even if they don&#8217;t yet have that control, they can still use ACAP now to outline what they want to do.</p>
<p>Let&#8217;s start with the ACAP <a href="http://www.independent.ie/robots.txt">file</a> for the Irish  Independent. Don&#8217;t worry if you don&#8217;t understand it, just skim, and I&#8217;ll  explain:</p>
<blockquote>
<pre>##ACAP version=1.0</pre>
<pre># Allow all
User-agent: *
Disallow: /search/
Disallow: /*.ece$
Disallow: /*startindex=
Disallow: /*from=*
Disallow: /*service=Print
Disallow: /*action=Email
Disallow: /*comment_form
Disallow: /*r=RSS</pre>
<pre>Sitemap: http://www.independent.ie/sitemap.xml.gz</pre>
<pre># Changes in Trunk</pre>
<pre>ACAP-crawler: *
ACAP-disallow-crawl: /search/
ACAP-disallow-crawl: /*.ece$
ACAP-disallow-crawl: /*startindex=
ACAP-disallow-crawl: /*from=*
ACAP-disallow-crawl: /*service=Print
ACAP-disallow-crawl: /*action=Email
ACAP-disallow-crawl: /*comment_form
ACAP-disallow-crawl: /*r=RSS</pre>
</blockquote>
<p>OK, see that top part? Those are actually commands using the robots.txt  syntax. They exist because if a search engine doesn&#8217;t understand ACAP, the  robots.txt commands serve as backup. Basically those lines tell all search  engines not to index various things on the site, such as print-only pages.</p>
<p>Now the second part? This is where ACAP gets to shine. It&#8217;s where the Irish  Independent &#8212; which is part of the media group run by ACAP president Gavin  O&#8217;Reilly &#8212; gets to express what they wish search engines would do, if  they&#8217;d only recognize all the new powers that ACAP provides. And what do they  do? EXACTLY the same blocking that they do using robots.txt.</p>
<p>So much for demonstrating the potential power of ACAP.</p>
<p>Well, how about the Wall Street Journal, backed by Rupert Murdoch, who&#8217;s been  on an anti-Google bend of late. Same situation &#8212; the WSJ&#8217;s ACAP <a href="http://online.wsj.com/robots.txt">file</a> is doing nothing more than  what the robots.txt commands show. Actually, it does less. At least the  robots.txt system allows for discovery of a sitemap file (more on this below).</p>
<p>How about the Denver Post? It doesn&#8217;t have an ACAP file, just plain old  regular robots.txt <a href="http://www.denverpost.com/robots.txt">file</a>.  Why&#8217;s that signficant? The CEO of the media company that owns the Denver Post &#8212;  Dean Singleton &#8212; recently suggested he&#8217;d pull some of his content out of Google  (see <a href="../../more-papers-join-murdochs-google-block-party-30464"> Hold On: Are More Papers Really Joining Murdoch’s Google Block Party?</a>).</p>
<p>Singleton is also chairman of the Associated Press, which has been very  anti-Google of late and which also is a backer of ACAP. So if ACAP allows the  expression of control that publishers somehow don&#8217;t currently have, I&#8217;d expect  the Denver Post to be among the poster children along side the Irish Independent  and the Wall Street Journal.</p>
<p>Well, how about the Troy Daily News, which is one of the organizations that  ACAP proudly lists as using its system. What&#8217;s happening with a rank-and-file  publisher. From its ACAP <a href="http://www.tdn-net.com/robots.txt">file</a>:</p>
<blockquote>
<pre>User-agent: *
Disallow: /private.asp
Disallow: /SiteImages/
Crawl-delay: 10
Request-rate: 1/10 # maximum rate is one page every 10 seconds
Visit-time: 0500-0845 # (GMT) only visit between 1:00 AM and 3:45 AM EST</pre>
<pre>#------------------------------------
##ACAP version=1.0
ACAP-crawler: *
ACAP-disallow-crawl: /private.asp
ACAP-disallow-crawl: /SiteImages/
#-----------------------------------------</pre>
</blockquote>
<p>Again, ACAP isn&#8217;t being used to express anything more than what&#8217;s already  indicated in the robot.txt commands (the first section). Again, robots.txt  actually goes beyond, as there&#8217;s support for a &#8220;crawl-delay&#8221; directive that ACAP  doesn&#8217;t have.</p>
<p>That &#8220;request-rate&#8221; and &#8220;visit-time&#8221; telling search engines only  to come by in the early morning hours? Have a chuckle at that. None of the major  search engines recognize those commands. Similarly, visit the Hilton.com  robots.txt <a href="http://hilton.com/robots.txt">file</a> where you&#8217;ll see a  similar but totally unrecognized command: &#8220;Do not visit Hilton.com during the  day!&#8221;</p>
<p><strong>Side-By-Side, REP &amp; ACAP</strong></p>
<p>OK, so even though no one&#8217;s using the special ACAP controls, let&#8217;s at least look at  some of the key features and see how special they supposedly are. The table below lays out  what REP offers against with ACAP.</p>
<p>In parentheses, I&#8217;ve noted the key  commands used in both systems, for the technically inclined. Links lead to more  information, as appropriate. Further below the chart, I&#8217;ve added more  explanations as necessary.</p>
<p>Because REP has been extended by the major search  engines, I&#8217;ve counted some areas as &#8220;Yes&#8221; for support if at least Google  provides an option (given it has the largest marketshare of all). I&#8217;ve also  noted the situation with Bing. As Yahoo search technology is slated to be  acquired by Bing, I didn&#8217;t itemize its control offerings, since these will</p>
<p>For specific technical details on ACAP, see technical documents <a href="http://www.the-acap.org/Documents.aspx">here</a>. The easiest to comprehend is the implementation guide of Oct. 13, 2009. Also see the two crawler communication parts, if you want to dive in further.</p>
<table style="border-collapse: collapse; height: 1358px;" border="1" cellspacing="0" cellpadding="0" width="515" bordercolor="#111111">
<tbody>
<tr>
<td width="33%" height="18" align="center"><strong><span style="font-size: x-small;">Feature</span></strong></td>
<td width="33%" height="18" align="center"><strong><span style="font-size: x-small;">REP</span></strong></td>
<td width="34%" height="18" align="center"><strong><span style="font-size: x-small;">ACAP</span></strong></td>
</tr>
<tr>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Block all search      engines</span></td>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Yes
(User-agent)</span></td>
<td width="34%" height="51" align="center"><span style="font-size: x-small;">Yes
(ACAP-crawler)</span></td>
</tr>
<tr>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Block specific search engines
(for example, block Google but not Bing)</span></td>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Yes
(User-agent)</span></td>
<td width="34%" height="51" align="center"><span style="font-size: x-small;">Yes
(ACAP-crawler)</span></td>
</tr>
<tr>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Block crawling of      all pages</span></td>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Yes
(Disallow)</span></td>
<td width="34%" height="51" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-crawl)</span></td>
</tr>
<tr>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Block crawling of      specific pages</span></td>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Yes
(Disallow)</span></td>
<td width="34%" height="51" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-crawl)</span></td>
</tr>
<tr>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Block crawling of      specific sections of web site</span></td>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Yes
(directory matching)</span></td>
<td width="34%" height="51" align="center"><span style="font-size: x-small;">Yes
(directory matching &amp; named &#8220;resource sets&#8221;)</span></td>
</tr>
<tr>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Block crawling via      pattern matching or &#8220;wildcards&#8221;</span></td>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Yes:     <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=156412"> Google</a> &amp;     <a href="http://help.live.com/Help.aspx?market=en-US&amp;project=WL_Webmasters&amp;querytype=topic&amp;query=WL_WEBMASTERS_CONC_RestrictAccessToSite.htm"> Bing</a>
(*/?)
<strong>No: REP</strong></span></td>
<td width="34%" height="51" align="center"><span style="font-size: x-small;">Yes
(*/?)
</span></td>
</tr>
<tr>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Block indexing
(see note, below)</span></td>
<td width="33%" height="51" align="center"><span style="font-size: x-small;">Yes
(use block crawling commands)</span></td>
<td width="34%" height="51" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-index)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Cloaking
(Indexing content different than what human      visitors see) </span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;"><strong>No
</strong>(Google views cloaking as spam;
Bing     <a href="http://www.bing.com/community/blogs/webmaster/archive/2007/12/04/live-search-and-cloaking-detection.aspx"> frowns upon</a> it but     <a href="http://help.live.com/Help.aspx?market=en-US&amp;project=WL_Webmasters&amp;querytype=topic&amp;query=WL_WEBMASTERS_REF_GuidelinesforSuccessfulIndexing.htm"> doesn&#8217;t ban</a> just for     <a href="http://help.live.com/Help.aspx?market=en-US&amp;project=WL_Webmasters&amp;querytype=topic&amp;query=WL_WEBMASTERS_CONC_VerifyYourSite.htm"> cloaking</a>)</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(must-use-resource)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Block following links
(doesn&#8217;t prevent finding links in other ways)</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes
(nofollow, <strong>meta tag only</strong>)</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-follow)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Block making      cached pages</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes
(noarchive, <strong>meta tag only</strong>)</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-preserve)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Block showing      cached pages</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes
(use block making cached pages command)</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-present)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Block snippets /      descriptions / quotes</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes: Google
(nosnippet, <strong>meta tag only</strong>)
<strong>No: REP &amp; Bing</strong></span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-present-snippet)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Set maximum length      for snippets</span></td>
<td width="33%" height="18" align="center"><strong><span style="font-size: x-small;">No
</span></strong><span style="font-size: x-small;">(At Bing,     <a href="../../microsofts-search-engine-optimization-advice-for-bing-21152"> nopreview meta tag</a> blocks hover preview)</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(max-length)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Set exact snippet to be used</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;"><strong>Partial
</strong>(meta description tag)</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(must-use-resource)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Block thumbnail images</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;"><strong>Partial
</strong>(just block images)</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-present-thumbnail)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Block link to site</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes: Google
(noindex, <strong>meta tag only</strong>)
<strong>No: REP &amp; maybe Bing</strong></span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(ACAP-disallow-present-link)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Prevent format conversion
(say HTML to PDF)</span></td>
<td width="33%" height="18" align="center"><strong><span style="font-size: x-small;">No</span></strong></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(prohibited-modification=format)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Prevent translation</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes: Google
(<a href="http://translate.google.com/support/">notranslate</a>, <strong>meta tag      only</strong>)
<strong>No: REP &amp; probably Bing</strong></span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(prohibited-modification=translate)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Prevent annotations such as ratings</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;"><strong>No</strong></span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(prohibited-modification=annotation)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Prevent framing</span></td>
<td width="33%" height="18" align="center"><strong><span style="font-size: x-small;">No
</span></strong></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(prohibited-context=within-user-frame)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Urgent page      removal</span></td>
<td width="33%" height="18" align="center"><strong><span style="font-size: x-small;">No</span></strong></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(ACAP-request-take-down)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Block specific parts of a page</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;"><strong>No
</strong>(though Yahoo has     <a href="../../yahoo-supports-new-robots-nocontent-tag-to-block-indexing-within-a-page-11120"> robots-nocontent attribute</a>)</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(must-use-resource)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Permitted Places
(indicate places like specific IP addresses or countries where content can      be listed) </span></td>
<td width="33%" height="18" align="center"><strong><span style="font-size: x-small;">No</span></strong></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(permittedcountrylist &amp; others)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Time limits
(such as remove after set number of days)</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes: Google
(<a href="http://googlewebmastercentral.blogspot.com/2008/06/improving-on-robots-exclusion-protocol.html">unavailable_after</a>,      meta tag only)
No: REP &amp; Bing</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(time-limit)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;"> <a href="../../canonical-tag-16537">Canonical Tag</a>
(Indicate &#8220;main URL&#8221; to be used in cases of same      content on multiple URLs)</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes</span></td>
<td width="34%" height="18" align="center"><span style="font-size: x-small;">Yes
(must-use-resource)</span></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;"> <a href="http://www.sitemaps.org/">Sitemaps</a>
(Provide list of all URLs to be crawled)</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes</span></td>
<td width="34%" height="18" align="center"><strong><span style="font-size: x-small;">No</span></strong></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">http x-robots tags
(attach blocking to file headers, not within files)</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;"> <a href="http://googleblog.blogspot.com/2007/07/robots-exclusion-protocol-now-with-even.html"> Yes</a></span></td>
<td width="34%" height="18" align="center"><strong><span style="font-size: x-small;">No</span></strong></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Rich Snippets
(custom descriptions)</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes:     <a href="../../google-search-now-supports-microformats-and-adds-rich-snippets-to-search-results-19055"> Google</a>
(Bing might inherit <a href="http://developer.yahoo.com/searchmonkey/">Yahoo      Search Monkey</a>)</span></td>
<td width="34%" height="18" align="center"><strong><span style="font-size: x-small;">No</span></strong></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Crawl Delay
(slows crawling speed for slow servers)</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes
(Google:     <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=48620"> crawl-rate option</a> in Webmaster Central;
Bing: </span> <a href="http://www.bing.com/community/blogs/webmaster/archive/2008/06/03/robots-exclusion-protocol-joining-together-to-provide-better-documentation.aspx"> <span style="font-size: x-small;">crawl-delay meta tag</span></a></td>
<td width="34%" height="18" align="center"><strong><span style="font-size: x-small;">No</span></strong></td>
</tr>
<tr>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Parameter      Consolidation
(remove tracking that can cause duplicate content issues)</span></td>
<td width="33%" height="18" align="center"><span style="font-size: x-small;">Yes:     <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=147959"> Google</a>
(Yahoo also offers)</span></td>
<td width="34%" height="18" align="center"><strong><span style="font-size: x-small;">No</span></strong></td>
</tr>
</tbody>
</table>
<p>OK, that&#8217;s the big chart. As you can see, there are some things that both  systems provide and some things that are unique to each one. Here&#8217;s my personal  take on the differences:</p>
<p align="center"><strong>Jeers To ACAP!</strong></p>
<p><strong>Block Indexing: </strong>ACAP makes a weird distinction between blocking  crawling (a search engine literally going from page to page automatically) and  indexing (a search engine making a copy of the page, so that it can be added to  a searchable index). For the major search engines, crawling and indexing are one  and the same. I struggle to see an advantage to separating these out.</p>
<p><strong>Cloaking:</strong> Those savvy to search engines know that Google hates  cloaking, which is the act of showing a search engine something different than a  human being would see. It&#8217;s often associated with spam. There are plenty of  cases where people have shown misleading content to a search engine, in hopes of  getting a good ranking. One example is from 1999, when the FTC <a href="http://www.ftc.gov/opa/1999/09/atariz.shtm">took action</a> against a  site that was cloaking content that ranked for &#8220;innocent&#8221; searches like Oklahoma  tornadoes and instead directed them to porn sites. The idea of a publisher  forcing a search engine to allow cloaking would be somewhat similar to a  newspaper being forced to write whatever a subject demanded be written about  them.</p>
<p><strong>Exact Snippet To Be Used: </strong>Similar to cloaking, allowing site owners to  say whatever they want about a page sounds great if you&#8217;re an honest site owner.  When you&#8217;re a search engine that knows how people will mislead, it&#8217;s not so  appealing. In addition, sometimes it&#8217;s helpful to create a description that  shows what someone searches for in context &#8212; and that doesn&#8217;t always happen  with a publisher-defined description.</p>
<p><strong>Annotation Blocking:</strong> It&#8217;s hard to interpret how this would work. Is  Google&#8217;s <a href="../../google-sidewiki-allows-anyone-to-comment-about-any-site-26420"> SideWiki</a> an annotation system, where comments are left alongside a  publisher&#8217;s content but in a separate window? Or does this mean annotations on  Google itself, such as <a href="../../google-searchwiki-launches-15561"> SearchWiki</a> allows? And should publishers be allowed to block people  commenting about their pages on other sites? Does that block places like Yelp  from reviewing businesses, if they link to them? Is Digg a ratings service? This  option is a minefield.</p>
<p align="center"><strong>Kudos To ACAP!</strong></p>
<p><strong>Maximum Snippet Length:</strong> Search engines are quoting more and more  material from pages these days, it seems. The ability to limit how much they can  use seems like a good idea that should be considered.</p>
<p><strong>Meta Tag Only Commands:</strong> A number of controls such as blocking caching  or snippets can&#8217;t be done in a single file. Now, for those using CMS systems,  including free ones like WordPress, it&#8217;s relatively easy to add these codes to  each and every page. But it would be nice to see the search engines add  file-wide support for some of these options in the way that ACAP does.</p>
<p><strong>Prevent Framing:</strong> I hate framing. I&#8217;d love to see a way to tell  automated tools like <a href="../../analysis-which-url-shortening-service-should-you-use-17204"> URL shorteners</a> that they can&#8217;t frame. But with the search engines, framing  is pretty limited. Google does it with images, and you can block images from  being indexed period, which eliminates framing. It does the same with cached  pages, and you can block caching. Plus, it&#8217;s fairly easy for a site owner to  block any type of framing.</p>
<p><strong>Urgent Removal: </strong>If you&#8217;re a site owner, a system to get pages out of  an index in a guaranteed period of time would be very convenient. However, this  is probably better handled through webmaster tools that the search engines  offer, as they allow a site owner to proactively trigger a removal, rather than  waiting for visit from a crawler, which could take days. Ironically, at Google,  they had a system to remove pages quickly. I wrote about it two years ago (see <a href="../../google-releases-improved-content-removal-tools-10989"> Google Releases Improved Content Removal Tools</a>). But the documentation today  is terrible. Little is explained if you&#8217;re not logged in. If you are logged in,  the link for the webmaster version doesn&#8217;t work. The entire feature Google <a href="http://googlewebmastercentral.blogspot.com/2007/04/requesting-removal-of-content-from-our.html"> described</a> in 2007 is gone.</p>
<p><strong>Block Specific Parts Of Page:</strong> Who wants all their navigation being  indexed, along with all the other crud pages often have on them? ACAP allows for  only parts of a page to be indexed. Yahoo <a href="../../yahoo-supports-new-robots-nocontent-tag-to-block-indexing-within-a-page-11120"> already offers this</a>. Why not the others?</p>
<p align="center"><strong>Hmm&#8230;</strong></p>
<p><strong>Permitted Places: </strong>The idea here is that you could allow your story to  be listed in Google UK but not Google France, if you wanted. It might not be a  bad idea, though that&#8217;s not usually the demand I tend to hear. Instead, site  owners often are trying to figure out how to associate their sites with a  particular country (Google <a href="http://googlewebmastercentral.blogspot.com/2007/10/better-geographic-choices-for.html"> has a tool</a> for this).</p>
<p><strong>Time Limit:</strong> You can time restrict when a page should be removed, a  cached copy should be removed and more with ACAP. Google has some support here,  though few use it, the search engine tells me. It also seems unnecessary. It  seems far more efficient for a site owner to simply remove their own content  from the web or block spidering, when ready. In either case, that causes it to  drop from a search engine.</p>
<p align="center"><strong>Jeers To Search Engines!</strong></p>
<p>I think the biggest frustration in compiling this article was knowing that  search engines do offer much control to publishers but finding the right  documentation is hard. At Google, you can block translation, but it was  difficult to find this page in the help pages offered to site owners. Bing <a href="../../microsofts-search-engine-optimization-advice-for-bing-21152"> has a wa</a>y to block previews, but I couldn&#8217;t locate this within its help  center. Google has a blog <a href="http://googlewebmastercentral.blogspot.com/2008/06/improving-on-robots-exclusion-protocol.html"> post</a> saying that bing supports the nosnippets tag. Over at Bing, I couldn&#8217;t  find this documented. FYI, Jane &amp; Robot has a <a href="http://janeandrobot.com/library/managing-robots-access-to-your-website">good guide</a> that can help with those trying to understand all that&#8217;s allowed.</p>
<p align="center"><strong>Cheers To Search Engines!</strong></p>
<p>ACAP has focused on a publisher wishlist of options that often can be done in  other ways. Don&#8217;t want thumbnail images in a search engine? OK, we&#8217;ll make a  command, even though just blocking images would solve that problem.</p>
<p>In contrast, the search engine have added feature that have come from the  outcries of many diverse site owners. Sitemaps, to provide a list of URLs for  indexing. Crawl delay support. Richer snippets. Duplicate content tools, such as  the canonical tag or parameter consolidation. They deserve far more credit than  some news publishers give to them.</p>
<p style="text-align: left;"><strong>The Missing Part: Licensing</strong></p>
<p>Did you catch the biggest option that ACAP does NOT provide? There&#8217;s no  licensing support.</p>
<p>Remember how ACAP&#8217;s O&#8217;Reilly talked about how ACAP was needed to ensure &#8220;that  content providers have the right to decide what happens to their content and on <strong>what terms</strong>.&#8221; ACAP really doesn&#8217;t provide that much more control than  what&#8217;s out there now. It doesn&#8217;t give publishers significantly more &#8220;rights.&#8221; I  mean, how many more &#8220;rights&#8221; can you have when you&#8217;ve got the nuclear option of  fully withdrawing from a search engine at any time?</p>
<p>It&#8217;s the part I bolded that&#8217;s key, the &#8220;what terms&#8221; portion. ACAP is  supposed to somehow support new business models. Part of the idea is that you  might want to license your headlines to one search engine, your thumbnails to  another, and this would all be bundled up in some partnership deal. To quote  from the <a href="http://www.the-acap.org/FAQs.aspx#FAQ16">ACAP FAQ</a>:</p>
<blockquote>Business models are changing, and publishers need a protocol to express  permissions of access and use that is flexible and extensible as new business  models arise. ACAP will be entirely agnostic with respect to business models,  but will ensure that revenues can be distributed appropriately. ACAP presents a  win win for the whole online publishing community with the promise of more high  quality content and more innovation and investment in the online publishing  sector. ACAP is for the large as well as the small and even the individuals. It  will benefit all content providers whether they are working alone or through  publishers. A future without publishers willing and able to invest in high  quality content and get a return on that investment is a future without  high-quality content on the net.</blockquote>
<p>Nothing in the ACAP specs I&#8217;ve gone through provide any type of revenue  distribution mechanism, much less some type of automated handshake between a  publisher and a search engine to verify permissions.</p>
<p><strong>If REP &amp; ACAP Files Could Talk</strong></p>
<p>To illustrate this better, here&#8217;s a &#8220;real world&#8221; conversation of how ACAP  supposedly works. I shared this recently with others on the Read 2.0 mailing  list that I&#8217;m part of during a discussion, playing off some other conversation scenes that <a href="http://everybodyslibraries.com/">John Mark Ockerbloom</a> had started. Several people said they found it helpful. Perhaps you will, too.</p>
<blockquote>GOOGLE:
Hi! I&#8217;m Google. Can you tell me if I can crawl your site?</p>
<p>PUBLISHER:
Sure, but I might have some restrictions over what you can do.</p>
<p>GOOGLE:
That&#8217;s cool. Just use a meta robots tag on particular pages to give me specific commands.</p>
<p>PUBLISHER:
Well, on this page, I don&#8217;t want you to show a cached copy.</p>
<p>GOOGLE:
Awesome, use the noarchive command. Done. What&#8217;s next?</p>
<p>PUBLISHER:
On this page, you must always show the description I want shown for it.</p>
<p>GOOGLE:
Use the meta description tag. We&#8217;ll consider that, but we can&#8217;t promise.</p>
<p>PUBLISHER:
Dammit. You just want to rule the world.</p>
<p>GOOGLE:
Look, we build description that are related to what someone searched for, dynamically. So if we find a page on your site, in response to a particular keyword, sometimes it makes sense to &#8220;snip&#8221; a description that contains that term from your page, so they immediately understand why your page is relevant to your search. And click to view it. That&#8217;s why we call them snippets.</p>
<p>PUBLISHER:
Dammit. Do what I want. You&#8217;re not the boss of me.</p>
<p>GOOGLE:
Well, we also get people who would say they have children&#8217;s games when instead, they have adult games &#8212; like porn. Seriously, true story. Plus, we&#8217;re the boss of us. I mean, is it OK if we declare that you must review us in the way we want in your publication.</p>
<p>PUBLISHER:
Let&#8217;s move on. On this page, I don&#8217;t want any images to be used.</p>
<p>GOOGLE:
Block them with robots.txt. Done.</p>
<p>PUBLISHER:
This article, I only want you to list it for 30 days.</p>
<p>GOOGLE:
Pull it down after 30 days. Or move the full article to a new location, and leave a summary page up, if you want remnant traffic. Or block it. Or use the unavailable-after meta tag.</p>
<p>PUBLISHER:
I only want you to list this content if I have a paid partnership with you. My ACAP file will declare that to you.</p>
<p>GOOGLE:
You have a paid partnership with us?</p>
<p>PUBLISHER:
Well, not yet. But Murdoch&#8217;s promising us that will come.</p>
<p>GOOGLE:
If you have a paid partnership with us, to give us permission to index your content, we know that internally. I mean, we don&#8217;t have many of those, and we&#8217;re not scanning the web and ACAP files to keep track of them. ACAP doesn&#8217;t even have a place for you to tell us this, anyway.</p>
<p>PUBLISHER:
I don&#8217;t have a partnership. But I&#8217;m saying you should only index my content if you DO have a partnership. But you keep indexing it.</p>
<p>GOOGLE:
Well, then block us. Surely you know if we don&#8217;t have a partnership or not. And you can use robots.txt to authorize indexing all you want.</p>
<p>PUBLISHER:
But I want you to license our content!</p>
<p>GOOGLE:
Yeah, we get that. Hey, check it out, have you see our free wifi at airports?</blockquote>
<p><strong>ACAP Not A Business Solution; Search Engines, Get Organized!</strong></p>
<p>Overall, there are some ideas in ACAP that would be useful for the search  engines to consider. However, there are many ideas outside of ACAP that would  also be useful for them to consider. There&#8217;s nothing I see within ACAP that  provides some type of crucial control that if only news publishers had, all  their online woes would be over. What the news publishers really want are  licensing agreements, and given that Google already has several of these without  using ACAP (see <a href="../../josh-cohen-of-google-news-on-paywalls-partnerships-working-with-publishers-29881">Josh Cohen Of Google News On Paywalls, Partnerships &amp; Working With Publishers</a>), I can&#8217;t see that having it somehow advances any business model  changes.</p>
<p>Certainly the search engines need to get their act together more, however.  It&#8217;s time to stop referring people to the REP site which is run by no one. It&#8217;s  time to stop having a myriad of help pages scattered about within their  respective sites. Yes, they should continue to have their own help pages (see Google&#8217;s webmaster help from <a href="http://www.google.com/webmasters/">here</a>; Bing&#8217;s from <a href="http://www.bing.com/webmaster">here</a>). But  I&#8217;d like to see Google and Microsoft take the lead to also consolidate material  into a common site, perhaps building off <a href="http://sitemaps.org/"> Sitemaps.org</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/head-to-head-acap-versus-robots-txt-for-controlling-search-engines-30816/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>NAR Changes Its Mind: Google Is Not A Scraper Site</title>
		<link>http://searchengineland.com/nar-google-is-not-scraper-site-30105</link>
		<comments>http://searchengineland.com/nar-google-is-not-scraper-site-30105#comments</comments>
		<pubDate>Wed, 18 Nov 2009 19:17:47 +0000</pubDate>
		<dc:creator>Matt McGee</dc:creator>
				<category><![CDATA[Google: Critics]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=30105</guid>
		<description><![CDATA[Backtracking on a controversial decision earlier this year, the National Association of REALTORS® (NAR) has adopted a new policy that allows real estate professionals to have their sites &#8212; including home listings that belong to others &#8212; indexed by search engines. The controversy reached a peak this year when the NAR agreed with a local [...]]]></description>
			<content:encoded><![CDATA[<p>Backtracking on a <a href="http://searchengineland.com/google-is-scraper-says-national-association-of-realtors-19046">controversial decision</a> earlier this year, the National Association of REALTORS® (NAR) has adopted a new policy that allows real estate professionals to have their sites &#8212; including home listings that belong to others &#8212; indexed by search engines.</p>
<p>The controversy reached a peak this year when the NAR agreed with a local decision in Indianapolis that said real estate agents couldn&#8217;t let Google and other search engines index the property listings on their sites if those listings belonged to other brokers/agents. In March, the Indianapolis board sent a letter to some agents that essentially called search engines &#8220;scraper&#8221; sites:</p>
<blockquote>&#8220;&#8230;the National Association of REALTORS® (NAR) is in agreement with our interpretation of the policy that the above described practice of &#8216;indexing your Web site&#8217; as you have called it, is a method of scraping or reproducing the data&#8221;</blockquote>
<p>At their meeting this week, the NAR Board of Directors <a href="http://www.realtor.org/inis.nsf/HTMLNewstest/SpecialINS11162009">revised</a> its policy on home listings and search engines to say that participants &#8220;are not required to prevent indexing of their Web sites by recognized search engines.&#8221; </p>
<p>That Realtor.org link is also interesting for its comments about the development of REALTORS® Property Resource, which many say is essentially a national search engine for property listings.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/nar-google-is-not-scraper-site-30105/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Twitter Not Giving Access To Private Tweets</title>
		<link>http://searchengineland.com/twitter-not-giving-access-to-private-tweets-28122</link>
		<comments>http://searchengineland.com/twitter-not-giving-access-to-private-tweets-28122#comments</comments>
		<pubDate>Tue, 20 Oct 2009 19:44:34 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: General]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Legal: Privacy]]></category>
		<category><![CDATA[Top News]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=28122</guid>
		<description><![CDATA[Is Twitter allowing search engines access to protected tweets or not? Not, Twitter tells me, though the company probably needs to do a bit more to prevent this type of confusion in the future. The LA Times reported yesterday about a &#8220;Twitter hole&#8221; that it believed allowed Google special access to protected tweets, tweets made [...]]]></description>
			<content:encoded><![CDATA[<p>Is Twitter allowing search engines access to protected tweets or not? Not,  Twitter tells me, though the company probably needs to do a bit more to prevent  this type of confusion in the future.</p>
<p>The LA Times <a href="http://latimesblogs.latimes.com/technology/2009/10/twitter-see-protected-tweets.html">reported  yesterday</a> about a &#8220;Twitter hole&#8221; that it believed allowed Google special  access to <a href="http://help.twitter.com/forums/10711/entries/14016">protected  tweets</a>, tweets made from Twitter accounts where owners have deliberately  chosen not to have their tweets be made public.</p>
<p><a href="http://www.techcrunch.com/2009/10/19/the-new-twitter-hole-that-probably-isnt/">Not  so</a>, said TechCrunch. The so-called protected tweets that the LA Times was  finding in Google looked to be those made from before particular account holders  locked down their accounts.</p>
<p>I checked with Twitter and got back the official word from their press  office:</p>
<blockquote>The TechCrunch article seems to sum up the confusion pretty well. It seems  that the LA Times piece references tweets that were public but later the user  protected the account, thus all subsequent tweets are private along with the  profile. The tweets prior to that time <strong>cannot</strong> be un-cached.</p>
<p>Google has not been given a key to the castle&#8230;so to speak.</blockquote>
<p>I&#8217;m good with this answer except for the word I&#8217;ve bolded &#8212; that formerly  public tweets cannot be uncached. That&#8217;s incorrect.</p>
<p>Let&#8217;s take an example. Let&#8217;s assume you started your Twitter account in  March. You started tweeting publicly, then in July decided to be private.  Twitter doesn&#8217;t try to protect any of your past tweets. In fact, it&#8217;s pretty  clear about this in its help page <a href="http://help.twitter.com/forums/10711/entries/14016">on the topic</a>:</p>
<blockquote>If you have a public account and you protect it, all updates after the time  of protection will be protected. Your profile will only be visible to approved  followers, and existing followers will not be affected.</p>
<p>Please note that tweets from protected profiles will not appear in search  results. People will still be able to find your account using the Find People  search tool but only people you&#8217;ve approved to follow your account will be able  to see your tweets. Also note that any tweets posted while your profile is  private will remain private indefinitely, and tweets posted while your account  is public will remain public indefinitely</blockquote>
<p>But Twitter could try to protect those formerly public tweets. As best I can  tell, if you lock down an account, Twitter does make ALL tweets (formerly public  or not) inaccessible to everyone accept those the account holder has authorized  to see them. That includes search engines like <a href="http://www.google.com/">Google</a> or a Twitter-specific search engine  like <a href="http://topsy.com/">Topsy</a>.</p>
<p>Well, if Google can&#8217;t get in to tweets after an account has been protected,  why does it show some? And why does Twitter say this will happen?</p>
<p>Google seems to rely on the last information for a tweet that it could see.  So you tweeted something in March. Google sees the tweet and records it. If in  August, you protect your account. Google tries to revisit your tweets as it does  with any web page, to make sure it has fresh information. It can&#8217;t get to any of  your tweets now.</p>
<p>The ones from August, it never saw them, since they were never  public &#8212; so it doesn&#8217;t list them.</p>
<p>That tweet in March? It keeps showing the  information from the last time it saw it. And apparently, it will keep doing  this for weeks or months.</p>
<p>Google didn&#8217;t send me a comment about this (I did ask, and I might get one later today). But that&#8217;s  just how I know Google works and can see it specifically working with some  protected tweets I investigated today.</p>
<p>As for Topsy, they <a href="http://twitter.com/Topsy/statuses/5023777459">told me</a>:</p>
<blockquote>Topsy only displays tweets that were once public. The refresh button will  make them vanish if the account is now private.</blockquote>
<p>Back to Google. Eventually it should update its old copy of the tweet with  what it currently shows to non-authorized visitors, a message that says &#8220;This  person has protected their tweets&#8221; (you can see this for millions of people <a href="http://www.google.com/#hl=en&amp;safe=off&amp;q=site%3Atwitter.com+%22This+person+has+protected+their+tweets%22">on  Google now</a>).</p>
<p>Twitter could speed that process along by explicitly  blocking tweets from a protected account with a <a href="../../meta-robots-tag-101-blocking-spiders-cached-pages-more-10665">meta  robots tag</a> configured to remove the page from the index entirely and from  cached copies being allowed (the NOINDEX, NOARCHIVE commands).</p>
<p>That wouldn&#8217;t guarantee that formerly public tweets are all taken private, of  course. Once something&#8217;s put out on the public web, it&#8217;s very difficult to pull  it back. But it could help and seems an easy enough change to do.</p>
<p>If you have a protected account, also keep in mind that those who follow you  might retweet what you tweet to the world. If you&#8217;re that worried, make sure you  pick your followers carefully and regularly keep them informed that you don&#8217;t  want things retweeted. Otherwise, be prepared for your private tweets to leak  out.</p>
<p>For more about search and tweets, see my <a href="../../what-is-real-time-search-definitions-players-22172">What  Is Real Time Search? Definitions &amp; Players</a> post which cover some ways to  make use of Google and its <a href="../../up-close-with-google-search-options-26985">search  options</a> feature to drill-down into tweets.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/twitter-not-giving-access-to-private-tweets-28122/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yahoo Wins In Lawsuit Over Search Descriptions</title>
		<link>http://searchengineland.com/yahoo-wins-in-lawsuit-over-search-descriptions-25473</link>
		<comments>http://searchengineland.com/yahoo-wins-in-lawsuit-over-search-descriptions-25473#comments</comments>
		<pubDate>Wed, 09 Sep 2009 16:20:06 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Legal: General]]></category>
		<category><![CDATA[Yahoo: Legal]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=25473</guid>
		<description><![CDATA[Eric Goldman updated us on a case where Yahoo was sued for showing spam and porn pages for a search on a person&#8217;s name. Beverly Stayart was upset that when you searched for her name on Yahoo, Yahoo returned some results that were not to her liking. So she sued Yahoo but the court wouldn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>Eric Goldman <a href="http://blog.ericgoldman.org/archives/2009/09/yahoos_search_r.htm">updated</a> us on a case where <A href="http://searchengineland.com/yahoo-sued-for-showing-spam-pages-about-beverly-16601">Yahoo was sued</a> for showing spam and porn pages for a search on a person&#8217;s name.  Beverly Stayart was upset that when you searched for her name on Yahoo, Yahoo returned some results that were not to her liking.  So she sued Yahoo but the court wouldn&#8217;t have it.</p>
<p>Goldman explained that the court rejected her suit on three different levels.   She sued Yahoo for Lanham Act false endorsement and it was denied by the court, as expected.  The reason it was rejected? Goldman sums it up as follows:</p>
<blockquote>The court denies Various&#8217; 230 defense because its association with the banner ad was unclear. Having dismissed the federal Lanham Act claims completely, the court then declines supplemental jurisdiction over the state law claims. The court also rejects Stayart&#8217;s guffaw-inducing request for sanctions against the defendants for having the temerity of moving to dismiss her complaint.</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/yahoo-wins-in-lawsuit-over-search-descriptions-25473/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>US Justice Dept. Formally Confirms Google Books Inquiry</title>
		<link>http://searchengineland.com/us-justice-dept-confirms-google-books-inquiry-21958</link>
		<comments>http://searchengineland.com/us-justice-dept-confirms-google-books-inquiry-21958#comments</comments>
		<pubDate>Fri, 03 Jul 2009 12:34:38 +0000</pubDate>
		<dc:creator>Greg Sterling</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: Book Search]]></category>
		<category><![CDATA[Google: Critics]]></category>
		<category><![CDATA[Google: Legal]]></category>
		<category><![CDATA[Legal: Copyright]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=21958</guid>
		<description><![CDATA[It&#8217;s not really a surprise or even news that the US Department of Justice (DOJ) confirmed yesterday it was formally investigating the terms of the Google Book Search settlement. This was known as far back as April and essentially confirmed last month by various publications reporting that formal requests (called &#8220;civil investigative demands”) had been [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s not really a surprise or even news that the US Department of Justice (DOJ) <a href="http://www.techmeme.com/090703/p3#a090703p3">confirmed yesterday</a> it was formally investigating the terms of the Google Book Search <a href="http://searchengineland.com/google-settles-copyright-litigation-for-125-million-paves-way-for-novel-services-15282">settlement</a>. This was known <a href="http://searchengineland.com/reports-us-dept-of-justice-looking-at-antitrust-issues-over-google-books-lawsuit-18238">as far back as April</a> and essentially confirmed last month by various publications <a href="http://searchengineland.com/doj-increases-scrutiny-of-google-book-settlement-20795">reporting</a> that formal requests (called &#8220;civil investigative demands”) had been issued to book publishers by the DOJ.</p>
<p>What happened yesterday was that the DOJ provided procedural notification to the court that the US is exploring the potential anti-trust dimensions of the Google settlement. The so-called &#8220;fairness hearing&#8221; to finalize the terms of the settlement will happen on October 7 and the US has until September 19 to &#8220;present its views in writing.&#8221; That&#8217;s not a lot of time.</p>
<p>What did the DOJ say in its <a href="http://www.scribd.com/doc/17045068/SDNY-Order-DOJ-Letter">two paragraph letter</a> notifying the court of its formal investigation? Here&#8217;s the body of the letter:</p>
<p><img class="alignnone size-full wp-image-21959" title="picture-18" src="http://searchengineland.com/figz/wp-content/seloads/2009/07/picture-18.png" alt="picture-18" width="470" height="443" /></p>
<p>The <a href="http://www.america.gov/st/educ-english/2008/April/20080423212813eaifas0.42149.html">Sherman Anti-Trust Act</a> is mentioned in the first paragraph of the government&#8217;s letter. The Sherman Act (and related subsequent legislation) is intended to promote competition and prevent consolidation of power and control over markets. For example, Microsoft was <a href="http://news.bbc.co.uk/2/hi/business/700084.stm">found</a> in 2000 to have violated the Sherman Anti-Trust Act for too closely tying Internet Explorer to the Windows operating system.</p>
<p>One of the central questions the DOJ is exploring in its review of the Google Book Search settlement is whether Google gains exclusive rights or benefits to which other parties don&#8217;t have access. Some legal scholars <a href="http://www.slideshare.net/naypinya/reflections-on-the-google-book-search-settlement-by-pamela-samuelson">have argued</a> that Google does gain what amounts to a monopoly over so-called &#8220;orphan books,&#8221; books that are out of print but not yet in the public domain and where copyright owners are effectively impossible to locate.</p>
<p>Google has repeatedly said that the agreement is non-exclusive but has not discussed that contention in precise detail. Questions in the anti-trust investigation will include how much control Google has over the &#8220;corpus&#8221; (material scanned), over pricing of book sales and to what degree third parties can gain access to the material. (New York University is <a href="http://searchengineland.com/google-book-search-lawsuit-gets-conference-treatment-21746">holding a conference</a> on the Book Search Settlement for interested parties on October 8, a day after the fairness hearing.)</p>
<p>Unlike the Google-Yahoo <a href="http://searchengineland.com/as-expected-yahoo-announces-10-year-google-paid-search-deal-14198">paid search deal</a> that Google <a href="http://searchengineland.com/citing-risk-google-ends-yahoo-paid-search-deal-15375">walked away from to avoid a legal battle with the DOJ</a>, the Book Search Settlement is different. Because of the underlying lawsuit Google cannot simply walk away here. Putting aside some of the less substantive issues (e.g., whether the attorneys fees are too high) as a basic matter there are essentially only two potential outcomes: approval of the existing settlement or modification of selected terms. If I had to bet I would bet that we will see some sort of modification further expanding third party access to the scanned material.</p>
<p>You can read the <a href="http://books.google.com/googlebooks/agreement/press.html">settlement itself</a> or take a look at the public-facing Google Book Settlement <a href="http://www.googlebooksettlement.com/">website</a> for rights holders and publishers. For additional context and background here&#8217;s our &#8220;library&#8221; of <a href="http://searchengineland.com/library/google/google-book-search">Google Book Search stories</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/us-justice-dept-confirms-google-books-inquiry-21958/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Website Guilty Over Google&#8217;s Automated Snippet</title>
		<link>http://searchengineland.com/website-guilty-over-googles-automated-snippet-20347</link>
		<comments>http://searchengineland.com/website-guilty-over-googles-automated-snippet-20347#comments</comments>
		<pubDate>Tue, 02 Jun 2009 13:46:16 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: Legal]]></category>
		<category><![CDATA[Google: Web Search]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=20347</guid>
		<description><![CDATA[The Register reports a Dutch court has ruled against a Dutch web site, where the web site was sued over the automated snippet used in Google for a keyword phrase. When someone searched for [Zwartepoorte] and [bankrupt], Google showed Miljoenhuizen.nl and the snippet in the Google search results for Miljoenhuizen.nl was: Complete name: Zwartepoorte Specialiteit: [...]]]></description>
			<content:encoded><![CDATA[<p>The Register <a href="http://www.theregister.co.uk/2009/06/02/google_snippet/">reports</a> a Dutch court has ruled against a Dutch web site, where the web site was sued over the automated snippet used in Google for a keyword phrase. </p>
<p>When someone searched for [Zwartepoorte] and [bankrupt], Google showed Miljoenhuizen.nl and the snippet in the Google search results for Miljoenhuizen.nl was:</p>
<blockquote>Complete name: Zwartepoorte Specialiteit: BMW&#8230;This company has been declared bankrupt, it has been acquired by the motordealer I have worked for Boat Rialto&#8230;</blockquote>
<p>What is the issue?  Well, Zwartepoorte did not go bankrupt and was upset when the Google snippet showed they did.  So Zwartepoorte sued Miljoenhuizen.nl, not Google for this error, and won.  The site owner had to change their web page so Google would remove the snippet.</p>
<p>This is pretty crazy if you ask me.  Google algorithmically makes the snippets.  No where on the page did the site say the company went bankrupt.  In any event, the court required the web site to modify the page so Google would remove the automated snippet.  </p>
<p>&#8220;The Court argues that it might be true that the website had no control over the functioning of Google but suggests that these questions about the opacity of Google’s functioning should be addressed in a broader context&#8230; then concludes that defendant had its own responsibility,&#8221; says van Hoboken, a PhD candidate at the Institute for Information Law at the University of Amsterdam.</p>
<p>Hat tip to <a href="http://www.blogstorm.co.uk/court-rules-website-liable-for-google-snippet/">Blogstorm</a> for spotting this case.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/website-guilty-over-googles-automated-snippet-20347/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Open Letter To Google &amp; The AP: Reveal The Licensing Terms</title>
		<link>http://searchengineland.com/open-letter-to-google-the-ap-reveal-the-licensing-terms-20229</link>
		<comments>http://searchengineland.com/open-letter-to-google-the-ap-reveal-the-licensing-terms-20229#comments</comments>
		<pubDate>Fri, 29 May 2009 18:42:30 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: Business Issues]]></category>
		<category><![CDATA[Google: News]]></category>
		<category><![CDATA[Legal: Copyright]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=20229</guid>
		<description><![CDATA[Discussions between Google and the Associated Press about renewing their content licensing deal continue, I assume, but all&#8217;s quiet recently on the negotiation front. I want to disrupt that. It would be wrong in this particular case for both parties to reach a deal where &#8220;terms are not disclosed.&#8221; The future of journalism, as well [...]]]></description>
			<content:encoded><![CDATA[<p><a href="../../sorry-tom-curley-no-google-ranking-boost-for-ap-18402">Discussions</a> between Google and the Associated Press about renewing their content licensing  deal continue, I assume, but all&#8217;s quiet recently on the negotiation front. I  want to disrupt that. It would be wrong in this particular case for both parties  to reach a deal where &#8220;terms are not disclosed.&#8221; The future of journalism, as  well as Google&#8217;s own reputation, deserves for things to open up.</p>
<p>After threatening a lawsuit against Google several years ago, AP <a href="../../google-news-now-hosting-wire-stories-promises-better-variety-in-results-12064">won</a> its first licensing deal with Google in 2006. Google was at pains to stress this  wasn&#8217;t a deal designed to gain the rights to merely list AP stories. It was  supposed to cover more &#8220;new&#8221; and &#8220;extensive&#8221; uses of AP material.</p>
<p>Bull. Today, you can still search at Google and find that it fails to list just one &#8220;originating&#8221; AP story in many cases. Sure, Google hosts AP stories, but it  wasn&#8217;t like Google set out trying for that goal. It was just as happy as  pointing outward. The bottom line was that the deal was a nice wrapper to go  around getting the AP off Google&#8217;s back.</p>
<p>Now the AP&#8217;s been on again that without the right deal, it&#8217;ll pull content  from Google. Technically, how they&#8217;ll do that is absurd. Will the AP robots.txt out  its entire site? And ensure that all of its members do the same for any AP  story? More likely, it would fall back to the lawsuit front.</p>
<p>But what&#8217;s the right deal? What&#8217;s Google paying the AP now, and what happened  in three years that this amount wasn&#8217;t enough? Since the terms were never  disclosed, the public can&#8217;t judge.</p>
<p>And the public needs to judge, especially as newspapers have <a href="http://correspondents.theatlantic.com/james_warren/2009/05/shhhh_newspaper_publishers_are_quietly_holding_a_very_very_important_conclave_today_will_you_soon_be.php">secret  stealth meetings</a> (which involve AP CEO Tom Curley) which involve sessions  like:</p>
<blockquote>Journalism Online: Presentation on proposed service to charge for access to  newspaper content and to license that content that (sic) online  aggregators</blockquote>
<p>(For more on the stealth meeting, also see the Nieman Journalism Lab <a href="http://www.niemanlab.org/2009/05/newspaper-execs-treading-carefully-on-antitrust-laws/">report</a> and <a href="http://www.techmeme.com/090528/p53#a090528p53">Techmeme</a>).</p>
<p>See, some in the newspaper industry persist in the assumption that merely  listing headlines and summarizing stories is a copyright offense. The AP itself wants to  charge for this and already has guidelines that make some people think they  might be violating fair use laws when they aren&#8217;t (see <a href="http://daggle.com/do-newspapers-owe-google-fees-for-researching-stories-611">Do  Newspapers Owe Google “Fair Share” Fees For Researching Stories?</a>).</p>
<p>In other quarters, we have people like Forbes.com CEO Jim Spanfeller <a href="../../forbes-spanfeller-attacks-google-stumbles-into-cesspool-18654">pulling  $60 million dollar figures out of the air</a>, about what he thinks Google owes  him and suggesting that Google is helping to destroy &#8220;one of the core building  blocks of our democracy.&#8221;</p>
<p>Since so much is at stake here &#8212; democracy itself! &#8212; it doesn&#8217;t seem right  that Google and the AP will reach a deal that no one gets the details too.  Shouldn&#8217;t all journalistic enterprises, be they traditional or not, understand  exactly the value Google&#8217;s willing to pay to one with a big mouth and a staff of  lawyers?</p>
<p>Moreover, shouldn&#8217;t Google be taking a stand on behalf of the majority of  publishers who I&#8217;d argue have no problem with aggregators or search engines  listing their content. A small number of publishers with inflated opinions about  their importance are pushing for (and winning) <a href="http://daggle.com/newspaper-tax-break-626">concessions</a> to compensate  for their dying business models and lack of foresight. Aren&#8217;t the wrong people  being rewarded?</p>
<p>You have to appreciate the rich irony. I can&#8217;t get the AP to talk with me at  all (<a href="http://daggle.com/do-newspapers-owe-google-fees-for-researching-stories-611">their  execs are all busy, I&#8217;ve been told</a>). When their CEO attends a stealth  meeting, it&#8217;s not even the AP that first reports about it despite the AP being  so convinced that they originate stories. Instead, the AP story <a href="http://www.sfgate.com/cgi-bin/article.cgi?f=/n/a/2009/05/28/financial/f150308D57.DTL">points</a> to an Atlantic blog <a href="http://correspondents.theatlantic.com/james_warren/2009/05/shhhh_newspaper_publishers_are_quietly_holding_a_very_very_important_conclave_today_will_you_soon_be.php">post</a> that broke the news. And Google, which <a href="../../google-as-open-as-it-wants-to-be-ie-when-its-convenient-12624">touts  being open any time it is convenient</a>, is being hush-mouthed here.</p>
<p>Google needs to step up. Defend the &#8220;fair use&#8221; rights it believes it  has on its own behalf and those of the greater web ecosystem. Alternatively, negotiate a deal  to solve its AP problem but do it openly, so everyone else understands what  exactly is being given up.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/open-letter-to-google-the-ap-reveal-the-licensing-terms-20229/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.532 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2012-05-25 20:15:51 -->
<!-- Compression = gzip -->
