<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>searchengineland.com &#187; SEO: Cloaking &amp; Doorway Pages</title>
	<atom:link href="http://searchengineland.com/library/seo/seo-cloaking-doorway-pages/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: Must Read News About Search Marketing &#38; Search Engines</description>
	<lastBuildDate>Mon, 23 Nov 2009 00:40:51 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>See What Googlebot Sees On Your Site</title>
		<link>http://searchengineland.com/see-what-googlebot-sees-on-your-site-27623</link>
		<comments>http://searchengineland.com/see-what-googlebot-sees-on-your-site-27623#comments</comments>
		<pubDate>Tue, 13 Oct 2009 20:01:18 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Google: Webmaster Central]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>
		<category><![CDATA[SEO: Redirects & Moving Sites]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=27623</guid>
		<description><![CDATA[Google Webmaster Tools has just launched a &#8220;labs&#8221; section, where you&#8217;ll find new features that may be early in the development cycle and not quite as robust as the rest of the tools. The features available so far are Fetch as Googlebot, which lets you see exactly what Googlebot is served when it requests a [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fsee-what-googlebot-sees-on-your-site-27623"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fsee-what-googlebot-sees-on-your-site-27623" height="61" width="51" /></a></div><p>Google Webmaster Tools has <a href="http://googlewebmastercentral.blogspot.com/2009/10/fetch-as-googlebot-and-malware-details.html">just launched a &#8220;labs&#8221; section</a>, where you&#8217;ll find new features that may be early in the development cycle and not quite as robust as the rest of the tools. The features available so far are <em>Fetch as Googlebot</em>, which lets you see exactly what Googlebot is served when it requests a URL from your server and <em>Malware Details</em>, which shows you malicious code snippets from your site if it&#8217;s been flagged as containing malware.</p>
<p><strong>Fetch as Googlebot</strong></p>
<p>Of most interest to webmasters, SEOs, and web developers is likely the Fetch as Googlebot feature. You can specify any URL on your site and see the HTTP response (header and contents) that the server returns. Simply  indicate the URL and click the Fetch button. It may take a few moments for Googlebot to access the page and return the results, since it fetches the page in real time. (Refresh the page to see the progress.)</p>
<p><a title="Google Fetch as Googlebot by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/4009489298/"><img src="http://farm3.static.flickr.com/2434/4009489298_f9879b18af.jpg" alt="Google Fetch as Googlebot" width="500" height="229" /></a></p>
<p>Click the Success link once it&#8217;s been processed to see the results.</p>
<p><a title="Google Fetch As Googlebot Results by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/4008724331/"><img src="http://farm3.static.flickr.com/2467/4008724331_bf6ee1260c.jpg" alt="Google Fetch As Googlebot Results" width="500" height="372" /></a></p>
<p>How is this different from simply looking at the source code of the page?</p>
<ul>
<li>You see the HTTP header information at the top. This information is generally easily available through tools such as Live HTTP Headers, but isn&#8217;t contained in the source code itself (since that information is coming from the server, not the page).</li>
<li>You can see if the server is returning any of the page information differently than the page has been coded.</li>
<li>You can see if the server is returning something different to Googlebot than what other users see. This tool uses the same user-agent and IP range as Googlebot when it crawls the web, so if the server is configured conditionally for user agent or IP address (typically known as &#8220;<a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=66355">cloaking</a>&#8220;), you&#8217;ll see  what&#8217;s being conditionally served to Google.</li>
<li>You can use the tool to test changes (particularly things like redirects) in real-time.</li>
</ul>
<p>Note that this tool won&#8217;t necessarily show you the content that Google is able to extract from the page. If the page contains JavaScript, for instance, you&#8217;ll see the raw JavaScript code contained on the page, not the rendered view visible in the browser. Which, unfortunately means you can&#8217;t use this tool to determine if Google is able to access content contained in rich markup.</p>
<p><strong>What&#8217;s this about cloaking?</strong></p>
<p>This tool can help you determine if the pages are being cloaked to Google. This may be useful if you&#8217;re coming into a project late and aren&#8217;t sure what&#8217;s been previously done. It can also help uncover if your site has been hacked. Back in 2006, <a href="http://blog.sitepronews.com/index.php?/archives/23-Matt-Cutts-on-Good-Karma-Domain-Hijacking-as-a-Blackhat-Technique.html">Googler Matt Cutts and I did a show on Webmaster Radio</a> during which we talked about how in some cases, a hacker might add links to a site and then cloak those pages so that the site owner never sees them. Only Google does. At the time, Matt suggested <a href="http://blog.sitepronews.com/index.php?/archives/25-Matt-Cutts-Response-to-Good-Karma-Questions.html">using Google Translate</a> (and choosing English to English) to see what Googlebot was being served, but this tool can now more easily serve that purpose. Matt confirmed this to me this morning: &#8220;The biggest use case is just debugging site issues. Of those, the biggest case will be hacked sites. Some attacks will hide content until search engines fetch the page (and some attackers add a noarchive tag so that the search result doesn&#8217;t have a &#8220;Cached&#8221; link), so a site could look clean to the website owner. Using this feature will site owners verify that there are no hidden links in the page that Google actually fetches.&#8221;</p>
<p><strong>How do I test redirects?</strong></p>
<p>If you&#8217;ve implemented redirects, you can use this tool to test how Googlebot will interpret those redirects without waiting for those pages to be crawled. For instance, when I fetch www.searchengineland.com, I see that the redirect is correctly implemented as a 301 and points to searchengineland.com:</p>
<p><a title="Google Fetch as Googlebot by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/4009489502/"><img src="http://farm3.static.flickr.com/2555/4009489502_1ccef8d5ae_o.jpg" alt="Google Fetch as Googlebot" width="371" height="300" /></a></p>
<p>You can also use the tool to troubleshoot URLs listed in the Crawl Errors &gt; Not Followed report. You can also test these URLs using something like Live HTTP Headers or by trying to access the URLs in a browser, but if neither of those methods uncover the problem, this tool can help determine that the issue is specific to Googlebot. You can also use this tool to verify that fixes you&#8217;ve made to redirect errors uncovered by the Not Followed report have really solved the problem.</p>
<p>(Note that the tool currently has a limit of 100kb per page. However, this is for the tool only and doesn&#8217;t apply to Googlebot&#8217;s normal crawl of the site. Google is monitoring feedback to see if many site owners find this size to be limiting.)</p>
<p><strong>Malware details</strong></p>
<p>The Google Online Security Blog has more information on the <a href="http://googleonlinesecurity.blogspot.com/2009/10/show-me-malware.html">malware details tool</a>. Previously, webmaster tools reported when the site was flagged has having malware and listed sample URLs. This new tool will also show samples of the malicious content, and in some cases, the underlying cause. This should help those site owners whose sites have been hacked to include malware find the problem and fix it. If your site does contain malware and you&#8217;ve fixed it, you can<a href="http://googlewebmastercentral.blogspot.com/2008/10/malware-we-dont-need-no-stinking.html"> request a review</a> to have the malware alert removed in search results.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/see-what-googlebot-sees-on-your-site-27623/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google&#8217;s First Click Free Program For Web Content, Not Just News</title>
		<link>http://searchengineland.com/googles-first-click-free-program-for-web-content-not-just-news-15165</link>
		<comments>http://searchengineland.com/googles-first-click-free-program-for-web-content-not-just-news-15165#comments</comments>
		<pubDate>Fri, 17 Oct 2008 17:56:03 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Google: Web Search]]></category>
		<category><![CDATA[Google: Webmaster Central]]></category>
		<category><![CDATA[How To: SEO]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=15165</guid>
		<description><![CDATA[Google&#8217;s long had a &#8220;First Click Free&#8221; program that allows news publishers to make their content accessible to search spiders but requires human visitors to login if they&#8217;ve already viewed one page on the site for free &#8212; hence the &#8220;first click free&#8221; name. Earlier this year, Google said this program was OK for web [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogles-first-click-free-program-for-web-content-not-just-news-15165"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogles-first-click-free-program-for-web-content-not-just-news-15165" height="61" width="51" /></a></div><p>Google&#8217;s long had a &#8220;<a href="http://www.google.com/support/news_pub/bin/answer.py?answer=40543&amp;topic=11707">First Click Free</a>&#8221; program that allows news publishers to make their content accessible to search spiders but requires human visitors to login if they&#8217;ve already viewed one page on the site for free &#8212; hence the &#8220;first click free&#8221; name. Earlier this year, <a href="http://googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html">Google said</a> this program was OK for web publishers to use &#8212; not just news publishers. Today, they&#8217;ve <a href="http://googlewebmastercentral.blogspot.com/2008/10/first-click-free-for-web-search.html">done a post</a> more formally making that point.</p>
<p><span id="more-15165"></span>Let me take you through some of the history of this program as it relates to web search.</p>
<p>Back in March 2007, Danny Sullivan&#8217;s <a href="http://searchengineland.com/070304-231603.php">Yet Another Debate About Cloaking Happens</a> article talked about how First Click Free was offered to news publishers but not general web publishers, something he wanted to see changed</p>
<p>In September 2007, <a href="http://searchengineland.com/first-click-free-accessing-subscription-based-articles-for-free-via-google-news-12236.php">First Click Free: Accessing Subscription-Based Articles For Free Via Google News</a> covers how Google did a blog <a href="http://googlenewsblog.blogspot.com/2007/09/first-click-free.html">post</a> to help promote the idea of First Click Free. It still was a news publisher-only program, but many news publishers weren&#8217;t aware of it.</p>
<p>In June 2008, Google <a href="http://googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html">did</a> a blog post saying that Google News&#8217;s First Click Free policy could be applied to &#8220;to include your premium or subscription-based content in Google&#8217;s websearch index without violating our quality guidelines.&#8221;</p>
<p>Many people missed this important change. Danny <a href="http://searchengineland.com/no-advanced-seo-does-not-mean-spamming-14165.php">highlighted it</a> in a post that came out of SMX Advanced. Over at Search Engine Roundtable, Matt Cutts also <a href="http://www.seroundtable.com/archives/017331.html#comments">commented</a> to clarify things, after Google elsewhere has mistakenly said First Click Free was something &#8220;reserved&#8221; for Google News. Matt said:</p>
<blockquote><p>First Click Free originated with Google News, but you can use the same way of handling content in web search (show the same page to users and Googlebot, then if the user clicks to read a different article, then you can show them the registration or pay page).</p>
<p>Because the same page is presented to users and to Googlebot, it&#8217;s not cloaking. So First Click Free is a great way if you have premium content to surface it in Google&#8217;s web index without cloaking. Hope that makes sense.</p></blockquote>
<p>Today&#8217;s post should help end any further confusion. First Click Free isn&#8217;t just for Google News &#8212; anyone can use it.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/googles-first-click-free-program-for-web-content-not-just-news-15165/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Long Road To The Debate Over &#8220;White Hat Cloaking&#8221;</title>
		<link>http://searchengineland.com/the-long-road-to-the-debate-over-white-hat-cloaking-14306</link>
		<comments>http://searchengineland.com/the-long-road-to-the-debate-over-white-hat-cloaking-14306#comments</comments>
		<pubDate>Tue, 01 Jul 2008 19:13:41 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/the-long-road-to-the-debate-over-white-hat-cloaking-14306.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fthe-long-road-to-the-debate-over-white-hat-cloaking-14306"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fthe-long-road-to-the-debate-over-white-hat-cloaking-14306" height="61" width="51" /></a></div><p>I really hate arguments over
<a href="http://searchengineland.com/070814-093136.php">cloaking</a>. Like
really hate them. But Rand Fishkin recently
<a href="http://www.seomoz.org/blog/white-hat-cloaking-it-exists-its-permitted-its-useful">
did a chart</a> outlining the degrees of &quot;safeness&quot; to cloaking that in turn
riled up Google&#8217;s Matt Cutts in comments there. No doubt others at search
engines also dislike the idea that any cloaking is &quot;white hat.&quot; So I wanted
to revisit some of the things that Rand outlined about cloaking plus the
guidelines Google updated last month. Over the years, content delivery
methods that were once considered cloaking have become acceptable. This is a
look at what those are and how we need a new name for them.</p>
<p><span id="more-14306"></span></p>
<p><b>How Does Google Define Cloaking?</b></p>
<p>First, if you want to understand how many tiring debates we&#8217;ve had over
cloaking issues, I strongly urge you to read my
<a href="http://searchengineland.com/070304-231603.php">YADAC: Yet Another
Debate About Cloaking Happens Again</a> post from last year. One of the
things it discusses is how Google dropped its definition of what cloaking
was in 2006. Before then, it was:</p>
<blockquote>
<p>The term &quot;cloaking&quot; is used to describe a website that returns altered
webpages to search engines crawling the site. In other words, the
webserver is programmed to return different content to Google than it
returns to regular users, usually in an attempt to distort search engine
rankings. This can mislead users about what they&#8217;ll find when they click
on a search result. To preserve the accuracy and quality of our search
results, Google may permanently ban from our index any sites or site
authors that engage in cloaking.</p>
</blockquote>
<p>We were left with
<a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&#038;answer=35769">
this</a> on another page:</p>
<blockquote>
<p>Make pages primarily for users, not for search engines. Don&#8217;t deceive
your users or present different content to search engines than you display
to users, which is commonly referred to as &quot;cloaking.&quot;</p>
</blockquote>
<p>Then around June of last year,
<a href="http://web.archive.org/web/*/http:/www.google.com/support/webmasters/bin/answer.py?answer=66355">
as best I can tell</a>, we got this:</p>
<blockquote>
<p>Cloaking refers to the practice of presenting different content or URLs
to users and search engines. Serving up different results based on user
agent may cause your site to be perceived as deceptive and removed from
the Google index.</p>
<p>Some examples of cloaking include:</p>
<ul>
<li>Serving a page of HTML text to search engines, while showing a page
of images or Flash to users.</li>
<li>Serving different content to search engines than to users.</li>
</ul>
<p>If your site contains elements that aren&#8217;t crawlable by search engines
(such as Flash, Javascript, or images), you shouldn&#8217;t provide cloaked
content to search engines. Rather, you should consider visitors to your
site who are unable to view these elements as well. For instance:</p>
<ul>
<li>Provide alt text that describes images for visitors with screen
readers or images turned off in their browsers.</li>
<li>Provide the textual contents of Javascript in a noscript tag.</li>
</ul>
<p>Ensure that you provide the same content in both elements (for
instance, provide the same text in the Javascript as in the noscript tag).
Including substantially different content in the alternate element may
cause Google to take action on the site.</p>
</blockquote>
<p>Then in June of this year, Google did a
<a href="http://googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html">
major blog post</a> on the topic, which along with cloaking covered topics
such as:</p>
<ul>
<li>Geolocation</li>
<li>IP delivery</li>
<li>First Click Free</li>
</ul>
<p>Much of what&#8217;s in the blog post has yet to migrate to the help pages,
though I&#8217;m sure it will. But the more important point to me is that over
time, things that people might have once considered cloaking have become
acceptable to Google &#8212; and get named something other than cloaking in the
process.</p>
<p><b>Renaming Confusion</b></p>
<p>Let&#8217;s take geolocation as an example of that. Years ago when Google would
campaign against cloaking, those who disagreed would argue that Google
itself cloaked. For example, go to Google.com from outside the US and
<a href="http://searchengineland.com/080515-100958.php">you&#8217;ll see</a> a
different page than someone in the US sees. Cloaking! No, Google eventually
argued &#8212; that&#8217;s not cloaking. It&#8217;s geolocation and perfectly fine for
anyone to do.</p>
<p>OK, how about showing Google content that human users only see if they
register with a site and/or pay for access. Cloaking! Well, here Google took
some time to clarify things for web search. Google informally said that
users shouldn&#8217;t have to pay to see content if they came from Google. In
Google News search, this is called
<a href="http://www.google.com/support/news_pub/bin/answer.py?answer=40543">
First Click Free</a>, and it&#8217;s a formalized process for news publishers to
follow (that&#8217;s why you can <a href="http://daggle.com/080515-120207.html">
read the Wall Street Journal for free</a> via Google News). But for web
search, we never got a formal program or formal mention that this was allowed
until the June 2008 blog post.</p>
<p>Along the way, we&#8217;ve also had more advice about doing things to make
Flash content visible through JavaScript detection and replacement (see
today&#8217;s excellent post,
<a href="http://searchengineland.com/080701-000002.php">Google Now Crawling
And Indexing Flash Content</a>, from Vanessa Fox on those methods plus the
new Flash reading that Google&#8217;s doing). We&#8217;ve also had advice about
rewriting URLs to make them more search engine friendly &#8212; good advice &#8211;
but also a situation that in some cases can be URL cloaking. We&#8217;ve also had
situations where the search engines have absolutely known someone was
cloaking &#8212; no ifs, ands or buts &#8212; but decided the intent wasn&#8217;t bad so let
it go through (which is fine for me &#8212; I care more about intent than
kneejerk reaction to a bad &quot;tactic&quot;).</p>
<p>It&#8217;s confusing. Very confusing. Now what was cloaking again?</p>
<p><b>Charting Acceptability</b></p>
<p>To help, Rand put out a tactics chart with things rated from &quot;pearly
white&quot; to &quot;solid black.&quot; That&#8217;s nice, but it&#8217;s also debatable. In fact, it
IS being debated! So I wanted to go back to what Google itself is saying and
work from that.</p>
<p>The most important thing is that Rand takes a &quot;tactics&quot; approach &#8212; are
you using IP detection, cookie detection, and so on &#8212; then suggests that as
you add things like user agent detection or other tactics, you tip into the
danger zone. </p>
<p>In contrast, Google is less about how you physically deliver content and
more focused on the user experience &#8212; the end result of what a user sees.
The bottom line in that remains quite simple &#8212; Google generally wants to be
treated the same as a &quot;typical&quot; user. So if you typically use cookie
detection to deliver content, Google&#8217;s got no real issue with that &#8212; as
long as it sees what a typical user might see who doesn&#8217;t accept cookies.</p>
<p>Below, some content delivery methods now acceptable to Google that might
have once been considered cloaking:</p>
<table border="1" cellpadding="3" cellspacing="1" style="border-collapse: collapse" bordercolor="#111111" width="500">
<tr>
<td align="center"><b><font size="2">Content Delivery Method</font></b></td>
<td align="center"><b><font size="2">Description</font></b></td>
<td align="center"><b><font size="2">How Done</font></b></td>
<td align="center"><b><font size="2">OK With Google If&#8230;</font></b></td>
</tr>
<tr>
<td align="center"><font size="2">Geolocation</font></td>
<td align="center"><font size="2">Users see content tailored to their
physical location</font></td>
<td align="center"><font size="2">IP detection, cookies, user login</font></td>
<td align="center"><font size="2">Don&#8217;t do anything special just for
Googlebot</font></td>
</tr>
<tr>
<td align="center"><font size="2">First Click Free</font></td>
<td align="center"><font size="2">Users clicking from Google to a listed
page can read page without having to pay or register with the hosting
site [if they try to click past that page, it's then OK to toss up a
barrier]</font></td>
<td align="center"><font size="2">IP detection, cookies, user login</font></td>
<td align="center"><font size="2">You let Googlebot through as if it
were a paid/registered member and also allow anyone coming from Google&#8217;s
search listings through</font></td>
</tr>
<tr>
<td align="center"><font size="2">JavaScript Replacement</font></td>
<td align="center"><font size="2">Using JavaScript to show content to
non-JavaScript capable visitors (such as Google) that matches the
textual information within a Flash or other multimedia element</font></td>
<td align="center"><font size="2">JavaScript</font></td>
<td align="center"><font size="2">Don&#8217;t do anything special just for
Googlebot [it sees what any non-JavaScript person would see]</font></td>
</tr>
<tr>
<td align="center"><font size="2">Landing Page Testing</font></td>
<td align="center"><font size="2">Using tools like Google Website
Optimizer to change pages that are shown</font></td>
<td align="center"><font size="2">JavaScript, other</font></td>
<td align="center"><font size="2">Don&#8217;t do anything special just for
Googlebot [it sees what any non-JavaScript person would see]</font></td>
</tr>
<tr>
<td align="center"><font size="2">URL Rewriting</font></td>
<td align="center"><font size="2">Stripping out unnecessary parameters
and other URL &quot;garbage&quot; not needed to deliver a page</font></td>
<td align="center"><font size="2">Server side, JavaScript insertion</font></td>
<td align="center"><font size="2">The underlying content isn&#8217;t changing
&amp; you aren&#8217;t just detecting Googlebot to do it</font></td>
</tr>
</table>
<p>
Where am I getting the info for the chart above?</p>
<ul>
<li><b>Geolocation</b> &amp; <b>First Click Free</b> come from Google
Webmaster Central&#8217;s June 2008
<a href="http://googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html">
blog post</a>. The post talks about Google&#8217;s First Click Free as if it is
only for news content. That&#8217;s not correct. You can use this method for
Google web search, as was covered by Google reps speaking at SMX Advanced
2008 last month.<br />
&nbsp;</li>
<li><b>JavaScript Replacement</b>: Google&#8217;s
<a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66355">
page</a> on cloaking addresses this, plus reps have covered this advice in
many public forums.<br />
&nbsp;</li>
<li><b>Landing Page Testing:</b>
<a href="http://searchengineland.com/070404-084938.php">Google Website
Optimizer Now Available, But Is It Cloaking?</a> covers the issues here,
and
<a href="http://adwords.google.com/support/bin/answer.py?hl=en&#038;answer=72507">
How does Website Optimizer fit in with Google&#8217;s view of cloaking?</a> from
Google AdWords help has detailed advice on how it is acceptable (and when
it is not)</li>
</ul>
<ul>
<li><b>URL Rewriting:</b>
<a href="http://searchengineland.com/070301-065358.php">Good Cloaking,
Evil Cloaking &amp; Detection</a> from Stephan Spencer last year talked about
the ways to strip down URLs so they appear &quot;nicer&quot; in search results. He&#8217;d
surveyed the major search engines that emphatically told him this was OK,
even if their spiders were detected in order to do it. I believe this is
still fine as long as the content itself isn&#8217;t changing. </li>
</ul>
<p>And what&#8217;s not OK?</p>
<table border="1" cellpadding="3" cellspacing="1" style="border-collapse: collapse" bordercolor="#111111" width="500">
<tr>
<td align="center"><b><font size="2">Content Delivery Method</font></b></td>
<td align="center"><b><font size="2">Description</font></b></td>
<td align="center"><b><font size="2">How Done</font></b></td>
</tr>
<tr>
<td align="center"><font size="2">Cloaking</font></td>
<td align="center"><font size="2">Showing Google content that typical
users do not see</font></td>
<td align="center"><font size="2">IP, User Agent Detection</font></td>
</tr>
<tr>
<td align="center"><font size="2">Conditional Redirects</font></td>
<td align="center"><font size="2">Showing Google a redirect code (301)
that is different from what someone else sees</font></td>
<td align="center"><font size="2">IP, User Agent Detection</font></td>
</tr>
</table>
<p></p>
<ul>
<li><b>
Cloaking</b> comes from Google Webmaster Central&#8217;s June 2008
<a href="http://googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html">
blog post</a> and its help
<a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66355">
page</a> on the topic. If you&#8217;re delivering content to users that&#8217;s
different than what Google sees &#8212; and it&#8217;s not listed on the first chart
&#8211; you&#8217;re probably cloaking.<br />
&nbsp;</li>
<li><b>Conditional Redirects:</b> AKA, cloaking redirects. Discussed at
SMX Advanced 2008, especially
<a href="http://www.naturalsearchblog.com/archives/2008/06/03/amazons-secret-to-dominating-serp-results/">
how Amazon</a> has been doing it. Google
<a href="http://www.seroundtable.com/archives/017301.html">warned</a> that
<a href="http://videos.webpronews.com/2008/06/09/playing-with-bots/">
conditional redirects</a> might be risky.</li>
</ul>
<p><b>&quot;Whitehat&quot; Cloaking</b></p>
<p>I said I&#8217;m pretty tired over the cloaking debate, right? That&#8217;s because
way back when the debate started, we had search engines that DID allow
cloaking via XML feeds or other methods &#8211; or they turned blind eyes to
cloaking they knew about &#8212; or they had programs like Google Scholar that
sure felt like cloaking before guidelines caught up to say no, those aren&#8217;t
cloaking.</p>
<p>These exceptions made it difficult to say cloaking was &quot;bad&quot; when clearly
some of it was allowed. That&#8217;s why I urged the search engines &#8212; Google in
particular &#8212; to update the guidelines to say that any type of &quot;unapproved&quot;
cloaking might get you banned. My thought was that we could maybe skip past
the debate over tactics (and finger-pointing &#8212; ah-ha!!!! &#8212; this major site
is cloaking, ban them!) and focus more on the user experience.</p>
<p>Things have changed since then. The guidelines, as I&#8217;ve explained, have
gotten more detailed &#8212; and exceptions have been spun off into new names. As
part of this, the idea of &quot;white hat cloaking&quot; has come out, the &quot;good&quot;
cloaking that&#8217;s now acceptable.</p>
<p>I can tell you firsthand that Google doesn&#8217;t like the phrase &quot;white hat
cloaking&quot; at all. To Google, there&#8217;s cloaking &#8212; it&#8217;s always bad &#8212; and there are the other
content delivery methods I&#8217;ve outlined above.</p>
<p>OK, I&#8217;ll roll with that. Personally, I&#8217;ll avoid talking about &quot;white hat&quot;
or &quot;good&quot; cloaking if it helps improve relations between webmasters and
the search engines AND helps newbies avoid getting into trouble. But
I do think we need a term to encompass content delivery methods that do
target spiders or consider them in some way. </p>
<p>First Click Free, JavaScript replacement, even geolocation &#8212; while maybe
you don&#8217;t do something special for the search engines as part of them &#8211;
you&#8217;re still considering them. Indeed, part of what you consider is to ensure
that you might NOT do something special.</p>
<p>If cloaking is a bad word to the search engines that can never be
redeemed, we still need a name for the &quot;good&quot; things that are out there.
I&#8217;ve got my thinking cap on, and if you&#8217;ve got ideas, let me know by
commenting.</p>
<p>I&#8217;ll close by saying that my list above is not complete, in terms of the
various content delivery methods that are out there. I hope to grow this
over time &#8212; and you can help with your comments, as well.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/the-long-road-to-the-debate-over-white-hat-cloaking-14306/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Defines IP Delivery, Geolocation, &amp; Cloaking</title>
		<link>http://searchengineland.com/google-defines-ip-delivery-geolocation-cloaking-14124</link>
		<comments>http://searchengineland.com/google-defines-ip-delivery-geolocation-cloaking-14124#comments</comments>
		<pubDate>Tue, 03 Jun 2008 14:16:28 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>
		<category><![CDATA[SEO: Spamming]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/google-defines-ip-delivery-geolocation-cloaking-14124.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-defines-ip-delivery-geolocation-cloaking-14124"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-defines-ip-delivery-geolocation-cloaking-14124" height="61" width="51" /></a></div><p>Maile Ohye at the Google Webmaster Central Blog has &#8220;<a href="http://googlewebmastercentral.blogspot.com/2008/06/how-google-defines-ip-delivery.html">defined</a>&#8221; what Google considers to be IP delivery, geolocation, and cloaking.</p>
<p>On the geolocation front, Google recommends you treat &#8220;Googlebot as you would a typical user from a similar location.&#8221;  So, if Googlebot&#8217;s IP is coming from California, then serve up the same page you would serve a web user from California.  The same rule applies to IP delivery: serve the &#8220;same content a typical user from the same IP address would see&#8221; to Googlebot.  Clearly, if you serve different content to Googlebot then you would a normal user, that would be cloaking and would go against Google&#8217;s guidelines.  Finally, Google discusses the <a href="http://googlenewsblog.blogspot.com/2007/09/first-click-free.html">first click free</a> program from Google News, where if you visit an article a second time, you would need to enter a password.  When serving this content, you need to make sure the content is the same as when a normal user goes to the page.</p>
<p><span id="more-14124"></span>
Did Google really &#8220;define&#8221; what they think cloaking or IP delivery is?  Well, if you consider a definition to remove any doubt or questions, then no.  If you look at the <a href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/dec3f461bec0f6ab#">related Google Groups</a> thread, you can clearly see there is a lot of confusion and &#8220;what-ifs&#8221; in the thread.  So although the post did clarify some issues to those who are unaware of the issues with IP delivery and its various forms, it did not necessarily cover every &#8220;what-if&#8221; scenario.  Will it ever?  I doubt it.</p>
<p>Here is an awesome video from Maile on IP delivery:</p>
<p><object width="425" height="355"><param name="movie" value="http://www.youtube.com/v/XWfqyy7J34s&#038;hl=en"></param><param name="wmode" value="transparent"></param><embed src="http://www.youtube.com/v/XWfqyy7J34s&#038;hl=en" type="application/x-shockwave-flash" wmode="transparent" width="425" height="355"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-defines-ip-delivery-geolocation-cloaking-14124/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft Reports That Issues With Their Live Search Cloaking Detection System Have Been Fixed</title>
		<link>http://searchengineland.com/microsoft-reports-that-issues-with-their-live-search-cloaking-detection-system-have-been-fixed-12843</link>
		<comments>http://searchengineland.com/microsoft-reports-that-issues-with-their-live-search-cloaking-detection-system-have-been-fixed-12843#comments</comments>
		<pubDate>Tue, 04 Dec 2007 19:02:33 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Microsoft: Bing SEO]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/microsoft-reports-that-issues-with-their-live-search-cloaking-detection-system-have-been-fixed-12843.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fmicrosoft-reports-that-issues-with-their-live-search-cloaking-detection-system-have-been-fixed-12843"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fmicrosoft-reports-that-issues-with-their-live-search-cloaking-detection-system-have-been-fixed-12843" height="61" width="51" /></a></div><p>Today, Microsoft has <a href="http://blogs.msdn.com/webmaster/archive/2007/12/04/live-search-and-cloaking-detection.aspx">posted to the Live Search Webmaster Center blog</a> that the cloaking detection system they have been running for the past few months has, for some sites, skewed site statistics and ads reporting, as well as caused a high traffic load, and they have made some adjustments to the process to correct these errors. Below, details on the problems this process caused, what Live Search is doing about them, and a brief look at how the engines have historically dealt with cloaking.</p>
<p><span id="more-12843"></span>
For the last few months, there have been slight rumblings in the places webmasters frequent about <a href="http://www.seroundtable.com/archives/014509.html">odd referrers and traffic from MSNbot</a>, as well as from a bot not identified as such, but coming from an IP range identified as belonging to Microsoft.</p>
<p>In September, <a href="http://www.seroundtable.com/archives/014720.html">MSNDude posted on Webmaster World</a>, confirming that the traffic was coming from Microsoft as a result of &#8220;quality check&#8221; and that webmasters who blocked this bot faced being removed from the Live Search index.</p>
<p>Now Microsoft is further clarifying that the &#8220;quality check&#8221; referred to cloaking detection, although they allow that some cloaking may be valid.</p>
<p>In the blog post, Nathan Buggia of the Live Search Webmaster Team notes that over the last eight months, the team has been making good progress on spam fighting tools. One aspect of spam that they have been working on is cloaking. <a href="http://searchengineland.com/070814-093136.php">Cloaking is showing a different page to search engines than to users</a> and the use of it has been the subject of <a href="http://searchengineland.com/070301-065358.php">many a debate</a> over the years. Google has generally sent the message that all cloaking violates its guidelines (although in practice, some instances seem to be accepted). Microsoft&#8217;s post notes that &#8220;not all cloaking is spam related and we do our best to take this into account.&#8221; When I talked to Nathan about this, he said that they consider cloaking to be situations when site owners are trying to deceive search engines. However, they don&#8217;t recommend cloaking in any situation. In part, this recommendation stems from the fact that it&#8217;s difficult to gauge intent with automated cloaking detection tools and some innocent sites could suffer collateral damage.</p>
<p>If you show the same content to users and to search engines, but use a method that automated checks may flag as cloaking, or if you have cloaked your site in the past and you feel your site is being penalized in the Live Search index, you can request manual review of your site using the feedback at the Live Search Webmaster Center.</p>
<p>Live Search has been using a cloaking detection process that has used a bot not identified as MSNbot to crawl pages and compare them to the versions served to MSNbot. Microsoft says that this non-identified bot has adhered to robots.txt rules (this bot hasn&#8217;t retrieved the robots.txt file separately, but has followed the one retrieved by MSNbot), but has caused a few other problems for some site owners.</p>
<ul>
<li><strong>Skewed ads reporting:</strong> The bot initially had a bug that caused it to download ad blocks, which made it appear as a user for ads reporting purposes. This may have overinflated impression counts and lowered click-through rate reporting.</li>
<li><strong>Skewed site statistics:</strong> In some cases, the bot caused high traffic loads on sites, which not only consumed extra bandwidth, but also distorted visitor counts in analytics reports. Because this bot didn&#8217;t identify itself as a bot, some traffic reporting software reported the visits as coming from users.</li>
<li><strong>Added unrelated search terms to HTTP logs:</strong> The bot often crawled a page as a user clicking through a search result. The search queries the bot used were often not related to the site, which caused incorrect search query reports and perplexed many a webmaster.</li>
</ul>
<p>In addition, when webmasters began reporting these issues to Microsoft, they were unresponsive and, in fact, MSNDude posted on the Webmaster World forums that site owners who blocked this bot risked being removed from the Live Search index:</p>
<blockquote><p>&#8220;We would request that you do not actively block the IP addresses used by this quality check; blocking these IP addresses could prevent your site from being included in the Live Search index.&#8221;</p></blockquote>
<p>The blog post today says that these issues have been resolved. The bot will no longer download ads blocks, will no longer crawl sites enough to place noticeable traffic loads, and won&#8217;t use non-related search strings. Also, the Live Search team says it has created a <a href="http://forums.microsoft.com/webmaster/ShowForum.aspx?ForumID=1984&#038;SiteID=79">forum specifically to discuss crawler issues</a> and that webmasters can also use the <a href="https://feedback.live.com/default.aspx?productkey=livesearchwebmastercenter&#038;mkt=en-us">feedback form</a> and should receive a timely response.</p>
<p>I asked if sites that blocked this bot would indeed be removed from the Live Search index and was told that:</p>
<blockquote><p>&#8220;The tool we’re using to detect cloaking is just one of many inputs we use to determine if a site is spam. Blocking this bot does not necessarily mean that your site will be considered spam, but it is a possibility. We recommend any webmasters concerned should log into our Webmaster Tools (http://webmaster.live.com) and check to see if their site has been blocked. If so, they should submit a reinclusion request using the form.&#8221;</p></blockquote>
<p>In a separate release, Microsoft notes that their <a href="http://webmaster.live.com/">Webmaster Center</a> is now live and &#8220;provides all the necessary resources to optimize a Web site for achieving the highest possible algorithmic or “organic” listing on Live Search.&#8221;</p>
<p>When I talked to Nathan, he said that sites that are cloaking may continue to see some amount of traffic from this bot. This tool crawls sites throughout the web &#8212; both those that cloak and those that don&#8217;t &#8212; but those not found to be cloaking won&#8217;t continue to see traffic.</p>
<p>Live Search isn&#8217;t the only search engine to have processes that check for cloaking. Matt Cutts has said that Google checks <a href="http://www.mattcutts.com/blog/detecting-more-undetectable-webspam/">both algorithmically and manually</a>. (Although they have implemented a <a href="http://searchengineland.com/070921-082823.php">&#8220;first-click free&#8221; program</a>, which is a type of approved cloaking.)</p>
<p><a href="http://help.yahoo.com/l/us/yahoo/search/basics/basics-18.html">Yahoo&#8217;s guidelines</a> warn against &#8220;pages that give the search engine different content than what the end user sees (cloaking).&#8221; Search engines want the search results to accurately represent what users will see when they click through to the linked pages, and deceptive cloaking interferes with this goal.</p>
<p>More signals from a major search engine that they are both actively seeking out cloaking and acknowledging that it&#8217;s not always spam aren&#8217;t likely to <a href="http://searchengineland.com/070304-231603.php">settle the cloaking debate</a>. And one has to wonder how effective methods like this really are. Those savvy enough to cloak may be able to cloak for this new cloaker detection bot as well. I asked Microsoft if this tool has helped them achieve their spam reduction goals. They said that:</p>
<blockquote><p>&#8220;Like all search engines, Live Search has been vigilantly identifying and removing spam content from our index since the inception of our search engine. With the launch of the update this Fall, we have measured a significant decrease in the amount of spam.&#8221;</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/microsoft-reports-that-issues-with-their-live-search-cloaking-detection-system-have-been-fixed-12843/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google&#8217;s Matt Cutts On Cloaking &amp; Search Snippets</title>
		<link>http://searchengineland.com/googles-matt-cutts-on-cloaking-search-snippets-12784</link>
		<comments>http://searchengineland.com/googles-matt-cutts-on-cloaking-search-snippets-12784#comments</comments>
		<pubDate>Wed, 28 Nov 2007 12:35:49 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>
		<category><![CDATA[SEO: Spamming]]></category>
		<category><![CDATA[SEO: Titles & Descriptions]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/googles-matt-cutts-on-cloaking-search-snippets-12784.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogles-matt-cutts-on-cloaking-search-snippets-12784"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogles-matt-cutts-on-cloaking-search-snippets-12784" height="61" width="51" /></a></div><p>Matt Cutts of Google has written two posts on the topic of SEO:</p>
<ul>
<li><a href="http://www.mattcutts.com/blog/detecting-more-undetectable-webspam/">Detecting more “undetectable” webspam</a></li>
<li><a href="http://googlewebmastercentral.blogspot.com/2007/11/anatomy-of-search-result.html">The anatomy of a search result</a></li>
</ul>
<p><span id="more-12784"></span>
The first, <a href="http://www.mattcutts.com/blog/detecting-more-undetectable-webspam/">Detecting more “undetectable” webspam</a>, should trigger some heated comments at his blog.  It shows how he was able to find what was claimed to be &#8220;undetectable&#8221; cloaking and web spam.  Matt then issues his warning to webmasters:</p>
<blockquote><p>More generally, if someone is trying to manipulate Google by deceptive cloaking, it means that a webserver is returning different content to Googlebot than to users. That’s a condition that can be checked for by algorithms or manually, and such cloaking is certainly not “undetectable.” For cloaking to be completely “undetectable,” it would have to be like that Steven Wright joke: “Last night somebody broke into my apartment and replaced everything with exact duplicates.” And a cloaking script that gave users and Googlebot exactly duplicate pages would be a bit pointless. </p></blockquote>
<p>There are currently over thirty comments at Matt&#8217;s blog, I assume you will be able to find over a hundred by the end of today.</p>
<p>In Matt&#8217;s other post, <a href="http://googlewebmastercentral.blogspot.com/2007/11/anatomy-of-search-result.html">The anatomy of a search result</a>, he breaks down how the search result snippet works.  He describes in the video below each of the following components: the title, the description, the plus expansion box, the URL, how Google bolds keywords, the cache link, similar pages link, and, Google notes, Site Links and the more results link.  Here is the video:</p>
<p><object width="425" height="355"><param name="movie" value="http://www.youtube.com/v/vS1Mw1Adrk0&#038;rel=1&#038;border=0"></param><param name="wmode" value="transparent"></param><embed src="http://www.youtube.com/v/vS1Mw1Adrk0&#038;rel=1&#038;border=0" type="application/x-shockwave-flash" wmode="transparent" width="425" height="355"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/googles-matt-cutts-on-cloaking-search-snippets-12784/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>First Click Free: Accessing Subscription-Based Articles For Free Via Google News</title>
		<link>http://searchengineland.com/first-click-free-accessing-subscription-based-articles-for-free-via-google-news-12236</link>
		<comments>http://searchengineland.com/first-click-free-accessing-subscription-based-articles-for-free-via-google-news-12236#comments</comments>
		<pubDate>Fri, 21 Sep 2007 12:28:23 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: News]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/first-click-free-accessing-subscription-based-articles-for-free-via-google-news-12236.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Ffirst-click-free-accessing-subscription-based-articles-for-free-via-google-news-12236"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Ffirst-click-free-accessing-subscription-based-articles-for-free-via-google-news-12236" height="61" width="51" /></a></div><p>The Google News Blog <a href="http://googlenewsblog.blogspot.com/2007/09/first-click-free.html">wrote</a> about the <a href="http://www.google.com/support/news_pub/bin/answer.py?answer=40543&#038;topic=11707">First click free</a> program that allows publishers to let readers pass registration and subscription barriers, if they come from Google News.</p>
<p>I am sure you have seen this in action.  You see an article, you click on it from Google News and you are able to read the full article.  Then you want to see the article again an hour or so later, but the publisher asks you to login to see the full article.  That is the &#8220;first click free&#8221; program in action.</p>
<p><span id="more-12236"></span>
Sometimes publishers do not want even the first click to be free, in those cases, Google will label that news result with a &#8220;subscription&#8221; tag, so the user knows you need to be a subscriber to read the content.  Here is an example of what that may look like in Google news:</p>
<p><a href="http://www.flickr.com/photos/rustybrick/1417771430/" title="Photo Sharing"><img src="http://farm2.static.flickr.com/1165/1417771430_ad7f18f827.jpg" width="485" height="64" alt="Google News Subscription Tag" /></a></p>
<p>The &#8220;First click free&#8221; program isn&#8217;t new, just not that well known. Danny covered it in his <a href="http://searchengineland.com/070304-231603.php">YADAC: Yet Another Debate About Cloaking Happens Again</a> article.  I also wrote about it at the Search Engine Roundtable in <a href="http://www.seroundtable.com/archives/013561.html">How To Allow Google News To Index Your Subscription Only Source?</a></p>
<p>In addition, we had two Sphinn discussions on this topic.  The first was started by Danny and named <a href="http://sphinn.com/story/913">Got Registration? Cloaking Google News Is Now Officially OK</a> and the second was started by Sebastian and named <a href="http://sphinn.com/story/2044">Cloaking is alive and doing well in Google</a>.</p>
<p>Of course, if Google News allows this form of cloaking, why can&#8217;t others use it in Google web search?  Heck, we have <a href="http://searchengineland.com/070516-143312.php">Google Universal Search</a> now, so Google News can be directly embedded in the Web search results and those results can participate in the First click free program.</p>
<p>Somewhat related, <a href="http://blogsci.com/randoms/academic-publishers-as-spammers">Academic Publishers as Spammers</a> from BlogSci.com covers how through other program such as Google Scholar, cloaked pages are allowed into Google.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/first-click-free-accessing-subscription-based-articles-for-free-via-google-news-12236/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Search Illustrated: Black Hat Cloaking Explained</title>
		<link>http://searchengineland.com/search-illustrated-black-hat-cloaking-explained-11939</link>
		<comments>http://searchengineland.com/search-illustrated-black-hat-cloaking-explained-11939#comments</comments>
		<pubDate>Tue, 14 Aug 2007 13:31:36 +0000</pubDate>
		<dc:creator>Elliance</dc:creator>
				<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>
		<category><![CDATA[Search Illustrated]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/search-illustrated-black-hat-cloaking-explained-11939.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fsearch-illustrated-black-hat-cloaking-explained-11939"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fsearch-illustrated-black-hat-cloaking-explained-11939" height="61" width="51" /></a></div><p><a href="http://searchengineland.com/guides/search_illustrated.php">
</a> Have you ever clicked on a promising search result, only to find that the web page you land on has nothing to do with what you expected, or even worse, is a page of porn or other spam?</p>
<p>If so, you may have been the unwitting victim of black-hat cloaking. Today&#8217;s Search Illustrated shows how this bait-and-switch tactic is accomplished:</p>
<p><span id="more-11939"></span>
<img alt="black-hat-seo.jpg" src="http://searchengineland.com/images/black-hat-seo.jpg" width="500" height="544" /></p>
<p><i>Graphic by <a href="http://seo.elliance.com/">Elliance</a>, an eMarketing firm specializing in results-driven search engine marketing, web site design, and outbound eMarketing campaigns. The firm is the creator of the <a href="http://ennect.com">ennect</a> online marketing toolkit. The <a href="http://searchengineland.com/lands/search-illustrated.php">Search Illustrated</a> column appears Tuesdays at <a href="http://searchengineland.com">Search Engine Land</a>.</i></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/search-illustrated-black-hat-cloaking-explained-11939/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Website Optimizer Now Available, But Is It Cloaking?</title>
		<link>http://searchengineland.com/google-website-optimizer-now-available-but-is-it-cloaking-10898</link>
		<comments>http://searchengineland.com/google-website-optimizer-now-available-but-is-it-cloaking-10898#comments</comments>
		<pubDate>Wed, 04 Apr 2007 12:49:38 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Google: Website Optimizer]]></category>
		<category><![CDATA[SEM Tools]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>
		<category><![CDATA[SEO: Spamming]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/google-website-optimizer-now-available-but-is-it-cloaking-10898.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-website-optimizer-now-available-but-is-it-cloaking-10898"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-website-optimizer-now-available-but-is-it-cloaking-10898" height="61" width="51" /></a></div><p><a href="http://services.google.com/websiteoptimizer/">Google Website
Optimizer</a>, a tool that allows you to easily test different page layouts, is
<a href="http://adwords.blogspot.com/2007/04/website-optimizer-now-available-to-all.html">now available</a> to anyone with an AdWords account. Previously, it had been out in
limited beta since
<a href="http://adwords.blogspot.com/2006/10/beta-testers-needed-for-new-website.html">
last October</a>. It remains unclear whether using the tool would be considered
<a href="http://searchengineland.com/guides/seo_cloaking_doorway_pages.php">
cloaking</a>, which is against Google&#8217;s webmaster guidelines. The
<a href="http://searchengineland.com/070301-065358.php">Good Cloaking, Evil
Cloaking &amp; Detection</a> article from Stephan Spencer here on Search Engine Land
looked at this issue last month. Threadwatch has a
<a href="http://www.threadwatch.org/node/13640">recent discussion</a> also
going. I&#8217;ll explain more about the situation below, and I&#8217;m checking with Google
for the official word.</p>
<p>Fair to say, Google&#8217;s probably going to say that using the tool is NOT
cloaking. Otherwise, you&#8217;ve got one part of Google putting out a tool that will
cause anyone using it to potentially be banned by another part of Google. This
also means questions that have hung over other page testing tools such as
<a href="http://www.offermatica.com/">Offermatica</a> should go away.</p>
<p>It won&#8217;t make the <a href="http://searchengineland.com/070304-231603.php">
cloaking debate</a> necessarily easier, however. Consider the
<a href="http://www.seomoz.org/blog/if-your-page-ranks-well-youd-better-be-messing-with-it">
If Your Page Ranks Well, You&#8217;d Better Be Messing With It</a> article from Rand
Fishkin at SEOmoz last month. In it, Rand picks up on a
<a href="http://www.imediaconnection.com/content/13986.asp">challenge</a> that
Jamie Roche (CEO of Offermatica) makes to the traditional wisdom that if a page
is ranking well, you don&#8217;t want to alter it much.</p>
<p><span id="more-10898"></span></p>
<p>Rand argues the major factors controlling rank are mostly off-the-page
(importance of your domain, links to your pages) and thus not impacted by change
to the page itself. I&#8217;m largely with him. If a page is doing well, altering it
really shouldn&#8217;t have that much of an impact, unless you&#8217;re doing something
really drastic such as replacing all the text for images. And yet &#8212; I&#8217;d still
be cautious, despite knowing that&#8217;s not likely something to fear.</p>
<p>This leads back to Roche&#8217;s
<a href="http://www.imediaconnection.com/content/13986.asp">article</a>. Roche
is clearly reacting to clients that are concerned that his product, Offermatica,
might suggest changes that could cause them to lose rankings. So, there&#8217;s an
element of self-interest in his writing. But I still find many of his points
valid:</p>
<blockquote>
<p>Unfortunately, even if you don&#8217;t mess with your page, your rank could
change. That&#8217;s because the rules that the search engines use to rank sites
change as they discover newer, and presumably better, ways to rank results. A
page that ranked well one day might drop to the third or fourth page the next,
or get removed altogether. </p>
<p>The result is that we live in a state of fear about changing well-ranked
pages, while knowing that even if we don&#8217;t change them we could lose rank
anyway. </p>
<p>So when we realize that we want to change our natural search landing pages
because they don&#8217;t provide the best user experience, we wonder whether it
might make sense to experiment with changes despite our fears. </p>
</blockquote>
<p>He then also gets into three tactics, warning that potentially, they could
get you delisted or cause a rank drop:</p>
<ul>
<li><b>Test various pages, and keep showing spiders the original one during
testing.</b> Anyone not seeing what the spider sees is technically getting a
cloaked page. Yes, as he notes, that&#8217;s what Google Website Optimizer does &#8211;
so Google itself seems to be giving tacit approval to cloaking.<br />
&nbsp;</li>
<li><b>Change just some of the page.</b> He seems to think this is a bigger
risk. I think it&#8217;s minor. The main risk is that if you change some of the page
using JavaScript, potentially that could be seen as cloaking. But that&#8217;s part
of the page testing opt-out already covered above.<br />
&nbsp;</li>
<li><b>Target pages to second-time visitors.</b> Search engines don&#8217;t read
cookies. So if you cookie a first time visitor, you can then show them a
different page when they return, even if they go to the same original URL.
Search engines will continue to see the same page every time, since they won&#8217;t
have a cookie.</li>
</ul>
<p>Now here&#8217;s an even more complicated issue. Many who do good old-fashioned
cloaking have long argued that it doesn&#8217;t matter how they get ranking, as long
as they are relevant. So what if they use gibberish text or put out some
highly-optimized textual content? As long as they show the visitor a relevant
page, what&#8217;s the harm? Indeed, if you buy into the argument that what you do on
the page largely doesn&#8217;t matter &#8212; that it&#8217;s really down to domain authority and
links &#8212; then it becomes even harder to understand why cloaking is such an
issue.</p>
<p>Moreover, what prevents someone from eternally testing a page? If you gain a
ranking with some butt-ugly page, you might then keep feeding that to spiders
but constantly test it with a page testing tool. Potentially, that keeps you out
of the cloaking hot seat yet it does the same exact thing that cloaking does.</p>
<p>Overall, Google in particular has dodged its outdated guidelines on cloaking
time-and-time again over the past few years. My
<a href="http://searchengineland.com/070304-231603.php">YADAC: Yet Another
Debate About Cloaking Happens Again</a> article from last month provides the
background here. They can&#8217;t keep dodging it in the case of page testing tools
now, not when they themselves are offering one. So I&#8217;m pinging them for the
official word.</p>
<p>As for the tool, here&#8217;s more from the formal announcement:</p>
<blockquote>
<p>As part of our continued commitment to help advertisers make smart business
decisions, we are happy to announce that the Google Website Optimizer™
application is now available to AdWords advertisers worldwide. Google Website
Optimizer is designed to help website owners increase conversions such as
sales, sign-ups or downloads. This multivariate landing page optimization tool
enables marketers to test different ideas for web page content such as
different headlines, promotional copy, or images. The application provides
easy-to-read reports that enable advertisers to see which variation resonated
best with their site visitor. It is a self-service application that enables
website owners to set-up and run multivariate landing page experiments. </p>
<p>Google Website Optimizer is a beta application that is integrated with the
Google AdWords™ program and free to AdWords advertisers. Advertisers can sign
up immediately at www.google.com/websiteoptimizer. Over the coming weeks, the
Google Website Optimizer application will become available automatically in
all advertisers’ accounts. Website owners can now determine what content was
most effective as indicated by the highest conversion rate. </p>
<p>By giving website owners the tools they need to improve their website
content, Google is helping improve the user experience on the internet as a
whole. Since the beta launch of the application in October 2006, many
advertisers who used Google Website Optimizer have achieved major results. </p>
<p>&quot;Using Website Optimizer enables us to approach our website like a living
lab, where we can test and play and constantly figure out how to improve the
site,&quot; says Deborah Krier, Marketing Manager, Dale and Thomas Popcorn,
&quot;Website Optimizer is a powerful tool that allows us to understand our users
better, leading to increased conversions and increased business success.&quot; </p>
<p>In addition, we are announcing the formation of a new partner program,
Google Website Optimizer Authorized Consultants. As of today, Optimost,
EpikOne, Future Now, ROI Revolution, and SiteTuners.com have signed on as
charter members. </p>
<p>“We&#8217;re delighted that Google is now offering Website Optimizer. In the
past, not everyone had the tools to test regularly,” said Bryan Eisenberg,
co-founder, Future Now, Inc. and author of the New York Times bestseller Call
to Action, “Google Analytics had a major effect on the accessibility of data
and on how website owners valued analytics; Website Optimizer will take the
benefits of testing to a much broader audience and help them increase online
conversion rates.” </p>
<p>For more information or to sign up to use Website Optimizer, please visit:
www.google.com/websiteoptimizer</p>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-website-optimizer-now-available-but-is-it-cloaking-10898/feed</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>YADAC: Yet Another Debate About Cloaking Happens Again</title>
		<link>http://searchengineland.com/yadac-yet-another-debate-about-cloaking-happens-again-10654</link>
		<comments>http://searchengineland.com/yadac-yet-another-debate-about-cloaking-happens-again-10654#comments</comments>
		<pubDate>Mon, 05 Mar 2007 03:16:03 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/yadac-yet-another-debate-about-cloaking-happens-again-10654.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fyadac-yet-another-debate-about-cloaking-happens-again-10654"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fyadac-yet-another-debate-about-cloaking-happens-again-10654" height="61" width="51" /></a></div><p>Sigh. Double sigh. Triple sigh. I guess now that the SEO industry has had the
required twice-yearly <a href="http://searchengineland.com/070208-110711.php">
debate about the reputation of SEO</a>, it&#8217;s time to do the go round about
cloaking once again.
<a href="http://www.mattcutts.com/blog/a-quick-word-about-cloaking/">A quick
word about cloaking</a> has Google&#8217;s Matt Cutts trying to clarify concerns that
Philipp Lenssen of <a href="http://blog.outer-court.com/">Google Blogoscoped</a>
<a href="http://blog.outer-court.com/archive/2007-03-05-n40.html">has been raising</a> about <a href="http://www.webmasterworld.com/">WebmasterWorld</a>.
The comments are now up over 100, as people rehash things that have been hashed,
mashed, rebaked so many times before. Below, some cloaking history plus an
honest plea about trying to get past this stupid, stupid issue.</p>
<p><span id="more-10654"></span></p>
<p><b>Definition Time</b></p>
<p>Let&#8217;s do the definition, first:</p>
<blockquote>
<p>Cloaking is when you show a search engine content that is different than what
a human being sees.</p>
</blockquote>
<p>Got it? That&#8217;s my definition, and Matt says virtually the same thing in his
post today:</p>
<blockquote>
<p>Cloaking is serving different content to users than to search engines.</p>
</blockquote>
<p>So simple. What&#8217;s to debate? Well, is it cloaking if&#8230;</p>
<ul>
<li>A spider coming from a US IP address sees a different page than a user
from a UK IP address?</li>
<li>A spider sees content that a user sees, but only if they do free
registration</li>
<li>A spider sees content that a user sees, but only if they do paid
registration</li>
<li>A spider sees content in text that represents what a users sees in Flash</li>
<li>A spider sees content that&#8217;s slightly different than what a user sees when
their browser renders Javascript</li>
</ul>
<p>Pick one of those above &#8212; pick something else (see our
<a href="http://searchengineland.com/070301-065358.php">Good Cloaking, Evil
Cloaking &amp; Detection</a> column from last week) &#8212; and people can, will and have
pointed at something someone is doing, then yelled &quot;cloaking&quot; and screamed for a ban
to happen. A ban? Well, as you know, all search engines hate cloaking. Actually,
that&#8217;s always been a confused point. Here starts the lesson.</p>
<p><b>History Time: Tactics Versus Intent</b></p>
<p>Back in January 2003, Alan Perkins wrote this big
<a onclick="s_objectID=&quot;http://www.highrankings.com/issue041.htm_1&quot;;return this.s_oc?this.s_oc(e):true" href="http://www.highrankings.com/issue041.htm">
Cloaking Is Always A Bad Idea</a> article, telling us that search engines always
said cloaking was bad. I was never a proponent of cloaking. I was, however, well
aware that NOT all the guidelines were against cloaking. In addition, with paid
inclusion, I argued some cloaking was actually allowed. All this went into my
<a href="http://searchenginewatch.com/showPage.html?page=2165231">Ending The
Debate Over Cloaking</a> that came out in reaction to Alan&#8217;s article, in
February 2003.</p>
<p>Since I knew that all the search engines had allowed some types of cloaking,
my advice to marketers was this, with the stress on avoiding &quot;unapproved
cloaking:&quot;</p>
<blockquote>
<p>Cloaking is getting a search engine to record content for a URL that is
different than what a searcher will ultimately see, often intentionally. It can
be done in many technical ways. Several search engines have explicit bans
against unapproved cloaking, of which Google is the most notable one. Some
people cloak without approval and never have problems. Some even may cloak
accidentally. However, if you cloak intentionally without approval &#8212; and if you
deliver content to a search engine that is substantially different from what a
search engine records &#8212; then you stand a much larger chance of being penalized
by search engines with penalties against unapproved cloaking. If in doubt, ask
the search engine if it has a problem with what you intend to do, assuming you
can&#8217;t get a clear answer from written guidelines that are provided. If you are
working with a third-party search engine marketer, ask them for proof that what
they intend to do is approved. Otherwise, be prepared for any adverse
consequences.</p>
</blockquote>
<p>The suggestion to avoid &quot;unapproved cloaking&quot;
<a href="http://www.ihelpyou.com/forums/showthread.php?s=&#038;threadid=6798">
infuriated</a> Doug Heil over at the iHelpYou forums, who could not (and to this
day still cannot) get over the idea that cloaking MUST equal spamming.</p>
<p>My response back then remains the same today. <b>There&#8217;s a difference between
tactics and intent.</b> Many of the things that might cause penalties with
search engines are tactics (hidden text, gibberish pages, cloaking) that are
closely aligned with the intent of trying to mislead or game the search
algorithms. But in some cases, what&#8217;s a bad tactic (or technical implication)
might have a good intent as agreed by the search engines. So they&#8217;ll allow it,
either turning a blind eye to it or giving it some official endorsement.</p>
<p>That difference is important because back then, if the search engines got
behind the &quot;unauthorized&quot; versus &quot;authorized&quot; suggestion, we wouldn&#8217;t be having
today&#8217;s wasteful argument. But let&#8217;s carry on.</p>
<p><b>NPR, Google Scholar &amp; Approved Cloaking</b></p>
<p>In May 2004, I
<a href="http://searchenginewatch.com/showPage.html?page=3360681">looked at</a>
how National Public Radio was, in my view, cloaking text transcripts of audio to
search engines but only letting human visitors by those. At the time, Google had
a guideline against cloaking that read:</p>
<blockquote>
<p>The term &quot;cloaking&quot; is used to describe a website that returns altered
webpages to search engines crawling the site. In other words, the webserver is
programmed to return different content to Google than it returns to regular
users, usually in an attempt to distort search engine rankings. This can mislead
users about what they&#8217;ll find when they click on a search result. To preserve
the accuracy and quality of our search results, Google may permanently ban from
our index any sites or site authors that engage in cloaking to distort their
search rankings.</p>
</blockquote>
<p>I argued that this was an example of &quot;good cloaking&quot; and that the real issue
I had with it was that other marketers were supposedly banned from doing it:</p>
<blockquote>
<p>As a searcher, I&#8217;m actually glad the method is being used. It does mean I&#8217;m
more likely to find audio content of interest. Moreover, I can listen to that
for free via the NPR site.</p>
<p>As a search engine marketer, I&#8217;m not so thrilled. I&#8217;m well aware that many
other companies would like the ability to feed Google content in this manner.
In addition, they have just as compelling arguments as NPR about having good
content that isn&#8217;t adequately indexed by the Google crawler. Unfortunately,
they&#8217;re denied the privilege of feeding relevant material just to Google&#8217;s
crawler.</p>
<p>What about Yahoo? Anyone can enjoy the same benefits that NPR has, the
ability to cloak content when relevant, through Yahoo&#8217;s content acquisition
program. Non-profit organizations are offered this for free. Commercial
organizations have to pay, making use of Yahoo&#8217;s trusted feed program.</p>
</blockquote>
<p>By November 2004, I was writing about cloaking again. Now Google had an
officially approved program that, in my view, allowed cloaking. This was
<a href="http://scholar.google.com/">Google Scholar</a>. As I
<a href="http://searchenginewatch.com/showPage.html?page=3437471">wrote</a>:</p>
<blockquote>
<p>This system may lead to problems for some searchers. In the example above,
not only could I NOT read the paper, as I didn&#8217;t have a subscription, but I
also could not read even an abstract. Instead, a password-prompt continued to
appear, even when I cancelled it, making it extremely difficult to finally
close the window (and that&#8217;s why I haven&#8217;t linked to the actual paper, to save
other people the problem).</p>
<p>This situation is probably unusual, however. One of Google&#8217;s requirements
for inclusion in Google Scholar is that publishers at least show abstracts to
searchers.</p>
<p>The special access for publishers flies in the face of Google&#8217;s
anti-cloaking policy. Google is being shown material that regular users
wouldn&#8217;t normally see, its own definition of cloaking. This is a GOOD thing
for searchers, but the company needs to amend its cloaking policy so as not to
be hypocritical.</p>
<p>Indeed, that&#8217;s long overdue. This has been a problem since I first reported
about a similar issue earlier this year. A sidebar piece &#8230; looks at the
latest case and suggests some fixes for Google, including finally moving
forward with formalizing such programs for ALL publishers.</p>
</blockquote>
<p><b>Plea For Google To Change The Cloaking Guidelines</b></p>
<p>In the sidebar to that article, and
<a href="http://blog.searchenginewatch.com/blog/041201-063855">again</a> in a
follow-up piece a few days later, I urged Google to alter its cloaking policy to
something stressing that cloaking was bad only if not approved:</p>
<blockquote>
<p>The term &quot;cloaking&quot; is used to describe a website that returns altered
webpages to search engines crawling the site <b>without permission</b>. In
other words, the webserver is programmed to return different content to Google
than it returns to regular users, usually in an attempt to distort search
engine rankings. This can mislead users about what they&#8217;ll find when they
click on a search result. To preserve the accuracy and quality of our search
results, Google may permanently ban from our index any sites or site authors
that engage in cloaking <b>without our permission, if we feel it is harmful to
our search rankings</b>. </p>
</blockquote>
<p><b>BMW, WebmasterWorld &amp; New York Times Cloaking Accusations</b></p>
<p>In March 2005, there was great amusement in some quarters when Google&#8217;s
policy against cloaking caused it to ban itself when pages apparently with text
designed to help internal Google searching made also onto external versions seen
only by Google spiders, rather than humans (see
<a href="http://blog.searchenginewatch.com/blog/050309-092708">here</a>,
<a href="http://www.threadwatch.org/node/1800#comment">here</a> and
<a href="http://www.webmasterworld.com/forum30/28470.htm">here</a> for more).</p>
<p>By the end of 2005, WebmasterWorld came under
<a href="http://blog.searchenginewatch.com/blog/051222-113443">accusations</a>
of cloaking. Since it is one of the most important forums about search engines
around &#8212; frequented by official Google reps &#8212; it sort of became a poster child
of &quot;why can they do it but others can&#8217;t&quot; for some.</p>
<p>Far bigger news came in 2006. In February, BMW got
<a href="http://www.mattcutts.com/blog/ramping-up-on-international-webspam/">
banned</a> on Google for using hidden text &#8212; in particular, a &quot;poor-man&#8217;s&quot;
version of cloaking that used JavaScript to show different content to users. It
got back in a few days later. Then in June, an
<a href="http://searchenginewatch.com/showPage.html?page=3613561">article</a>
about how the New York Times optimizes content for search engines sparked a new
cloaking debate when it seemed that the major search engines were allowing
cloaked content. In particular, the New York Times was allowing search spiders
to read content that was only accessible to humans if you registered for free
or, in some cases, paid for access.</p>
<p><a href="http://forums.searchenginewatch.com/showthread.php?t=12191">Marshall
Simmonds, the NYTimes &amp; Acceptable Cloaking</a> was a giant discussion that came
out of that article, which had me originally arguing that this was cloaking,
since search spiders were seeing something different than most humans could see.
But I was convinced to change my mind. Since the spiders were indeed shown what
anyone could ultimately see, this wasn&#8217;t cloaking. I commented:</p>
<blockquote>
<p>Until now, I would have considered feeding a search engine a page that people
couldn&#8217;t see unless they registered to have been unapproved cloaking, since most
users were seeing something different that then spider saw.</p>
<p>But sure, I&#8217;ll buy into the &quot;eventually you&#8217;ll see the same thing&quot; argument as
this not being cloaking. Why not? Google&#8217;s allowed this in approved cases for
about two years now and never wants to go on the record as this being approved
cloaking. So don&#8217;t call it cloaking and everyone&#8217;s happy. Google&#8217;s not allowing
something officially they say not to do, and content owners can do this without
fear. </p>
<p>In fact, I expect Fantomaster, Beyond Engineering and anyone with IP delivery
lists now can have some new-found respect from people who previously slammed
them as helping cloakers. Here they&#8217;ve been saying its all about content
delivery systems and now they&#8217;re right, at least in some situations. Because
after all, some people aren&#8217;t going to want to depend on user agent delivery to
feed content this way.</p>
<p>Of course, I&#8217;m still not going to do this yet with my own members-only content.
Despite agreeing with you, Phil &#8212; despite seeing others allowed to do this &#8211;
I&#8217;m still fearful Google might arbitrarily decide to call it cloaking anyway
when they choose. But maybe I&#8217;ll be braver down the line, and why not? It&#8217;s like
a whole new world.</p>
</blockquote>
<p>I did have several off-the-record conversations with Google about this. The
main thing that came out that I can report was that Google really felt most
users should see what their spiders saw WITHOUT having to register or pay for
access. That&#8217;s similar to what Matt wrote about WebmasterWorld content today,
when covering the latest criticisms that it might be cloaked:</p>
<blockquote>
<p>I consider the issue in a much better state now, in that most (all?) Google
searchers get the identical page to what Googlebot saw. But I still consider
Philipp’s February posts open for investigation, and I will get to them, in the
same way that I tackled Philipp’s first two posts about this.</p>
</blockquote>
<p>FYI, to understand a bit more about the WebmasterWorld situation, see Matt&#8217;s
post from March 2006,
<a href="http://www.mattcutts.com/blog/how-to-sign-up-for-webmasterworld/">How
to sign up for WebmasterWorld</a>. It explains how in some cases, trying to
access a thread that you might come across from a search result can trigger a
registration results. In today&#8217;s post at Matt&#8217;s, WebmasterWorld&#8217;s Brett Tabke
explains and <a href="http://www.webmasterworld.com/webmasterworld/3270248.htm">
points at</a> more information on how this has changed.</p>
<p>The comments in Matt&#8217;s post also get into an apparent return of some New York
Times content requiring registration, as well as more complaints about Google
Scholar content (added with the full cooperation of Google) also being annoying,
in that you can&#8217;t read it without paying when clicking from search results.</p>
<p><b>Solution 1: Allow Registered Content</b></p>
<p>Enough history. Time for some solutions. Both Google and Yahoo have programs
that allow people with free registration or fee-based content to show up in
search results &#8212; and I mean mixed in with regular search results, not
segregated out like the <a href="http://search.yahoo.com/subscriptions">Yahoo
Subscriptions</a> product launched in June 2005. Yahoo&#8217;s will happen mainly
through paid inclusion, and not that much. Google&#8217;s will happen primarily
through the aforementioned Google Scholar plus the
<a href="http://www.google.com/support/news_pub/bin/answer.py?answer=40543">
First Click Free</a> system used for Google News but
<a href="http://www.betanews.com/article/Google_Indexing_Subscription_Content/1120164520">
which also</a> may happen with some web search content.</p>
<p>None of this content is labeled in any way. Back to that discussion on
cloaking and the New York Times, I commented:</p>
<blockquote>
<p>It is annoying to hit one of these links and not know that payment or
registration is required, however. That problem&#8217;s going to get worse as more
and more people decide this isn&#8217;t cloaking and give it a go. Google and the
others should look to establish a way for site owners to better flag premium
or registration only content&#8230;.</p>
<p>I don&#8217;t agree having paid content in regular search results is bad. I have
a Wall Street Journal paid subscription. They have lots of great content. If
I&#8217;m doing a search, and they&#8217;ve got a good match, I want to know that. </p>
<p>And over at Yahoo, as I explained way way back above, I can do that. I can
choose specifically to have this content revealed to me. It doesn&#8217;t make my
results bad at all. </p>
<p>It is a bad user experience if you constantly get back results that you
can&#8217;t actually view, of course, without paying. We simply aren&#8217;t going to
subscribe to everything. </p>
<p>The solution is easy. Give users the option. Let me choose to see content
that requires payment or not. Or similarly, let me choose to see content that
might require free registration. We just need Google and the others to
graduate from 1999 mentality and better accommodate web sites with this type
of content. It&#8217;s easily done, if they want to do it. And getting more
formalized program for publishers, as well as options for searchers, will
help. </p>
</blockquote>
<p>So enough already. Enough with the special programs that only some publishers
get to do. I want Google &#8212; which leads the charge in scaring people about
cloaking &#8212; to fast track a system to let anyone with registration-based or
fee-based content to be in their search results.</p>
<p>As for usability, either flag the URLs so users know to expect a charge or
registration request or make it possible for users to exclude this information.
Or experiment with both. But do something so that publishers don&#8217;t take matters
into their own hands and the SEO industry has to have yet another one of these
debates over whether it&#8217;s cloaking.</p>
<p><b>Solution 2: Allow For Approved Cloaking</b></p>
<p>Remember that suggested revision for Google&#8217;s cloaking policy I gave above?
Well sometime in 2006, Google dropped its definition of cloaking entirely.
Instead, we were just left with shorter definition of cloaking
<a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35769">
here</a>:</p>
<blockquote>
<p>Make pages for users, not for search engines. Don&#8217;t deceive your users or
present different content to search engines than you display to users, which is
commonly referred to as &quot;cloaking.&quot;</p>
</blockquote>
<p>And a warning against it
<a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40052">
here</a>:</p>
<blockquote>
<p>However, certain actions such as cloaking, writing text in such a way that it
can be seen by search engines but not by users, or setting up pages/links with
the sole purpose of fooling search engines may result in removal from our index.</p>
</blockquote>
<p>That warning, like the old warning, uses the word &quot;may&quot; in terms of removal
for cloaking. Will cloaking get you banned? Maybe. Maybe if it is noticed. Maybe
if it is deemed harmful to the users. Maybe if after a closer look, it can&#8217;t be
pigeonholed in some other definition.</p>
<p>For me, it would be clearer to go back to the old definition and stress that
unless approved, cloaking might result in a ban. Roll that out along with a
program making it easier for people to feed in registered content. That gives
Google flexibility, helps publishers plus stops this insane focus on
technical/tactical implementations and refocuses concern where it belongs &#8212; the
user experience.</p>
<p>Did what you do help or harm the search results, in Google&#8217;s opinion? If you
were harming search results, Google&#8217;s always reserved the right to boot you out.
And if you were technically violating guidelines but not in a harmful way,
Google&#8217;s always reserved the right to turn a blind eye. Or rather, an approving
eye, an eye knowing that it&#8217;s intent that matters, not some technicality.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/yadac-yet-another-debate-about-cloaking-happens-again-10654/feed</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
	</channel>
</rss>
