<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>searchengineland.com &#187; Vanessa Fox</title>
	<atom:link href="http://searchengineland.com/author/vanessa-fox/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: Must Read News About Search Marketing &#38; Search Engines</description>
	<lastBuildDate>Mon, 23 Nov 2009 12:00:24 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Google&#8217;s Additional Discovery Method: RSS and Atom Feeds</title>
		<link>http://searchengineland.com/googles-additional-discovery-method-rss-and-atom-feeds-28828</link>
		<comments>http://searchengineland.com/googles-additional-discovery-method-rss-and-atom-feeds-28828#comments</comments>
		<pubDate>Fri, 30 Oct 2009 06:24:24 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=28828</guid>
		<description><![CDATA[For years, Google&#8217;s discovery of web pages was solely based on links. If a page had no links to it, Googlebot had no way of knowing about it and therefore, would never index it. Along the way, Google provided an option for submitting individual pages, but that wasn&#8217;t really a viable option for site owners [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogles-additional-discovery-method-rss-and-atom-feeds-28828"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogles-additional-discovery-method-rss-and-atom-feeds-28828" height="61" width="51" /></a></div><p>For years, Google&#8217;s discovery of web pages was solely based on links. If a page had no links to it, Googlebot had no way of knowing about it and therefore, would never index it. Along the way, Google provided an option for <a href="http://www.google.com/addurl/">submitting individual pages</a>, but that wasn&#8217;t really a viable option for site owners with large sites. In 2005, Google launched<a href="http://googleblog.blogspot.com/2005/06/webmaster-friendly.html"> XML Sitemaps</a>, which was a much more scalable way for site owners to let Google know about pages of their site that Googlebot may not otherwise discover through links. Today, a Google Webmaster Central blog post discusses <a href="http://googlewebmastercentral.blogspot.com/2009/10/using-rssatom-feeds-to-discover-new.html">another way Googlebot may discover pages</a>: feeds. They say that using RSS and Atom feeds to discover pages helps them learn about new content quickly.</p>
<p>New content is key for Google since freshness is a vital component of relevance for some queries. Convention wisdom is that it&#8217;s not all that useful to ensure Google knows about pages of your site if they don&#8217;t have links to them, because without links, Google won&#8217;t see them as valuable. But current ranking is much more complicated than the original PageRank formula describes. And new content with no links may very well trump content with an abundance of links if it makes sense for the query.</p>
<p>Of course, site owners have always been able to to <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=156184">submit RSS and Atom feeds as Sitemaps</a>, but this post describes using these feeds even if the site owner hasn&#8217;t submitted them via the Sitemap system. Instead, they are scanning other feed submission systems, such as Google Reader and ping services for the feeds.</p>
<p>It&#8217;s unclear from the post if the feeds are being used solely for discovery or if the content from the feeds are being used in place of crawling as well. The title of the post references &#8220;discovery&#8221; but the post itself notes that they are able to &#8220;get these new pages into our index more quickly than traditional crawling methods&#8221; and to directly crawl feeds. If Google is using the feeds in place of crawling, this would be another argument in favor of full rather than partial feeds &#8212; you&#8217;d get more of a page&#8217;s content indexed more quickly. Google Blogsearch initially crawled feed content rather than the actual pages, which led to partial indexing in Blogsearch, <a href="http://www.ninebyblue.com/blog/google-blog-search-changes-how-it-indexes-posts/">but this changed</a> late last year.</p>
<p>The post notes that in order for Google to use a feed as a discovery method, the feed must not be blocked by robots.txt.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/googles-additional-discovery-method-rss-and-atom-feeds-28828/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>See What Googlebot Sees On Your Site</title>
		<link>http://searchengineland.com/see-what-googlebot-sees-on-your-site-27623</link>
		<comments>http://searchengineland.com/see-what-googlebot-sees-on-your-site-27623#comments</comments>
		<pubDate>Tue, 13 Oct 2009 20:01:18 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Google: Webmaster Central]]></category>
		<category><![CDATA[SEO: Cloaking & Doorway Pages]]></category>
		<category><![CDATA[SEO: Redirects & Moving Sites]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=27623</guid>
		<description><![CDATA[Google Webmaster Tools has just launched a &#8220;labs&#8221; section, where you&#8217;ll find new features that may be early in the development cycle and not quite as robust as the rest of the tools. The features available so far are Fetch as Googlebot, which lets you see exactly what Googlebot is served when it requests a [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fsee-what-googlebot-sees-on-your-site-27623"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fsee-what-googlebot-sees-on-your-site-27623" height="61" width="51" /></a></div><p>Google Webmaster Tools has <a href="http://googlewebmastercentral.blogspot.com/2009/10/fetch-as-googlebot-and-malware-details.html">just launched a &#8220;labs&#8221; section</a>, where you&#8217;ll find new features that may be early in the development cycle and not quite as robust as the rest of the tools. The features available so far are <em>Fetch as Googlebot</em>, which lets you see exactly what Googlebot is served when it requests a URL from your server and <em>Malware Details</em>, which shows you malicious code snippets from your site if it&#8217;s been flagged as containing malware.</p>
<p><strong>Fetch as Googlebot</strong></p>
<p>Of most interest to webmasters, SEOs, and web developers is likely the Fetch as Googlebot feature. You can specify any URL on your site and see the HTTP response (header and contents) that the server returns. Simply  indicate the URL and click the Fetch button. It may take a few moments for Googlebot to access the page and return the results, since it fetches the page in real time. (Refresh the page to see the progress.)</p>
<p><a title="Google Fetch as Googlebot by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/4009489298/"><img src="http://farm3.static.flickr.com/2434/4009489298_f9879b18af.jpg" alt="Google Fetch as Googlebot" width="500" height="229" /></a></p>
<p>Click the Success link once it&#8217;s been processed to see the results.</p>
<p><a title="Google Fetch As Googlebot Results by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/4008724331/"><img src="http://farm3.static.flickr.com/2467/4008724331_bf6ee1260c.jpg" alt="Google Fetch As Googlebot Results" width="500" height="372" /></a></p>
<p>How is this different from simply looking at the source code of the page?</p>
<ul>
<li>You see the HTTP header information at the top. This information is generally easily available through tools such as Live HTTP Headers, but isn&#8217;t contained in the source code itself (since that information is coming from the server, not the page).</li>
<li>You can see if the server is returning any of the page information differently than the page has been coded.</li>
<li>You can see if the server is returning something different to Googlebot than what other users see. This tool uses the same user-agent and IP range as Googlebot when it crawls the web, so if the server is configured conditionally for user agent or IP address (typically known as &#8220;<a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=66355">cloaking</a>&#8220;), you&#8217;ll see  what&#8217;s being conditionally served to Google.</li>
<li>You can use the tool to test changes (particularly things like redirects) in real-time.</li>
</ul>
<p>Note that this tool won&#8217;t necessarily show you the content that Google is able to extract from the page. If the page contains JavaScript, for instance, you&#8217;ll see the raw JavaScript code contained on the page, not the rendered view visible in the browser. Which, unfortunately means you can&#8217;t use this tool to determine if Google is able to access content contained in rich markup.</p>
<p><strong>What&#8217;s this about cloaking?</strong></p>
<p>This tool can help you determine if the pages are being cloaked to Google. This may be useful if you&#8217;re coming into a project late and aren&#8217;t sure what&#8217;s been previously done. It can also help uncover if your site has been hacked. Back in 2006, <a href="http://blog.sitepronews.com/index.php?/archives/23-Matt-Cutts-on-Good-Karma-Domain-Hijacking-as-a-Blackhat-Technique.html">Googler Matt Cutts and I did a show on Webmaster Radio</a> during which we talked about how in some cases, a hacker might add links to a site and then cloak those pages so that the site owner never sees them. Only Google does. At the time, Matt suggested <a href="http://blog.sitepronews.com/index.php?/archives/25-Matt-Cutts-Response-to-Good-Karma-Questions.html">using Google Translate</a> (and choosing English to English) to see what Googlebot was being served, but this tool can now more easily serve that purpose. Matt confirmed this to me this morning: &#8220;The biggest use case is just debugging site issues. Of those, the biggest case will be hacked sites. Some attacks will hide content until search engines fetch the page (and some attackers add a noarchive tag so that the search result doesn&#8217;t have a &#8220;Cached&#8221; link), so a site could look clean to the website owner. Using this feature will site owners verify that there are no hidden links in the page that Google actually fetches.&#8221;</p>
<p><strong>How do I test redirects?</strong></p>
<p>If you&#8217;ve implemented redirects, you can use this tool to test how Googlebot will interpret those redirects without waiting for those pages to be crawled. For instance, when I fetch www.searchengineland.com, I see that the redirect is correctly implemented as a 301 and points to searchengineland.com:</p>
<p><a title="Google Fetch as Googlebot by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/4009489502/"><img src="http://farm3.static.flickr.com/2555/4009489502_1ccef8d5ae_o.jpg" alt="Google Fetch as Googlebot" width="371" height="300" /></a></p>
<p>You can also use the tool to troubleshoot URLs listed in the Crawl Errors &gt; Not Followed report. You can also test these URLs using something like Live HTTP Headers or by trying to access the URLs in a browser, but if neither of those methods uncover the problem, this tool can help determine that the issue is specific to Googlebot. You can also use this tool to verify that fixes you&#8217;ve made to redirect errors uncovered by the Not Followed report have really solved the problem.</p>
<p>(Note that the tool currently has a limit of 100kb per page. However, this is for the tool only and doesn&#8217;t apply to Googlebot&#8217;s normal crawl of the site. Google is monitoring feedback to see if many site owners find this size to be limiting.)</p>
<p><strong>Malware details</strong></p>
<p>The Google Online Security Blog has more information on the <a href="http://googleonlinesecurity.blogspot.com/2009/10/show-me-malware.html">malware details tool</a>. Previously, webmaster tools reported when the site was flagged has having malware and listed sample URLs. This new tool will also show samples of the malicious content, and in some cases, the underlying cause. This should help those site owners whose sites have been hacked to include malware find the problem and fix it. If your site does contain malware and you&#8217;ve fixed it, you can<a href="http://googlewebmastercentral.blogspot.com/2008/10/malware-we-dont-need-no-stinking.html"> request a review</a> to have the malware alert removed in search results.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/see-what-googlebot-sees-on-your-site-27623/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>All Of Your Technical SEO Questions Answered And Bonus Free Developer Summit: Technical SMX East Dream Agenda</title>
		<link>http://searchengineland.com/all-of-your-technical-seo-questions-answered-and-bonus-free-developer-summit-technical-smx-east-dream-agenda-26539</link>
		<comments>http://searchengineland.com/all-of-your-technical-seo-questions-answered-and-bonus-free-developer-summit-technical-smx-east-dream-agenda-26539#comments</comments>
		<pubDate>Fri, 25 Sep 2009 19:11:50 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[SEO - Search Engine Optimization]]></category>
		<category><![CDATA[SMX & SMN Alerts]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=26539</guid>
		<description><![CDATA[Search engine optimization involves a lot of moving parts. Audience analysis, content optimization, rankings factors &#8212; it can be hard to pinpoint concrete, actionable data. But technical SEO is a little different. It&#8217;s certainly not any easier than the other components, but at least the data can be a bit more straightforward. You can check [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fall-of-your-technical-seo-questions-answered-and-bonus-free-developer-summit-technical-smx-east-dream-agenda-26539"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fall-of-your-technical-seo-questions-answered-and-bonus-free-developer-summit-technical-smx-east-dream-agenda-26539" height="61" width="51" /></a></div><p>Search engine optimization involves a lot of moving parts. Audience analysis, content optimization, rankings factors &#8212; it can be hard to pinpoint concrete, actionable data. But technical SEO is a little different. It&#8217;s certainly not any easier than the other components, but at least the data can be a bit more straightforward. You can check your server logs to see if the search engine bots haven&#8217;t been crawling of your pages (or have been crawling lots of duplicate ones with various URLs); you can generally determine if search engines are having trouble extracting the content on your pages (ever see &#8220;loading, loading, loading&#8221; as a snippet?); and while the search engines don&#8217;t come right out and tell you when they&#8217;ve tweaked a ranking factor, they do generally announce technical changes (&#8221;we can <a href="http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299">crawl Flash</a>&#8220;; &#8220;we are <a href="http://searchengineland.com/google-io-new-advances-in-the-searchability-of-javascript-and-flash-but-is-it-enough-19881">executing onClick JavaScript events</a>&#8220;).</p>
<p>And technical SEO is clearly of vital importance. If search engines can&#8217;t crawl your pages and extract your content, your site will never be well-ranked for the queries you care about, even if you have the most awesome content on the web and more backlinks than the Adobe Reader download page.</p>
<p>Historically, search engines have had a tough time with newer web technologies. They were built as text-based engines that searched HTML pages, so it&#8217;s been taking them some time to catch up with things like Flash and AJAX and video. But the times, they are a changin&#8217;. The last couple of years have seen some great improvements, particularly from Google. The new <a href="http://searchengineland.com/google-lets-you-tell-them-which-url-parameters-to-ignore-25925">parameter handling feature of Google webmaster tools</a> to help with canonicalization issues due to URL parameters is only the latest example in a long line of advances in this area.</p>
<p><strong>Get the latest solid information </strong></p>
<p>Search Marketing Expo East, coming up in just a few weeks in New York, provides (among many other topics) in-depth, actionable information about how to construct search friendly web architecture, diagnose technical issues, and be prepared for the next evolution in search engine handling of web technologies. I&#8217;ve programmed a number of sessions that I&#8217;ll be moderating and speaking at, along with some fantastic technical experts, and have put together a suggested itinerary for the whole show for those interested in the more technical side of SEO.</p>
<p><strong>Stay another day for even more technical discussions at Google NY</strong></p>
<p>In addition, I&#8217;m organizing a post-SMX technical day of follow up discussion, information, and site review time to really dig into what technical issues you are experiencing and help you formulate real solutions. This <a href="http://janeandrobot.com/developer-summit/2009-nyc-smx">developer summit</a>, as we&#8217;re calling it, is co-hosted by <a href="http://janeandrobot.com/">Jane and Robot</a> and Google and will be held at the Google New York office on Thursday, October 8th, the day after SMX East. It&#8217;s absolutely free and SMX attendees get preference on invitations.</p>
<p>See the <a href="http://searchmarketingexpo.com/east/2009/technical-developers-itinerary">SMX East developer&#8217;s itinerary</a> for complete information about what&#8217;s available for web developers and anyone else interested in the technical side of SEO and see the <a href="http://janeandrobot.com/developer-summit/2009-nyc-smx">Jane and Robot developer summit page</a> for more details on the agenda and information on how to register for your invitation. The technical highlights of SMX East are included below. I hope to see you there!</p>
<p><strong>Day 1 – October 5</strong></p>
<p><a href="http://searchmarketingexpo.com/east/2009/full_agenda#253">SMX Boot Camp</a>: For web developers who are new to search and SEO, we recommend that you go through ALL the SMX Boot Camp sessions on the first day to get a solid foundation of how search works. If you’re more experienced, then…</p>
<p><a title="Duplicate Content Issues: The Search Engine Edition" href="http://searchmarketingexpo.com/east/2009/full_agenda#252"> Duplicate Content Issues: The Search Engine Edition</a> – What’s the best way to deal with duplicate content issues? In this session, search engine representatives offer direct advice including tips on the new canonical tag.</p>
<p><a title="Ranking Tactics For Local Search" href="http://searchmarketingexpo.com/east/2009/full_agenda#255"> Ranking Tactics For Local Search</a> – This session looks at cutting edge tactics that work to drive traffic from local listings, plus highlights research to predict where things may be going.</p>
<p><a title="Maps, Maps, Maps!" href="http://searchmarketingexpo.com/east/2009/full_agenda#259"> Maps, Maps, Maps!</a> – This session looks at how the mapping &amp; local seach services work from the three major search engines. Search representatives will share insights and tips for performing better.</p>
<p><strong>Day 2 – October 6</strong></p>
<p><a title="Ecommerce Search Marketing Tactics" href="http://searchmarketingexpo.com/east/2009/full_agenda2#263"> Ecommerce Search Marketing Tactics</a> – This session looks at a variety of tips and tactics to help retailers and ecommerce merchants do well in search results.</p>
<p><a title="Ask The Search Engines: Best Practices Edition" href="http://searchmarketingexpo.com/east/2009/full_agenda2#265"> Ask The Search Engines: Best Practices Edition</a> – In this session, search engine engineers and representatives share their most important takeaways and tips for those hoping to do better in traffic, plus there’s lots of time devoted to open Q&amp;A.</p>
<p><a title="Ask The SEOs" href="http://searchmarketingexpo.com/east/2009/full_agenda2#269"> Ask The SEOs</a> – PowerPoint Free! Put your questions about SEO to our panel of experts and get answers about ranking and crawling issues.</p>
<p><a title="Universal &amp; Blended Search Opportunities" href="http://searchmarketingexpo.com/east/2009/full_agenda2#275"> Universal &amp; Blended Search Opportunities</a> – This session looks at how to closely monitor your paid search visitors, analyze what they do when visiting your site and how best to convert them toward your goals.</p>
<p><strong>Day 3 – October 7 – Technical SEO Track</strong></p>
<p><a title="Diagnosing Technical SEO Issues" href="http://searchmarketingexpo.com/east/2009/full_agenda3#279"> Diagnosing Technical SEO Issues</a> – Site isn’t being indexed fully? Experience a huge drop in rankings or traffic? This session provides a step by step checklist to help developers or others diagnose site issues and ensure they’re following best technical practices in SEO.</p>
<p><a title="Dealing With Domain Names, URLs, Parameters &amp; All That Jazz" href="http://searchmarketingexpo.com/east/2009/full_agenda3#283">Dealing With Domain Names, URLs, Parameters &amp; All That Jazz</a> – Subdomains or subdirectories? Are tracking parameters a problem for search engines? Keywords in the URL make a difference? How about the &#8220;level&#8221; of a page – is &#8220;deep&#8221; in a site bad? This session focuses on technical SEO questions of this nature and provides answers.</p>
<p><a title="Pumping Up WordPress For SEO" href="http://searchmarketingexpo.com/east/2009/full_agenda3#287"> Pumping Up WordPress For SEO</a> – WordPress is an incredibly popular blogging platform and CMS system. This session shows you how to quickly tune WordPress with plugins and settings to get the most out of technical SEO.</p>
<p><a title="CSS, AJAX, Web 2.0 &amp; SEO" href="http://searchmarketingexpo.com/east/2009/full_agenda3#291">CSS, AJAX, Web 2.0 &amp; SEO</a> – This session looks CSS, AJAX and Web 2.0 dynamic design techniques that can cause search engine indexing and ranking issues, with solutions to consider.</p>
<p><span style="font-size: 11pt; color: #1f497d;"> </span><a href=" http://searchmarketingexpo.com/east/2009/register">Register for SMX East</a> and request your <a href="http://janeandrobot.com/developer-summit/2009-nyc-smx/smx-east">Jane and Robot Developer Summit invite</a>!</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/all-of-your-technical-seo-questions-answered-and-bonus-free-developer-summit-technical-smx-east-dream-agenda-26539/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Lets You Tell Them Which URL Parameters To Ignore</title>
		<link>http://searchengineland.com/google-lets-you-tell-them-which-url-parameters-to-ignore-25925</link>
		<comments>http://searchengineland.com/google-lets-you-tell-them-which-url-parameters-to-ignore-25925#comments</comments>
		<pubDate>Wed, 16 Sep 2009 20:13:28 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Google: Webmaster Central]]></category>
		<category><![CDATA[SEO: Duplicate Content]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=25925</guid>
		<description><![CDATA[A new feature has appeared in the Site Configuration Settings Sections of Google Webmaster Tools. The setting, called Parameter Handling, enables site owners to specify up to 15 parameters that Google should ignore when crawling and indexing the site.
Google lists the parameters they’ve found in the URLs on your site, and indicates whether or not [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-lets-you-tell-them-which-url-parameters-to-ignore-25925"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-lets-you-tell-them-which-url-parameters-to-ignore-25925" height="61" width="51" /></a></div><p>A new feature has appeared in the Site Configuration Settings Sections of Google Webmaster Tools. The setting, called Parameter Handling, enables site owners to specify up to 15 parameters that Google should ignore when crawling and indexing the site.</p>
<p>Google lists the parameters they’ve found in the URLs on your site, and indicates whether or not they think they those parameters are extraneous (with a suggested “Ignore” or “Don’t ignore”. You can confirm or reject those suggestions and can add parameters that aren’t listed.</p>
<p><a title="Google Webmaster Tools Parameter Handling by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3927048854/"><img src="http://farm3.static.flickr.com/2479/3927048854_bebff649f6.jpg" alt="Google Webmaster Tools Parameter Handling" width="500" height="191" /></a></p>
<p>So what does this mean for site owners?</p>
<p>The primary value of the feature is to improve the canonicalization of a site in Google’s index due to <a href="http://searchengineland.com/understanding-search-engines-duplicate-content-issues-11738">duplicate content</a>. Canonicalization issues occur when multiple URLs load the same content. This scenario can be problematic for a number of reasons (for instance, it can skew analytics data) but from a search perspective, canonicalization issues can cause:</p>
<ul>
<li><strong>Crawl efficiency problems:</strong> if search engine bots crawl the same page via multiple URLs, they may not have resources to crawl as many unique pages on the site</li>
<li><strong>PageRank dilution that can lead to lowered search rankings: </strong>if external sites link to multiple versions of a page, each page has less Page Rank value than if all links were to one version</li>
<li><strong>Display and branding problems: </strong>search engines display only one version of the URL; you ideally want the canonical version of a URL to display (mysite.com/goldfish) rather than a version with extraneous parameters (mysite.com/goldfish?adid=1205123&amp;sid=452006&amp;sort=high-rating&amp;loc=sea)</li>
</ul>
<p>A number of canonicalization solutions exist, including several that are Google-specific, so why did they launch this new feature? Yahoo! has included a <a href="http://searchengineland.com/yahoo-site-explorer-adds-dynamic-url-rewriting-tool-11991">similar feature as part of its Site Explorer</a> webmaster product for some time and site owners have been asking for a similar feature from Google for a while (certainly at least since I was working on Webmaster Central).</p>
<p>Below a rundown of the various canonicalization options and how this one differs.</p>
<p><strong>Google Webmaster Tools Parameter Handling: When URLs Can Contain Optional Parameters</strong></p>
<p>This new option only helps with canonicalization issues that are caused by optional parameters that are in a standard key-value pair format and that you specify. In other words, it can only be an exclusionary list (don’t crawl parameters x,y, and z) rather than inclusionary (only crawl parameters a and b).</p>
<p>Wouldn’t you always know the complete list of potential parameters? Hopefully. But some canonicalization issues happen because a URL can take any parameters at all. Ideally, you want to ensure your server isn’t set up this way, but if you need this configuration (for instance another team or outside agency needs the ability to use any custom tracking code without waiting for that parameter code to be added to the server set up), then you’re better off using the meta canonical tag.</p>
<p>The two most common reasons for optional parameters and that this feature will work well for are:</p>
<ul>
<li>Tracking codes used for analytics data (in this case, you may not want to implement a 301 redirect from the long version of the URL to the canonical one since you could lose the data)</li>
<li>Page layout changes, such as sort orders (in this case, the code on the page uses the parameter to change the layout of the page, but from a search engine perspective the content on each version is the same, just in a different order)</li>
</ul>
<p><strong>Why use this canonicalization option over the others?</strong> The biggest benefit is likely in the increase in crawl efficiency. When Google discovers a new URL, they can check the included parameters against the parameter handling list and remove any optional ones before crawling it (but still credit any found links to the page). This could substantially reduce the crawling overhead on a site and could free up considerable bandwidth for getting other pages of the site crawled.</p>
<p>It’s also fairly simple to use. Just scan the list of suggested parameters and click the ones that are optional. In some organizations, it can be difficult to get source code added to web pages, making the implementation of the canonical tag difficult and time consuming. With this option, if you have verified webmaster tools access, you don’t need to involve IT at all.</p>
<p><strong>What are the drawbacks to this option? </strong>The most obvious issue with this option is that it only works for Google. In time past, you could use this setting and the corresponding one in Yahoo! Site Explorer and not worry about other engines. But with Microsoft Bing’s impending (likely) replacement of Yahoo’s search index, it’s quite possible that Yahoo’s feature will go the way of its index, and if Microsoft doesn’t offer something similar, then a search index with 25%+ market share could be getting your URLs wrong.</p>
<p>You could also shoot yourself in the foot, metaphorically speaking. You could accidentally tell Google to ignore important parameters that, if dropped from the index, could wipe out large portions of your site. As Google adds more of these types of features to webmaster tools, it becomes more important to ensure that anyone who has access to them know what they’re doing.</p>
<p>In reality, Google likely has safeguards in place that at least partially protect against such accidental destruction. That’s undoubtedly why they say that “While Google takes suggestions into account, we don&#8217;t guarantee that we&#8217;ll follow them in every case.” They don’t want large portions of their index disappearing either.</p>
<p>Unlike accidental blocking with robots.txt, which search engines follow as a directive, this feature (and many of the others) is a signal only. If the other signals already in place strongly contradict it (for instance, the content seems to be vastly different), it likely won’t be used.</p>
<p>But even though Google has safeguards like this one in place, you may not want to chance it if you’re not confident of which parameters are really optional (all the time, since this is a site-wide setting).</p>
<p>This option also won&#8217;t work if your canonicalization issues aren&#8217;t related to parameters or if the parameters aren&#8217;t in standard key-value pair format.</p>
<p><strong>Meta canonical attribute</strong></p>
<p>The canonical attribute is a page-level meta tag that specifies the canonical version for the page. This can be useful because no matter what optional parameters are added to the version of the URL that renders the page, search engines can always know the canonical version. You can find detailed information about this tag in my article <a href="http://searchengineland.com/canonical-tag-16537">about its launch</a>.</p>
<p><strong>Why use this canonicalization option over the others?</strong> You just specify the canonical version of a page once, and no matter what parameters are added to the URL, search engines are always provided with the canonical version.</p>
<p>Since this meta data is on the page itself, any search engine can read it, and in fact, Google, Yahoo!, and Microsoft have all announced support for it. As of yet though, only Google seems to be actively using it.</p>
<p><strong>What are the drawbacks to this option? </strong>Unlike with the parameter handling feature, search engines have to crawl the page before they can read the tag, so some crawl efficiency is lost. This tag should promote <em>long-term</em> efficiency, however, since theoretically,  once the bot has crawled the non-canonical version of the URL and read the tag, it shouldn’t have to crawl that version of the URL again.</p>
<p>As already noted, implementation requires modification of the page source code, which isn’t always easy within some organizations.</p>
<p>As with parameter handling, it’s possible to implement this tag incorrectly. For instance, it’s been discovered that some sites have accidentally set the canonical version of every page to the home page. As with the parameter handling feature, search engines consider the tag a “strong hint” as a precaution against these types of mistakes and won’t use the data when it strongly contradicts their other signals. In the case of Google, the only search engines who is actively using the tag so far, this has proven to be the case.</p>
<p><strong>301 redirect</strong></p>
<p>It’s  universally agreed that (other than not have multiple versions of a URL at all) the best way to canonicalize URLs is to redirect all versions to the canonical one using a 301 redirect. This implementation sends all users and search engines to the canonical version and effectively consolidates all links to the page and ensures only the canonical one is indexed and ranked.</p>
<p><strong>Why use this canonicalization option over the others?</strong> It’s understood and followed by all major search engines and it provides the best user experience (visitors have one URL to access, bookmark, and share). In most cases, search engines consolidate all links to the redirect target and rank the canonical one.</p>
<p>This option is the best choice when you are moving content (for instance, changing your URL structure or changing domains) and to indicate whether your want content indexed under the www or non-www version of the domain.</p>
<p>Also keep in mind that if you redirect to the canonical version you’re more likely to get links to the right version, since most visitors will simply copy and paste what they see in the address bar.</p>
<p><strong>What are the drawbacks to this option? </strong>When you are using parameters for sort orders or tracking, a redirect may negate those parameters. ou can generally <a href="http://janeandrobot.com/library/url-referrer-tracking">configure your analytics program</a> to handle this properly, but it probably won’t work out of the box.</p>
<p>Redirects aren’t always properly implemented. For instance, they might inadvertently be implemented as a 302 (or worse yet, a JavaScript redirect or meta refresh). Or they may generate redirect loops or infinite redirect chains. In these cases, search engine bots eventually abandon the crawl attempt (and with Google and Microsoft, you can get a list of these URLs in their webmaster tools products).</p>
<p>Redirects can also slow down crawl efficiency, particularly due to redirect chains. Ideally, search engines crawl the redirect then eventually stop crawling the origination URL, but if the bot encounters links to the original URL, it will continue crawling both versions (or more, if the page has moved multiple times).</p>
<p><strong>Google webmaster tools change address feature</strong></p>
<p>This feature enables you to tell Google when you’re changing domains. You have to verify ownership of both the old domain and the new domain and then you can specify a move from one to the other. You can find more information about this  feature xx.</p>
<p><strong>Why use this canonicalization option over the others?</strong> The best use of this feature is when you are changing domains and you aren’t able to implement a 301 redirect from the old domain to the new. (This is the case, for instance, with blogspot.com sites.) Even if you are able to implement the redirect, it can’t hurt to let Google know as well!</p>
<p><strong>What are the drawbacks to this option? </strong>You can only use this option to move from one domain to the other. And as with the other Google webmaster tools features, it only works for Google.</p>
<p><strong>Google webmaster tools preferred domain feature</strong></p>
<p><a href="http://googlewebmastercentral.blogspot.com/2006/09/setting-preferred-domain.html">The preferred domain feature</a> enables you to tell Google whether your want your domain indexed with the www subdomain or without it.. Since most sites resolve either way, a complete duplicate set of content  of your site will exist if you don’t set www/non-www canonicalization. Why is this a problem? Ideally it’s not and search engines consolidate the content correctly. But often, search engines find links to both versions and end up crawling both, indexing both, and crediting the links to the versions separately.</p>
<p><strong>Why use this canonicalization option over the others?</strong> You may as well always use this option, although you should implement a 301 redirect as well, if you can. Google initially implemented this feature for those sites that weren’t able to do so.</p>
<p><strong>What are the drawbacks to this option? </strong>Again, this option works only for Google. And it doesn’t provide as much of a guarantee as a 301 redirect.</p>
<p><strong>Blocking duplicate content with a robots directive</strong></p>
<p>The traditional advice for avoiding duplicate content has been to block the duplicates with robots.txt (or a robots meta tag) to ensure the correct version is indexed. It can be important that the right version be indexed vs. the version intended for print, for instance.</p>
<p><strong>Why use this canonicalization option over the others?</strong> Generally speaking, you shouldn’t now that the canonical meta tag is available. The scenarios for which you wouldn’t want to redirect (such as the print version example) can be more easily solved with the canonical tag and the scenarios for which you’re worried about crawl efficiency issues that would leave large portions of your site uncrawled (such as large-scale optional parameters) can now more easily be solved with Google’s parameter handling feature.</p>
<p><strong>What are the drawbacks to this option? </strong>The primary drawback to this option is the loss of link credit. Any links to blocked pages fall into a black hole and can’t be credited to the canonical version of the page, as happens with the other options.</p>
<p><strong>The parameter handling feature can also provide insight on how Google sees your site
</strong></p>
<p>For some time, Google has been attempting to canonicalize URLs and show the canonical version in the results, even when a site owner hasn’t implemented any of these canonicalization options. For instance, they may determine that several pages contain the same content and algorithmically consolidate them and associate them with the one Google determines is canonical. They haven’t described exactly how they determine the canonical version, but they might, for instance, choose the URL with the fewest number of parameters or the shortest version of the URL.</p>
<p>Last year, they started letting webmasters know <a href="http://googlewebmastercentral.blogspot.com/2008/08/to-infinity-and-beyond-no.html">when they encountered URLs that they thought were extraneous and were causing crawling problems</a>. It’s likely that Google is using a similar source to generate the list of parameters it suggests should be ignored.</p>
<p>In this way, the parameter handling feature provides insight into how Google perceives the site. If you see many parameters listed that aren’t optional, take a look at the content on the URL that use those parameters.</p>
<p>This could signify a larger problem. It could be that Google doesn’t see enough unique content on them (this can happen, for instance, with pages that list part numbers, contain mostly images and item codes, or list little information outside of a login). You may want to look for ways to differentiate the pages a bit more.</p>
<p>Interestingly, the Google Webmaster Central blog has a new <a href="http://googlewebmastercentral.blogspot.com/2009/09/duplicate-content-and-multiple-site.html">post about duplicate content</a>, but no mention of this new feature. Thanks to Brian Ussery for <a href="http://www.beussery.com/blog/index.php/2009/09/parameter-handling/">pointing it out</a>!</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-lets-you-tell-them-which-url-parameters-to-ignore-25925/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Google Logo Remembers Michael Jackson&#8217;s Birthday</title>
		<link>http://searchengineland.com/google-logo-remembers-michael-jacksons-birthday-24784</link>
		<comments>http://searchengineland.com/google-logo-remembers-michael-jacksons-birthday-24784#comments</comments>
		<pubDate>Sun, 30 Aug 2009 01:32:46 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Google: Logos]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=24784</guid>
		<description><![CDATA[Google&#8217;s August 29th logo notes Michael Jackson&#8217;s birthday.

Google has commemorated birthdays of many famous people and events, such as Luciano Pavarotti, Lego, and Dr. Seuss.
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-logo-remembers-michael-jacksons-birthday-24784"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-logo-remembers-michael-jacksons-birthday-24784" height="61" width="51" /></a></div><p>Google&#8217;s August 29th logo notes Michael Jackson&#8217;s birthday.
<a title="google-jackson-logo by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3869410594/"><img src="http://farm4.static.flickr.com/3492/3869410594_7e9e0cc6b7_o.png" alt="google-jackson-logo" width="367" height="126" /></a></p>
<p>Google has commemorated birthdays of many famous people and events, such as <a href="http://searchengineland.com/google-logo-celebrates-luciano-pavarottis-birthday-12414">Luciano Pavarotti</a>, <a href="http://searchengineland.com/google-lego-logo-google-offers-lego-50th-birthday-wishes-13227">Lego</a>, and <a href="http://searchengineland.com/google-wishes-dr-seuss-a-happy-birthday-16748">Dr. Seuss</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-logo-remembers-michael-jacksons-birthday-24784/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Caffeine: Google&#8217;s New Search Engine Index</title>
		<link>http://searchengineland.com/caffeine-googles-new-search-index-23823</link>
		<comments>http://searchengineland.com/caffeine-googles-new-search-index-23823#comments</comments>
		<pubDate>Mon, 10 Aug 2009 23:56:38 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Google: Search Customization]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=23823</guid>
		<description><![CDATA[Google has just unveiled a &#8220;secret project&#8221; of &#8220;next-generation architecture for Google&#8217;s web search&#8220;. This new architecture appears to include crawling, indexing, and ranking changes. For the first time, Google isn&#8217;t simply incorporating these changes into their existing infrastructure or replacing it. Instead, they&#8217;re providing a developer preview and are asking webmasters and power searchers [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fcaffeine-googles-new-search-index-23823"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fcaffeine-googles-new-search-index-23823" height="61" width="51" /></a></div><p>Google has just unveiled a &#8220;secret project&#8221; of &#8220;<a href="http://googlewebmastercentral.blogspot.com/2009/08/help-test-some-next-generation.html">next-generation architecture for Google&#8217;s web search</a>&#8220;. This new architecture appears to include crawling, indexing, and ranking changes. For the first time, Google isn&#8217;t simply incorporating these changes into their existing infrastructure or replacing it. Instead, they&#8217;re providing a <a href="http://www2.sandbox.google.com/">developer preview</a> and are asking webmasters and power searchers to try it out and give them feedback. Unlike Google&#8217;s now-defunct SearchMash, which was intended for search experiments that wouldn&#8217;t necessarily be incorporated into Google&#8217;s main web search, the caffeine index seems to be an entirely new search infrastructure that will repace what exists now.</p>
<p>Based on the blog post, we can guess that this new infrastructure may include ways of crawling the web more comprehensively, determining reputation and authority (possibly beyond the link graph and what&#8217;s typically thought of as PageRank), and returning more relevant results more quickly, although Google&#8217;s Matt Cutts told me that the changes are &#8220;primarily in how we index&#8221;.</p>
<p>While the biggest visible changes in Microsoft&#8217;s relaunched search engine, Bing, are user-interface related, Google&#8217;s new search is only infrastructure related and includes no UI changes. On first glance, however, the underlying infrastructure changes do seem to have impacted user interface as it relates to universal search (likely because universal results are influenced by ranking and relevance signals). For the sample searches I did, the first ten results were fairly similar, but the existence and location of images, video, news, and blog posts was notably different.</p>
<p>A search for [<a href="http://www2.sandbox.google.com/search?hl=en&amp;q=buffy+the+vampire+slayer&amp;aq=f&amp;oq=&amp;aqi=g10">buffy the vampire slayer</a>] on the new infrastructure, for instance, returns video and news results midway down the page.</p>
<p><a title="Buffy Results - Google Caffeine by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3809881828/"><img src="http://farm3.static.flickr.com/2554/3809881828_3db0177934.jpg" border="1" alt="Buffy Results - Google Caffeine" width="450" height="500" /></a></p>
<p>A search on the existing infrastructure, however, returns news at the top, video in the middle, and images at the bottom of the page.</p>
<p><a title="Buffy Results - Google by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3809067943/"><img src="http://farm4.static.flickr.com/3533/3809067943_ed5cc3e91e.jpg" border="1" alt="Buffy Results - Google" width="500" height="481" /></a></p>
<p><a title="Buffy Results - Google by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3809882136/"><img src="http://farm3.static.flickr.com/2488/3809882136_24cdf367e6.jpg" border="1" alt="Buffy Results - Google" width="500" height="241" /></a></p>
<p>Undoubtedly, Google Caffeine will cause quite a kerfluffle in the web developer and search engine optimization world and many will dive in to try and figure out the changes. We&#8217;ll likely see many a speculative blog post about how best to optimize for the new infrastructure, but my guess is that it likely does what Google search does, but better. And the foundational elements of having a crawlable infrastructure and compelling content remain.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/caffeine-googles-new-search-index-23823/feed</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Dear Senator (and Texas Gubernatorial Candidate) Kay Bailey Hutchinson, Here&#8217;s A Free Crash Course On SEO</title>
		<link>http://searchengineland.com/dear-senator-kay-bailey-hutchinson-crash-course-on-seo-23393</link>
		<comments>http://searchengineland.com/dear-senator-kay-bailey-hutchinson-crash-course-on-seo-23393#comments</comments>
		<pubDate>Fri, 31 Jul 2009 17:15:22 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Features: General]]></category>
		<category><![CDATA[How To: SEO]]></category>
		<category><![CDATA[SEO: Spamming]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=23393</guid>
		<description><![CDATA[The &#8220;Kay Bailey Hutchinson for Governor Official Website&#8221; put up by &#8220;Texans for Kay Bailey Hutchison, Allan Shivers, Jr., Treasurer&#8221; over the weekend in support of Kay Bailey Hutchinson for Texas governor was briefly in Google but now appears to be completely missing. Huh.

Odd since Bing seems to have indexed it just fine, although the [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fdear-senator-kay-bailey-hutchinson-crash-course-on-seo-23393"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fdear-senator-kay-bailey-hutchinson-crash-course-on-seo-23393" height="61" width="51" /></a></div><p>The &#8220;Kay Bailey Hutchinson for Governor Official Website&#8221; <a href="http://standybykay.com/">put up</a> by &#8220;Texans for Kay Bailey Hutchison, Allan Shivers, Jr., Treasurer&#8221; over the weekend in support of Kay Bailey Hutchinson for Texas governor was briefly in Google but now appears to be<a href="http://www.google.com/search?hl=en&amp;q=site%3Astandbykay.com"> completely missing</a>. Huh.
<a title="Google Index by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3775549930/"><img src="http://farm3.static.flickr.com/2658/3775549930_67bab8b283.jpg" alt="Google Index" width="500" height="230" /></a></p>
<p>Odd since Bing seems to have indexed it just fine, although the snippet looks a little odd.</p>
<p><a title="Bing Results by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3774743263/"><img src="http://farm3.static.flickr.com/2540/3774743263_4e0364fc7b.jpg" alt="Bing Results" width="500" height="117" /></a></p>
<p>Let&#8217;s take a closer look at the site.</p>
<p><a title="kbhhomepage by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3775548852/"><img src="http://farm4.static.flickr.com/3451/3775548852_22a6f32e5b.jpg" alt="kbhhomepage" width="500" height="454" /></a></p>
<p>Looks OK. Maybe it&#8217;s all in images or JavaScript or something and Google is having a hard time extracting content? I&#8217;ll just take a closer look with web developer toolbar. I&#8217;ll disable CSS and&#8230;</p>
<p>Oh. Wait.</p>
<p><a title="Hidden Text by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3775549246/"><img src="http://farm4.static.flickr.com/3513/3775549246_e0b604acf8.jpg" alt="Hidden Text" width="500" height="219" /></a></p>
<p>Do you think all those hidden keywords revealed when CSS styling is disabled might have anything to do with it? I speculated yes when first writing this piece, and after it was posted, I got this confirmation from Matt Cutts, head of the spam team at Google:</p>
<blockquote><p>Google did take action on this site for hidden text. Hidden text is a violation of our quality <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=35769">guidelines</a>. We&#8217;ve removed the site from our index and tried to contact the site maintainers by email to explain that the hidden text was the cause for the site&#8217;s removal from our index. We also recommended that the webmaster remove all the hidden text and file a valid reconsideration request. More information about requesting reconsideration of a site can be found <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=35843">here</a>.</p></blockquote>
<p>Hiring an SEO firm is just like hiring any other service for expertise that you lack. Since you by definition don&#8217;t have the expertise, how do you know if they&#8217;re doing a good job? Well, Texans for Kay Bailey Hutchinson, Allan Shivers, Jr., Treasurer, today you&#8217;re getting a free crash course in SEO evaluation.</p>
<p><strong>Understanding hidden text</strong></p>
<p>Hidden text is not only against the search engine guidelines and can get you banned, but is amateur and lazy. It&#8217;s like hiring a painter to paint your house and having them show up and throw a bucket of paint in the general direction of your walls.</p>
<p>You might find <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66353">Google&#8217;s webmaster guidelines</a> on the subject helpful:</p>
<blockquote><p>Hiding text or links in your content can cause your site to be perceived as untrustworthy since it presents information to search engines differently than to visitors. Text (such as excessive keywords) can be hidden in several ways, including &#8230; Using CSS to hide text&#8230; If your site is perceived to contain hidden text and links that are deceptive in intent, your site may be removed from the Google index, and will not appear in search results pages. When evaluating your site to see if it includes hidden text or links, look for anything that&#8217;s not easily viewable by visitors of your site. Are any text or links there solely for search engines rather than visitors? If you do find hidden text or links on your site, either remove them or, if they are relevant for your site&#8217;s visitors, make them easily viewable. If your site has been removed from our search results, review our webmaster guidelines for more information. Once you&#8217;ve made your changes and are confident that your site no longer violates our guidelines, submit your site for reconsideration.</p></blockquote>
<p>It&#8217;s quite possible Google has even told you about this hidden text problem. Create a Google Webmaster Tools account and verify ownership of standbykay.com. You may <a href="http://googlewebmastercentral.blogspot.com/2007/07/message-center-let-us-communicate-with.html">have a message</a> waiting for you.</p>
<p>Hidden text is also frustrating for the users you&#8217;re trying to attract and may cause them to get irritated at your site and leave. Let&#8217;s say, for instance, the site does indeed rank for &#8220;what is a keg&#8221;, as it apparently is trying to do. What exactly do they expect searchers who reach the site to do? The site doesn&#8217;t actually answer the question of what a keg is. Do they think that all these keg definition-seekers are going to get distracted and decide to abandon their beer research efforts and become political activists?What other searches are they apparently targeting?</p>
<ul>
<li> texas bbq pit texas smokehouse</li>
<li>chicken fried steak recipe</li>
<li>debachary [sic] definition</li>
<li>gambling addiction</li>
<li>houston rockets</li>
<li>why do we have knees</li>
</ul>
<p>Why <em>do </em>we have knees?!!!</p>
<p>According to the <a href="http://www.statesman.com/blogs/content/shared-gen/blogs/austin/politics/entries/2009/07/30/hutchisonoriented_site_luring.html">Austin American-Statesman</a>, which alerted us to this situation, the campaign has no intention of the removing the hidden text:</p>
<blockquote><p>Hutchison’s campaign spokesman, Jeff ] Sadosky and other campaign aides said this afternoon that only the two phrases using “rick perry gay” will be removed because they won’t play into the campaign’s future messages.</p></blockquote>
<p>[Note: At 2:22pm Eastern, it appears all of the hidden text is <a href="http://twitter.com/morgret/status/3055401441">now gone</a>.]</p>
<p>The explanation is that software is creating the lists of phrases based on search volume and is intended to help direct online advertising efforts. If that&#8217;s indeed the case, wouldn&#8217;t the list be more easily managed as an offline report than as text hidden inside source code of a web page? Just saying. [Can you tell I'm not buying it?]</p>
<p>Which brings us to the next lesson in the SEO crash course.</p>
<p><strong>Understanding keyword research</strong></p>
<p>Sure, there are sophisticated and expensive keyword research tools that provides lots of awesome information, but I&#8217;m venturing a guess that you&#8217;re more in the beginner stage, so you&#8217;ll do just fine with the free stuff. And as a bonus, the free tools give you files you can open in Excel rather than store the data in the source code of your site! The campaign said the software helped them figure out what else those who were searching for  “Rick Perry,” “Kay Bailey Hutchison” and “Texas&#8221; were looking for, so let&#8217;s do the same, shall we? <strong><a href="https://adwords.google.com/select/KeywordToolExternal"></a></strong></p>
<p><strong><a href="https://adwords.google.com/select/KeywordToolExternal">Google AdWords Keyword Tool</a> </strong>- this is the old standby, and it&#8217;s pretty solid and useful since it&#8217;s based on Google searches. And hey, it even lets your sort by search volume!</p>
<p><a title="texas-keywords by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3775549368/"><img src="http://farm3.static.flickr.com/2481/3775549368_c99473dce7_o.jpg" alt="texas-keywords" width="346" height="506" /></a></p>
<p>Tip: &#8220;Texas&#8221; might be overly broad as you&#8217;ll likely get a lot of untargeted traffic. You might want to stick with more relevant keywords, such as &#8220;texas politics&#8221; or &#8220;texas governor race&#8221;. A more targeted set of keywords can give you lots of other valuable information, like that opposing candidate has more searcher interest:</p>
<p><a title="governor-keywords by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3774743893/"> <img src="http://farm4.static.flickr.com/3527/3774743893_0890403f5c_o.jpg" alt="governor-keywords" width="409" height="167" /></a></p>
<p><a href="http://www.google.com/insights/search"><strong>Google Insights For Search</strong></a> -This cool tools lets you compare search terms and zero in on specific regions (such as just Texas) and specific time frames (such as the last 90 days).</p>
<p><a title="Kay Bailey Hutchinson Insights by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3775549502/"><img src="http://farm3.static.flickr.com/2425/3775549502_9f4cf75c4e.jpg" alt="Kay Bailey Hutchinson Insights" width="500" height="370" /></a><a title="Rick Perry Regional Insights by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3774744031/"><img src="http://farm4.static.flickr.com/3433/3774744031_1d087b6487.jpg" alt="Rick Perry Regional Insights" width="500" height="191" /></a></p>
<p><a href="http://labs.wordtracker.com/keyword-questions/"><strong>Wordtracker Keyword Questions</strong></a> &#8211; this new tools uses ISP data to generate a list of questions people are searching for related to keywords. Using this tool, we find that perhaps Texans for Kay Bailey Hutchinson, Allan Shivers, Jr., Treasurer has been doing a few searches of their own.</p>
<p><a title="Wordtracker Questions by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3774744169/"><img src="http://farm3.static.flickr.com/2461/3774744169_737b261316.jpg" alt="Wordtracker Questions" width="330" height="500" /></a></p>
<p>Here&#8217;s another tip! You can use these keywords in the actual copy of the website. It can help you not only be more visible in search engines, but also have deeper engagement with your consituents! How awesome is that. If you want to know what people are searching for, why only use that for ad targeting, when you can use it to understand and reach your audience more generally? For instance, people are apparently asking the question &#8220;why is Kay Bailey Hutchison running against Rick Perry for governor of Texas?&#8221; Add a page to the site with frequently asked questions, add that one, and answer it.</p>
<p><strong>Understanding basic SEO</strong></p>
<p>Texans for Kay Bailey Hutchinson, Allan Shivers, Jr., Treasurer, there&#8217;s another problem with your SEO. Not only does the site have elements that will get it banned, the site doesn&#8217;t have thebasic stuff that can help you be found. Let&#8217;s run down just a few of them.</p>
<p><strong>Extractable text</strong>- None of the visible text on the page is viewable by search engines! (Kinda funny, huh. None of the visible text is viewable, but all the hidden text is.)  It&#8217;s all hidden in JavaScript. With images and JavaScript turned off, the page looks like this:</p>
<p><a title="JavaScript Off by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3775549826/"><img src="http://farm3.static.flickr.com/2566/3775549826_c72b5089b2.jpg" alt="JavaScript Off" width="500" height="423" /></a>
I should clarify, it&#8217;s not that <em>none </em>of the text is visible. &#8220;Political ad Paid for by Texans for Kay Bailey Hutchison, Allan Shivers, Jr., Treasurer&#8221; shows up just fine. So anyone looking for political ads paid for by (etc.), just might find the site. Well, if it wasn&#8217;t banned by Google for hidden text, of course.</p>
<p><strong>Meta tags</strong> &#8211; The keywords, robot, and revisit-after tag are completely ignored in this context. So I hope you didn&#8217;t pay very much for someone to include them. The description tag is fairly important as it&#8217;s your one and only chance to provide a compelling marketing message in the search results to motivate a searcher to click over to the site. Was &#8220;Kay Bailey Hutchison &#8211; Running for Texas Governor&#8221; the most inspiring message you could come up with?<a title="Meta Tags by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3774744311/"><img src="http://farm4.static.flickr.com/3482/3774744311_cd91b79209.jpg" alt="Meta Tags" width="500" height="137" /></a>
<strong>Video</strong> &#8211; Don&#8217;t autoplay video when the page loads. That&#8217;s not really an SEO thing. Just an annoying-the-hell-out-of-me thing. More to the point, add at least a caption above or below the video. Add some descriptive text. Something that lets the search engines know what it is.<strong></strong></p>
<p><strong>Canonicalization </strong>- it sounds like an advanced SEO tactic, but it&#8217;s actually a foundational element that even the most novice SEO should know. www.standbykay.com and standbykay.com both resolve to the site. One should 301 redirect to the other.</p>
<p><strong>Duplicate content</strong> &#8211; Texans for Kay Bailey Hutchison, Allan Shivers, Jr., Treasurer also runs another site, texans.forkay.com. (Why the subdomain? No idea.) They also own texansforkay.com, which has a JavaScript redirect to texans.forkay.com. First lesson in redirects: always go with a 301 redirect.</p>
<p>The bigger issue is why two sites that have exactly the same purpose? So far, standbykay.com doesn&#8217;t have much content (and only one page), but the bulk of it is that dang autoplaying video, which is the same video featured prominently on the texans.forkay.com home page.</p>
<p>That leads me to think that any expansion of standforkay.com might use other content from texans.forkay.com. That&#8217;s another way to ensure the new site <em>won&#8217;t</em> rank in search engines, as they don&#8217;t want to list the same content multiple times. Any good SEO firm should ask why you need another site and work closely with you to define differentiation &#8212; a different purpose, audience, and goals.</p>
<p>But wait? What&#8217;s this? <a href="http://www.kay4texas.com/">kayfortexas.com</a>? That appears to be an exact duplicate of texans.forkay.com. Studying up on that 301 redirect is probably a really good idea.</p>
<p>I hope Texans for Kay Bailey Hutchison, Allan Shivers, Jr., Treasurer enjoyed this crash course in SEO. If you found it helpful and are want a second lesson, might I suggest Google&#8217;s informative <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35843">Requesting reconsideration of your site</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/dear-senator-kay-bailey-hutchinson-crash-course-on-seo-23393/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>What Site Owners, Web Developers &amp; SEOs Should Know About The Yahoo Microsoft Deal</title>
		<link>http://searchengineland.com/what-site-owners-web-developers-and-seos-should-know-about-the-yahoomicrosoft-deal-23344</link>
		<comments>http://searchengineland.com/what-site-owners-web-developers-and-seos-should-know-about-the-yahoomicrosoft-deal-23344#comments</comments>
		<pubDate>Thu, 30 Jul 2009 17:45:26 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Microsoft & Yahoo Search Deal]]></category>
		<category><![CDATA[Microsoft: Bing]]></category>
		<category><![CDATA[SEO - Search Engine Optimization]]></category>
		<category><![CDATA[Top News]]></category>
		<category><![CDATA[Yahoo: APIs]]></category>
		<category><![CDATA[Yahoo: Search Monkey]]></category>
		<category><![CDATA[Yahoo: Site Explorer]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=23344</guid>
		<description><![CDATA[By now, everyone has read all about the news that Yahoo is replacing its search index with Microsoft&#8217;s Bing. In a way, it&#8217;s a great story of complete reversal, as in 2002, Microsoft didn&#8217;t have its own index and instead used Inktomi. Late that year, Yahoo! acquired Inktomi, which spurred Microsoft to start building its [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fwhat-site-owners-web-developers-and-seos-should-know-about-the-yahoomicrosoft-deal-23344"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fwhat-site-owners-web-developers-and-seos-should-know-about-the-yahoomicrosoft-deal-23344" height="61" width="51" /></a></div><p>By now, everyone has read all about the news that Yahoo is <a href="http://searchengineland.com/its-finally-official-microsoft-yahoo-make-a-deal-yahoo-gives-up-on-search-23197">replacing its search index with Microsoft&#8217;s Bing</a>. In a way, it&#8217;s a great story of complete reversal, as in 2002, Microsoft didn&#8217;t have its own index and instead used Inktomi. Late that year, Yahoo! acquired Inktomi, which spurred Microsoft to start building its own search index to avoid having a search supplier owned by a major competitor. Now Yahoo is ditching its index (including all of the technology it acquired with Inktomi) to use the very index it motivated Microsoft to build.</p>
<p>We&#8217;ve read about what this <a href="http://searchengineland.com/micro-hoo-details-qa-with-mehdi-schneider-23248">means for advertisers</a> (more overall traffic from the combined audience, use of Microsoft adCenter for self-serve and Yahoo!&#8217;s sales force for premium) and for <a href="http://searchengineland.com/microsoft-yahoo-search-deal-simplified-23299">searchers</a> (they likely won&#8217;t notice), but what does the deal means for those who create websites: publishers, web developers, and SEOs?</p>
<p><strong>Web Developers</strong></p>
<p>The hardest hit by this change will likely be developers. Over the last couple of years, Yahoo seems to have shifted its focus from innovating the search index to innovating its <a href="http://searchengineland.com/microsoft-yahoo-search-deal-simplified-23299">developer offerings</a>: encouraging <a href="http://searchengineland.com/yahoo-lets-you-build-your-own-search-service-14349">third-party developmen</a>t and creating a &#8220;<a href="http://searchengineland.com/more-yahoo-search-monkey-details-creating-a-developer-ecosystem-for-search-13571">developer ecosystem</a>&#8221; for search.</p>
<p>Any developer options that don&#8217;t rely on the Yahoo search index may be unscathed. In particular, the non-search development tools and search-related offerings that are solely focused on the user interface may continue to be supported. While Bing will power Yahoo&#8217;s search engine, Yahoo will control their user interface and likely will try to continue to differentiate there. That&#8217;s means <a href="http://searchengineland.com/yahoo-searchmonkey-becomes-more-mainstream-14498">Search Monkey</a>, which enables site owners to enhance how their results appear on Yahoo, is potentially safe.</p>
<p><a href="http://searchengineland.com/yahoo-lets-you-build-your-own-search-service-14349">Build Your Own Search Service</a> (BOSS) likely won&#8217;t be so lucky. BOSS is built on the Yahoo index as its foundation. A company can build their own search engine using Yahoo&#8217;s underlying technology and differentiate via the user experience.  Essentially, that&#8217;s what Yahoo is planning to do now with Bing as their underlying technology. No more Yahoo index likely means no more BOSS. Yahoo all but <a href="http://tech.groups.yahoo.com/group/ysearchboss/message/2018">concedes as much</a>: &#8220;We can tell you that BOSS will remain live for the time being.&#8221; What does that mean for <a href="http://developer.yahoo.com/search/boss/">companies like</a> hakia, OneRiot, Daylife, and Cluuz? And for that matter, all of the developers using BOSS who are now <a href="http://tech.groups.yahoo.com/group/ysearchboss/messages?o=1">filling the Yahoo BOSS message boards</a> with questions?</p>
<p>On the <a href="http://developer.yahoo.net/blog/archives/2009/07/developer_update.html">Yahoo developer blog</a>, Yahoo commented that &#8220;For SearchMonkey and BOSS, we currently do not have anything concrete to tell you. Clearly, we’ll need to work with Microsoft to determine what makes the most sense for you and for us.&#8221; If BOSS&#8217;s future is left up to Microsoft, I have no doubt that future will involve migrating BOSS users to the Bing search API. In order to continue to support BOSS, Microsoft would have to completely recreate it to work with the Bing search infrastructure. Why would they do that when they can increase the audience of a product they already have? It&#8217;s possible they&#8217;ll add some of the unique BOSS features their search API (such as unlimited queries, ability to mash up the data with other sources, and ability to tweak ranking signals), but I wouldn&#8217;t hold my breath. The Yahoo BOSS team is just as in the dark as the developers wanting answers. From a <a href="http://tech.groups.yahoo.com/group/ysearchboss/message/2018">message board post</a>: &#8220;What specifically does it mean for BOSS? Honestly the team is still absorbing the implications and we just don&#8217;t know.&#8221;.</p>
<p>BOSS users could switch to <a href="http://code.google.com/apis/customsearch/">Google&#8217;s Custom Search API</a>, but it is more restrictive than Microsoft&#8217;s offering, and isn&#8217;t really well-suited as the foundation of a search engines or other commercial company. Several other companies offer web indices, such as <a href="http://www.commoncrawl.org/">CommonCrawl</a> and <a href="http://aws.amazon.com/alexawebsearch/">Alexa</a>, so perhaps they or a new company will take advantage Boss&#8217;s imminent demise and offer matching features.</p>
<p>Any Yahoo offerings that don&#8217;t rely on an underlying index, such as the <a href="http://developer.yahoo.com/yui/">Yahoo User Interface library</a> are likely going to remain. Yahoo confirmed this in their blog post:</p>
<blockquote><p>&#8220;We’ve also received questions about the future of Yahoo!&#8217;s other developer offerings, such as <a href="http://developer.yahoo.com/yui/">YUI</a>, <a href="http://developer.yahoo.com/yql/">YQL </a>,  and <a href="http://pipes.yahoo.com/pipes/">Pipes</a>. We wanted to let you know that today’s news does not affect these products. None of our other <strong>non-search developer products</strong> are affected.&#8221; [Emphasis mine.]</p></blockquote>
<p>However, look for any search index-based offerings (such as the Maps API and Local API) to be deprecated in favor of the Bing equivalents once the deal goes through.</p>
<p><strong>Search Engine Optimizers and Site Owners</strong></p>
<p>What about those who are concerned with getting customer acquisitions through organic search? How will this change impact them? From a traffic perspective, take a look at how well you&#8217;re indexed and ranked in Bing. That&#8217;s how well you&#8217;ll be indexed and ranked in Yahoo. What do your titles and descriptions look like in the results in Bing? That&#8217;s how things will generally look in Yahoo. This might not be a bad thing for site owners, as over the last year, Yahoo&#8217;s search quality seems to have been declining to the point that I&#8217;ve <a href="http://twitter.com/vanessafox/status/2878226180">been wondering if their engineering team</a> had already begun to be phased out or least was spending a lot of time at the bar mourning the likely phase out.</p>
<p>Just as you don&#8217;t need to optimize separately for AOL since they use Google&#8217;s index, you won&#8217;t need to optimize for Yahoo since they&#8217;ll use Bing&#8217;s index. The exception to this may be in how Yahoo displays results. We&#8217;ll have to wait and see exactly what this means, but Bing has been trying to differentiate in display and it supposedly, Yahoo will continue to do that as well. This may mean, for instance:</p>
<ul>
<li>SearchMonkey will continue to be important as a way to stand out in the results.</li>
<li>Hmm. I can&#8217;t really think of anything else.</li>
</ul>
<p>My guess is that the <a href="http://www.ysearchblog.com/2007/05/02/introducing-robots-nocontent-for-page-sections/">robots-nocontent tag</a> will no longer be supported, since Bing&#8217;s infrastructure doesn&#8217;t support it. The search engines have already come together to standardize their <a href="http://searchengineland.com/yahoo-google-microsoft-clarify-robotstxt-support-14125">support of robots.txt</a> and <a href="http://searchengineland.com/sitemapsorg-update-you-can-now-store-your-xml-sitemap-files-anywhere-13476">XML Sitemaps</a>, so site owners shouldn&#8217;t worry about changing anything with those.</p>
<p>The bigger issue many SEOs are concerned about is Site Explorer. <a href="http://siteexplorer.search.yahoo.com/">Site Explorer</a> is one of the more reliable tools for competitive backlink research. You can see a substantial list of links to any site, generally ordered according to value. That&#8217;s useful stuff! Both Google and Bing Webmaster Tools provide backlink data, but only for your own sites. Yahoo will be unable to maintain Site Explorer without a search index of their own. Will Bing take it over? Well, it could add the feature to its Webmaster Tools, but Microsoft has historically been moving the other direction. They removed the ability to query their index for link data with the <a href="http://www.bing.com/community/blogs/webmaster/archive/2008/08/13/making-backlinks-actionable-again.aspx">link: operator in 2007</a> and have never brought it back for competitive research.</p>
<p>Microsoft likely won&#8217;t be motivated to add a feature that they specifically chose to remove. And it&#8217;s not trivial to build the code to query for competitive links and store the data. Believe me, I know. I managed the process for adding non-competitive backlink data to Google Webmaster Tools. As with the potential end to BOSS, the potential end to Site Explorer opens up new opportunities for third-parties. In fact, the same companies who build a web index could provide competitive link data. Currently, SEOmoz  provides <a href="http://www.seomoz.org/linkscape">Linkscape</a>, which offers some similar features. (Speaking of SEOmoz, Rand Fishkin posted yesterday about the <a href="http://www.seomoz.org/blog/top-10-things-the-microsoftyahoo-deal-change-for-seo">SEO impact</a> of this deal.) <a href="http://www.majesticseo.com/">Majestic SEO</a> and <a href="http://www.exalead.com/search/web/results/?q=link%3Awww.searchengineland.com">Exalead</a> provides link data as well.</p>
<p>More generally, will Microsoft step up its efforts with webmaster relationships? Yahoo used to have a fairly significant presence in the community. In addition to Site Explorer, they were a constant at conferences and participated in online discussions. That participation has declined lately, coinciding with the decline in search quality. Microsoft seemed to be rallying with its webmaster relationships with the <a href="http://www.bing.com/community/blogs/webmaster/archive/2007/11/14/get-better-results-from-live.aspx">launch of the Webmaster Center</a> in November 2007. But Microsoft hasn&#8217;t updated the Webmaster Center with new features since <a href="http://www.bing.com/community/blogs/webmaster/archive/2008/08/13/making-backlinks-actionable-again.aspx">August 2008</a>. (A minor <a href="http://www.bing.com/community/blogs/webmaster/archive/2008/11/25/live-search-webmaster-center-fall-update.aspx">release in November</a> didn&#8217;t add  new features).</p>
<p>Microsoft didn&#8217;t respond to my questions about their current and future resource investment in this area. They did recently release a rudimentary <a href="http://www.iis.net/extensions/SEOToolkit">SEO Toolkit</a>, although it requires Windows Vista and IIS 7.0 to run.</p>
<p>And what about paid inclusion? Yahoo has long offered <a href="http://searchmarketing.yahoo.com/srchsb/ssp.php">Search Submit Pro</a>, which essentially enables sites to pay to be included in the organic listings. Microsoft doesn&#8217;t offer a similar product and while it&#8217;s certainly possible that Microsoft will add this product to their offerings, paid inclusion is quite a substantial shift in overall approach to organic search. It&#8217;s less about the ability to implement the technology and more about belief around what constitutes an &#8220;organic&#8221; index. <a href="http://searchengineland.com/live-blogging-the-microsoft-yahoo-search-press-conference-23202">Danny Sullivan asked about paid inclusion</a> at the announcement press conference. Carol Bartz, Yahoo CEO replied, &#8221; Paid inclusion, we’ll decide on that later.&#8221; But it would be difficult for Yahoo to continue the program on its own, as Yahoo will no longer have control over what pages are included in the search index.</p>
<p><strong>In the end, it&#8217;s about the traffic</strong></p>
<p>The big question is will this partnership significantly change market share percentages? Depending on whose numbers you use, Google has either <a href="http://www.comscore.com/Press_Events/Press_Releases/2009/7/comScore_Releases_June_2009_U.S._Search_Engine_Rankings">65%</a> or <a href="http://searchengineland.com/hitwise-bing-both-grows-google-still-tops-22202">74%</a> share in the US (more in some European countries). That puts the combined Yahoo/Microsoft share at 28% or 25.5%. That&#8217;s substantial traffic, sure, and worth paying attention to. But what will the share look like in three years once the deal is done and we barely remember Yahoo ever had its own index? My guess is pretty similar to how it looks now. Except Google will probably have slightly higher share. I just don&#8217;t see anything game changing here that will cause a mass exodus from the status quo. But I&#8217;ve been wrong before. What I do know is that site owners who have ignored how their sites were doing in Bing until now do to low traffic numbers will likely start paying a lot more attention.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/what-site-owners-web-developers-and-seos-should-know-about-the-yahoomicrosoft-deal-23344/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Yahoo Supports Even More Structured Data In SearchMonkey</title>
		<link>http://searchengineland.com/yahoo-supports-even-more-structured-data-in-searchmonkey-21254</link>
		<comments>http://searchengineland.com/yahoo-supports-even-more-structured-data-in-searchmonkey-21254#comments</comments>
		<pubDate>Thu, 18 Jun 2009 16:36:58 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Yahoo: Search Monkey]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=21254</guid>
		<description><![CDATA[Yahoo started the push to structure the web when it launched SearchMonkey. With SearchMonkey, a web developer could add semantic markup to a page and then build an enhanced listing for Yahoo search results.  Not long after, they made things even easier by using this data to enhance listings even without a SearchMonkey application. Google [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fyahoo-supports-even-more-structured-data-in-searchmonkey-21254"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fyahoo-supports-even-more-structured-data-in-searchmonkey-21254" height="61" width="51" /></a></div><p>Yahoo started the <a href="http://searchengineland.com/more-yahoo-search-monkey-details-creating-a-developer-ecosystem-for-search-13571">push to structure the web</a> when it launched SearchMonkey. With SearchMonkey, a web developer could add semantic markup to a page and then build an enhanced listing for Yahoo search results.  Not long after, they made things even easier by <a href="http://www.ysearchblog.com/2009/03/12/embed-videos-games-and-docs-with-searchmonkey/">using this data to enhance listings</a> even without a SearchMonkey application. <a href="http://searchengineland.com/google-search-now-supports-microformats-and-adds-rich-snippets-to-search-results-19055">Google jumped on the structured data bandwagon</a> as well, picking up things like reviews information to provide additional information in search results. <a href="http://searchengineland.com/microsofts-search-engine-optimization-advice-for-bing-21152">Microsoft Bing&#8217;s new document for webmasters</a> also encourages the use of structured data, although it&#8217;s not clear what they do with the data. Yahoo continued its structured data campaign last week, with the <a href="http://searchengineland.com/yahoo-announces-common-tag-like-the-meta-keywords-tag-but-even-better-21021">announcement of Common Tag</a>, a vocabulary that extends RDFa.</p>
<p>Now <a href="http://www.ysearchblog.com/2009/06/18/searchmonkey-updates-new-enhanced-results-and-support-of-google-base-formatting/">Yahoo has announced support</a> for <a href="http://developer.search.yahoo.com/start">several additional opportunties</a> for automated enhanced listings.  You can use DataRSS XML to send a feed directly to Yahoo or use RDFa or microformat markup on your pages for content such as products, events, and news. In addition, Yahoo can now use a Google Base feed to create enhanced listings. Simply submit your Google Base feed through <a href="https://siteexplorer.search.yahoo.com/mysites">Site Explorer</a>. As I mentioned in my <a href="http://searchengineland.com/yahoo-announces-common-tag-like-the-meta-keywords-tag-but-even-better-21021">analysis of Common Tag</a>, these moves seem to be motivated at least in part by Yahoo&#8217;s quest to increase the reach of <a href="http://searchengineland.com/yahoo-lets-you-build-your-own-search-service-14349">BOSS</a>. Yahoo&#8217;s blog post notes that when you submit your Google Base feed, Yahoo converts it to DataRSS and makes it available in BOSS so it can be used for third-party search engine development.</p>
<p>While none of the search engines are using structured data for crawling, indexing, or ranking, it&#8217;s still likely worthwhile to include this markup for data like events, news, and reviews to take advantage of the opportunity to have a search result that stands out in Yahoo and Google (and potentially in Bing as well).</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/yahoo-supports-even-more-structured-data-in-searchmonkey-21254/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft&#8217;s Search Engine Optimization Advice for Bing</title>
		<link>http://searchengineland.com/microsofts-search-engine-optimization-advice-for-bing-21152</link>
		<comments>http://searchengineland.com/microsofts-search-engine-optimization-advice-for-bing-21152#comments</comments>
		<pubDate>Thu, 18 Jun 2009 16:08:08 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[How To: SEO]]></category>
		<category><![CDATA[Microsoft: Bing]]></category>
		<category><![CDATA[SEO: General]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=21152</guid>
		<description><![CDATA[Microsoft recently published a PDF about Search Engine Optimization called Bing: New Features Relevant to Webmasters. This is the second SEO-related offering in as many weeks. During SMX Advanced, Microsoft launched the IIS Search Engine Optimization Toolkit. I&#8217;m working on a review of that as well, but it requires Vista or Windows 7 and IIS [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fmicrosofts-search-engine-optimization-advice-for-bing-21152"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fmicrosofts-search-engine-optimization-advice-for-bing-21152" height="61" width="51" /></a></div><p>Microsoft recently published a PDF about Search Engine Optimization called <a href="http://www.bing.com/community/blogs/webmaster/archive/2009/06/11/bing-white-paper-for-webmasters-amp-publishers-released.aspx">Bing: New Features Relevant to Webmasters</a>. This is the second SEO-related offering in as many weeks. During SMX Advanced, Microsoft launched the <a href="http://www.iis.net/extensions/SEOToolkit">IIS Search Engine Optimization Toolkit</a>. I&#8217;m working on a review of that as well, but it requires Vista or Windows 7 and IIS 7, so installation takes a bit of effort. I talked with Microsoft about the information in the PDF and got some clarity and additional information.  The PDF primarily describes the user interface changes launched with Bing, and how those might impact site owners, but also touches on search engine optimization.</p>
<p>Google has provided SEO advice for a while, both in their <a href="http://google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=35291">help center</a> and in their <a href="http://googlewebmastercentral.blogspot.com/2008/11/googles-seo-starter-guide.html">Search Engine Optimization Starter Guide</a>, so check those out if you&#8217;re interested in hearing the official search engine stance on SEO. Google&#8217;s document is much more instructive, providing their top guidelines for creating websites that can be easily crawled and indexed by search engines.</p>
<p><strong>What can we learn from Microsoft&#8217;s advice for webmasters?</strong></p>
<p>The document starts by describing the primary difference in Bing vs. Live Search that impacts site owners: the first search results page no longer shows results 6 through 10 for the searcher&#8217;s query. Instead, the page includes &#8220;categorized&#8221; results after the list of 5 sites that match the initial query. So, for instance, if you search for [<a href="http://www.bing.com/search?q=volkswagen+beetle&amp;FORM=AWRE">vw beetle</a>], results 6 through 8 are for [volkswagon bettle for sale] and results 9 through 12 are for [used volkswagon beetle].</p>
<p><a title="Bing Categories by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3638995066/"><img src="http://farm4.static.flickr.com/3659/3638995066_d87baf1bf2.jpg" alt="Bing Categories" width="358" height="500" /></a></p>
<p><strong>Expanded long tail opportunities</strong></p>
<p>The document clarifies that more than 300 factors are used in determining this categorization and that what categories appear is entity-dependent. In other words, if you search for [<a href="http://www.bing.com/search?q=buffy+the+vampire+slayer&amp;go=&amp;form=QBRE">buffy the vampire slayer</a>], you may see a category for &#8220;wallpaper&#8221; because lots of people search for Buffy wallpaper, but you might see categories for &#8220;episodes&#8221; and &#8220;characters&#8221; because Microsoft has identified &#8220;Buffy the Vampire Slayer&#8221; as a TV show.</p>
<p>Microsoft says that this new way of organizing results provides new long tail opportunities for site owners. They suggest determining what entities you most want your site to rank for and then optimizing for the long tail queries within those entities. For instance, if your site is about cars, you&#8217;ll want to optimize for the car brands (VW, Volvo, etc.) but also car-related categories such as dealers, used, and for sale. That will give you additional opportunities to rank on the first page of results.</p>
<p><strong>Best match</strong></p>
<p>Bing has also introduced a concept called &#8220;best match&#8221;. When Bing is reasonably confident that the first result is what the searcher is looking for, it will be noted as the &#8220;best result&#8221; and could contain additional details, such as a box for searching within the site, phone numbers, and links to pages within the site. In some instance, no results other than the best match appear on the page, which Microsoft says their team chooses when they have high confidence in the result and the query volume is high. Of course, this isn&#8217;t great news for the site owner at position 2.</p>
<p><a title="Bing Best Match by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3638182479/"><img src="http://farm4.static.flickr.com/3328/3638182479_9cecf80350.jpg" alt="Bing Best Match" width="500" height="280" /></a></p>
<p>If you don&#8217;t want your site to appear as the best match, don&#8217;t want the internal site search surfaced, want to correct the phone number that appears, or want to remove links, you can request that by <a href="http://www.bing.com/community/blogs/webmaster/archive/2009/06/10/best-match-deep-links-feature-support.aspx">emailing bestmatc@microsoft.com</a>.</p>
<p><strong>Document preview</strong></p>
<p>Document preview provides additional content from the site in a hover and, in Microsoft&#8217;s words, &#8220;helps searchers find the content they want faster, without leaving the SERP until they are ready&#8221;.</p>
<p><a title="Bing Hover by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3638182683/"><img src="http://farm4.static.flickr.com/3612/3638182683_d9ddb17128.jpg" alt="Bing Hover" width="500" height="211" /></a></p>
<p>They say this helps increase qualified traffic, but some webmasters might think that it helps searchers find the content they want without leaving the SERP at all. For them, Microsoft provides a way to opt-out of the feature. Just add the following to the &lt;head&gt; section of each page:</p>
<pre>&lt;meta name=“msnbot”, content=“nopreview”&gt;</pre>
<p>If you don&#8217;t want this feature used on any page of the site, it might be easier to return the directive in the server HTTP header as follows:</p>
<pre>x-robots-tag: nopreview</pre>
<p>Not everything from the document preview comes from the site. Some information could be from external sources as well. For instance, if the site is in Flash and Microsoft has trouble extracting data, they might turn to a third-party source. And they may use local information such as an address or phone number from an external source. You can request that third-party data not be used for your listing by <a href="http://www.bing.com/community/blogs/webmaster/archive/2009/06/10/best-match-deep-links-feature-support.aspx">emailing bestmatc@microsoft.com</a>.</p>
<p><strong>Flash extraction</strong></p>
<p>Microsoft says that Flash-based sites are responsible for 21% of all empty descriptions in their index. They say they are doing &#8220;limited data extraction&#8221; and are now able to generate descriptions for one-third of those. They may also use anchor text from incoming links. Of course, Google has been using incoming anchor text as source data for missing titles for a while, and has continued to <a href="http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299">evolve its ability to extract Flash data</a>. No word on whether Microsoft is using Adobe&#8217;s crawler API or another technology for this extraction.</p>
<p><strong>Use of microformats</strong></p>
<p>While the document encourages the use of structured data, such as microformats, Microsoft tells me that they aren&#8217;t currently using this data for crawling, indexing, or ranking. This stance mirrors <a href="http://searchengineland.com/google-search-now-supports-microformats-and-adds-rich-snippets-to-search-results-19055">Google&#8217;s</a> and <a href="http://searchengineland.com/yahoo-announces-common-tag-like-the-meta-keywords-tag-but-even-better-21021">Yahoo&#8217;s</a>, both of whom are encouraging structured data use, but are not yet using the data for web search.</p>
<p><strong>Instant answers</strong></p>
<p>Instant answers, the &#8220;OneBox&#8221;-style results that provide data to answer the query, aren&#8217;t new to Bing, but may be gaining prevalence. For now, Microsoft tells me that you can request evaluation of your data for inclusion using their <a href="http://www.bing.com/community/blogs/webmaster/archive/2009/06/10/best-match-deep-links-feature-support.aspx">support forum</a>.</p>
<p><strong>Local listings</strong></p>
<p>Local results are more heavily featured in Bing. You can add your site to their <a href="https://ssl.bing.com/listings/ListingCenter.aspx">Local Listing Center</a> to ensure you have the opportunity to be featured in these results.</p>
<p><strong>Searcher behavior</strong></p>
<p>Microsoft provides some interesting data about searcher behavior, which they tell me is based on a number of internal studies derived from analysis of their search logs and toolbar logs. For instance, they found that searchers refine their query, bounce back to the search result, or abandon the search 50% of the time. The breakdown of that 50% is below.</p>
<p><a title="Microsoft Search Data by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3638994838/"><img src="http://farm4.static.flickr.com/3378/3638994838_dcf341a5a6_o.png" alt="Microsoft Search Data" width="387" height="362" /></a></p>
<p>They also found that searchers repeated 24% of their queries during a session (which led to the additional of search history in the left column of the search results page).</p>
<p>All in all, this document doesn&#8217;t provide a lot of new information about SEO. But I applaud Microsoft for understanding things like removing results 6 through 10 and replacing them with categorized results and adding a hover with additional content from the site could impact search traffic and for providing information about these features. It&#8217;s difficult to say just how much these changes will effect traffic patterns. The biggest change, of course, would come if Bing brings about Microsoft&#8217;s goal of <a href="http://searchengineland.com/bing-comscore-sees-gains-compete-sees-same-21158">increasing search share</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/microsofts-search-engine-optimization-advice-for-bing-21152/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
