<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Search Engine Land &#187; SEO: Flash</title>
	<atom:link href="http://searchengineland.com/library/seo/seo-flash/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: News On Search Engines, Search Engine Optimization (SEO) &#38; Search Engine Marketing (SEM)</description>
	<lastBuildDate>Fri, 25 May 2012 23:34:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Google Can Now Execute AJAX &amp; JavaScript For Indexing</title>
		<link>http://searchengineland.com/google-can-now-execute-ajax-javascript-for-indexing-99518</link>
		<comments>http://searchengineland.com/google-can-now-execute-ajax-javascript-for-indexing-99518#comments</comments>
		<pubDate>Tue, 01 Nov 2011 17:57:26 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[SEO: Blocking Spiders]]></category>
		<category><![CDATA[SEO: Flash]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=99518</guid>
		<description><![CDATA[This morning we reported that the comments on Facebook are being indexed by Google. Google&#8217;s Matt Cutts just confirmed on Twitter that Google is now able to &#8220;execute AJAX/JS to index some dynamic comments.&#8221; This gives Google&#8217;s spider, GoogleBot, the ability to read comments in AJAX or JavaScript, such as Facebook comments or Disqus comments [...]]]></description>
			<content:encoded><![CDATA[<p>This morning we reported that the <a href="http://searchengineland.com/many-facebook-comments-now-being-indexed-by-google-99399">comments on Facebook are being indexed</a> by Google. Google&#8217;s Matt Cutts just <a href="https://twitter.com/#!/mattcutts/status/131425949597179904">confirmed on Twitter</a> that Google is now able to &#8220;execute AJAX/JS to index some dynamic comments.&#8221;</p>
<p>This gives Google&#8217;s spider, GoogleBot, the ability to read comments in AJAX or JavaScript, such as Facebook comments or Disqus comments and others that are dynamically loaded via AJAX or JavaScript. In addition, this means, Google is better at seeing the content behind more of your JavaScript or AJAX.</p>
<p><img class="alignnone size-full wp-image-99520" title="cutts-google-index-ajax" src="http://searchengineland.com/figz/wp-content/seloads/2011/11/cutts-google-index-ajax.png" alt="" width="545" height="228" /></p>
<p><strong>Postscript:</strong> Google now has an official blog <a href="http://googlewebmastercentral.blogspot.com/2011/11/get-post-and-safely-surfacing-more-of.html">post</a> up with more details.</p>
<h3>Related Stories:</h3>
<ul>
<li><a href="http://searchengineland.com/google-proposes-to-make-ajax-crawlable-27408">Google Offers A Proposal To Make AJAX Crawlable</a></li>
<li><a href="http://searchengineland.com/google-offers-seo-advice-on-ajax-coding-12637">Google Offers SEO Advice On AJAX Coding</a></li>
<li><a href="http://searchengineland.com/googles-proposal-for-crawling-ajax-may-be-live-34411">Google May Be Crawling AJAX Now – How To Best Take Advantage Of It</a></li>
<li><a href="http://searchengineland.com/its-official-googles-proposal-for-crawling-ajax-urls-is-live-37298">It’s Official: Google’s Proposal For Crawling AJAX URLs is Live</a></li>
<li><a href="http://searchengineland.com/an-update-on-javascript-menus-and-seo-16060">An Update On Javascript Menus And SEO</a></li>
<li><a href="http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299">Google Now Crawling And Indexing Flash Content</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-can-now-execute-ajax-javascript-for-indexing-99518/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Google Can Index &#8220;Almost Any Text&#8221; In A Flash (SWF) File</title>
		<link>http://searchengineland.com/google-can-index-almost-any-text-in-a-flash-swf-file-55580</link>
		<comments>http://searchengineland.com/google-can-index-almost-any-text-in-a-flash-swf-file-55580#comments</comments>
		<pubDate>Fri, 12 Nov 2010 14:09:39 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[SEO: Flash]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=55580</guid>
		<description><![CDATA[The Google Webmaster Central Blog posted an update on their Flash indexing capabilities. Google said, as of today, they are able to index and understand &#8220;almost any text a user can see as they interact with a SWF file on your site.&#8221; Typically, Flash has been one of those big No-Nos in the SEO space. [...]]]></description>
			<content:encoded><![CDATA[<p>The Google Webmaster Central Blog <a href="http://googlewebmastercentral.blogspot.com/2010/11/what-feeling-even-better-indexing-of.html">posted</a> an update on their Flash indexing capabilities.  Google said, as of today, they are able to index and understand &#8220;almost any text a user can see as they interact with a SWF file on your site.&#8221;</p>
<p>Typically, Flash has been one of those big No-Nos in the SEO space.  I still do not believe any SEO would recommend anyone to build out a site completely built in Flash.  However, there are many Flash sites out there that are not concerned with being indexed, either intentionally or not intentionally, but Google wants to index them.</p>
<p>Google has been working with Adobe to continually better index Flash content.  In 2008, Google made stride in  <a href="http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299">indexing Flash content</a> and improved on that <a href="http://searchengineland.com/google-now-indexing-flash-from-external-files-21303">last year</a> as well.  Now, Google is able to index almost all the text and links within a Flash file.</p>
<p>In addition, Google said they can index the latest compatible content for Flash Player 10.1 and they are better at understanding JavaScript to embed SWF content.  Finally, Google is also better at detecting and extracting meta data from Flash video files, metadata such as alternate thumbnails.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-can-index-almost-any-text-in-a-flash-swf-file-55580/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Google Now Indexing Flash From External Files</title>
		<link>http://searchengineland.com/google-now-indexing-flash-from-external-files-21303</link>
		<comments>http://searchengineland.com/google-now-indexing-flash-from-external-files-21303#comments</comments>
		<pubDate>Fri, 19 Jun 2009 11:34:05 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[SEO: Flash]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=21303</guid>
		<description><![CDATA[The Google Webmaster Central blog finally blogged Google is now indexing the content of files that some SWF files load externally, technically from &#8220;external resource loading&#8221;. Flash often loads external files to generate what you see in the SWF, so if the SWF is calling an external HTML, XML, another SWF, etc files, Google will [...]]]></description>
			<content:encoded><![CDATA[<p>The Google Webmaster Central blog finally <a href="http://googlewebmastercentral.blogspot.com/2009/06/flash-indexing-with-external-resource.html">blogged</a> Google is now indexing the content of files that some SWF files load externally, technically from &#8220;external resource loading&#8221;.  Flash often loads external files to generate what you see in the SWF, so if the SWF is calling an external HTML, XML, another SWF, etc files, Google will index it.  Vanessa&#8217;s coverage of the Google I/O event also <a href="http://searchengineland.com/google-io-new-advances-in-the-searchability-of-javascript-and-flash-but-is-it-enough-19881">covers</a> this from a couple weeks back.</p>
<p>The example given by Google on how this improves the search results is by showing a search snippet for the query [<a href="http://www.google.com/search?q=2002+VW+Transporter+888&#038;ie=utf-8&#038;oe=utf-8&#038;aq=t">2002 VW Transporter 888</a>].  In the past, Google would not have been able to show this snippet:</p>
<p><a href="http://www.flickr.com/photos/rustybrick/3640394845/" title="Google External File Flash by rustybrick, on Flickr"><img src="http://farm4.static.flickr.com/3625/3640394845_4bd07ff4c9.jpg" width="500" height="104" alt="Google External File Flash" /></a></p>
<p>Google began <a href="http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299">indexing Flash</a> files back in June 2008.  Now Google can handle SWFs in many ways, including:</p>
<ul>
<li>Index textual content displayed as a user interacts with the file. We click buttons and enter input, just like a user would.</li>
<li>Discover links within Flash files.</li>
<li>Load external resources and associate the content with the parent file.</li>
<li>Support common JavaScript techniques for embedding Flash, such as SWFObject and SWFObject2.
Index sites scripted with AS1 and AS2, even if the ActionScript is obfuscated.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-now-indexing-flash-from-external-files-21303/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google I/O: New Advances In The Searchability of JavaScript and Flash, But Is It Enough?</title>
		<link>http://searchengineland.com/google-io-new-advances-in-the-searchability-of-javascript-and-flash-but-is-it-enough-19881</link>
		<comments>http://searchengineland.com/google-io-new-advances-in-the-searchability-of-javascript-and-flash-but-is-it-enough-19881#comments</comments>
		<pubDate>Fri, 29 May 2009 18:11:04 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: APIs]]></category>
		<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Google: Webmaster Central]]></category>
		<category><![CDATA[SEO: Flash]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=19881</guid>
		<description><![CDATA[This week at Google I/O, Google talked a lot about the evolution of the technological capabilities of the web. HTML 5 is ushering in new era of browser-based development and applications. Eric Schmidt, Google CEO, kicked things off with, &#8220;My message to you is that this is the beginning of the real win of cloud [...]]]></description>
			<content:encoded><![CDATA[<p><a title="Google I/O by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3576675864/"><img src="http://farm4.static.flickr.com/3635/3576675864_4b64ddab0c_o.jpg" alt="Google I/O" width="239" height="75" align="right" /></a><a href="http://searchengineland.com/live-blogging-google-io-keynote-19812">This week at Google I/O</a>, Google talked a lot about the evolution of the technological capabilities of the web. <a href="http://dev.w3.org/html5/spec/Overview.html">HTML 5</a> is ushering in new era of browser-based development and applications. Eric Schmidt, Google CEO, kicked things off with, &#8220;My message to you is that this is the beginning of the real win of cloud computing, of applications, of the internet, which is changing the paradigm that we’ve all grown up with so that it just works … regardless of platform or hardware you’re using.&#8221;</p>
<p>Which is great and all, but if &#8220;the web has won&#8221;, as Vic Gundotra, VP of Engineering at Google proclaimed, then it&#8217;s not just application development that&#8217;s moved to the web. The potential consumers of these applications have moved to the web too. And Google, more than any other company, knows that <a href="http://searchengineland.com/how-we-navigate-our-online-landscape-12350">search has become the primary navigation point of the web</a>. We&#8217;ve become a searching culture and if we don&#8217;t see something in the first 10 search results on Google, we may not realize it exists. (<a href="http://www.pewinternet.org/Reports/2008/Search-Engine-Use.aspx">A 2008 PEW/Internet survey</a> found that 49% of online Americans use search engines every day, and a <a href="http://www.iprospect.com/about/researchstudy_2008_blendedsearchresults.htm">2008 iProspect/JupiterResearch study</a> found that &#8220;68% of search engine users click a search result within the first page of  results, and a full 92% of search engine users click a result within the first  three pages of search results.&#8221;)</p>
<p>Web applications need more than technology to thrive. They also need customers. And more often than not these days, those customers are acquired through search. But how well can this new world of the web that Google is ushering in at Google I/O be crawled, indexed, and ranked by search engines such as Google?</p>
<p>If the search engines&#8217; ability to handle the rich internet applications (RIAs) developers have been creating over the last few years are any indication, <a href="http://janeandrobot.com/library/search-friendly-design-patterns-for-web-developers">not very well</a>. Google in particular has <a href="http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299">recently made strides in this area</a>, as it benefits them in their <a href="http://www.google.com/corporate/">quest</a> to organize the world&#8217;s information and make it universally accessible and useful. But years after Flash and AJAX hit the market, their <a href="http://googlewebmastercentral.blogspot.com/2007/11/spiders-view-of-web-20.html">searchability still isn&#8217;t ideal</a>.</p>
<p>When this year&#8217;s <a href="http://code.google.com/events/io/about.html">Google I/O conference</a> was first announced, I talked with Tom Stocky, Director of Product Management for developer products at Google, and asked about the searchability issues with some of the Google Code APIs and products. He noted that Google I/O included a session about searchability: <a href="http://code.google.com/events/io/sessions/SearchFriendlyDevelopment.html">Search Friendly Developmen</a>t, presented by Maile Ohye. Having heard Maile speak about developer issues before, I have no doubt that her content was top notch, but that doesn&#8217;t answer my underlying question about the general searchability of the code Google offers to developers. After all, if Google is encouraging developers to<a href="http://code.google.com/events/io/sessions/BuildingBusinessFreeApis.html"> &#8220;build a business&#8221; with their APIs</a>, they should realize that building a business is about more than application construction &#8212; it&#8217;s about the ability to acquire customers as well.</p>
<p><strong>JavaScript and AJAX</strong></p>
<p>For instance, <a href="http://code.google.com/apis/ajax/">Google&#8217;s AJAX APIs</a> are created with, well, AJAX, which is <a href="http://www.businessol.com/seo-blog/2007/12/making-ajax-seo-friendly.html">notoriously difficult to index</a>. A core issue with AJAX is that it dynamically changes the content of the page. The URL doesn&#8217;t change as the new content loads. Often, the URL is appended with a hash mark (#). Historically, a # in a URL has denoted a named anchor within a page, and thus search engines generally drop everything in a URL beginning with a # as not to index the same page multiple times.</p>
<p>You can see this implementation on <a href="http://www.coldfusionbloggers.org">coldfusionbloggers.org</a>, for instance.</p>
<p>Click the Next button, and the URL is appended as follows: http://www.coldfusionbloggers.org/#2</p>
<p>Other AJAX implementations don&#8217;t append anything to the URL, but simply dynamically load new content on the page based on clicks. Take a look at an example from the <a href="http://code.google.com/apis/ajax/playground/#tabbed_display_mode">Google Code Playground</a>.</p>
<p><a title="Google Code Playground by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3571264085/"><img src="http://farm4.static.flickr.com/3633/3571264085_b48a448e90.jpg" alt="Google Code Playground" width="500" height="463" /></a></p>
<p>In this example, each of the three tabs (Local, Web, and Blog) contains unique content. The URL of the page doesn&#8217;t change as the content reloads. You might expect one of two things to happen:</p>
<ul>
<li> search engines associate the content from the tab that appears when the page first loads with the URL</li>
<li> search engines associate the content from all of the tabs with the URL.</li>
</ul>
<p>Either of these scenarios could happen, depending on how the code is implemented. What actually happens in this case is less desirable than either of those options. Because the entire tab architecture is loaded in JavaScript, search engines can&#8217;t access the content at all.</p>
<p>You can see this in action for an implementation <a href="http://googleajaxsearchapi.blogspot.com/2008/11/styling-searchcontrol-guest-post.html">showcased on the Google AJAX APIs blog</a>. The <a href="http://jgeerdes.home.mchsi.com/google/googleAjaxApiSearchControlStyled.html">sample site</a> uses a tabbed architecture to organize results, but all that shows up in Google search results is &#8220;loading&#8221;:</p>
<p><a title="Loading Sample by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3576557916/"><img src="http://farm3.static.flickr.com/2438/3576557916_c843326d04.jpg" alt="Loading Sample" width="500" height="240" /></a></p>
<p>A similar thing happens with<a href="http://74.125.155.132/search?q=cache:QBqLLs4mVpkJ:www.agilegallery.com/ajax-demo.html+http://www.agilegallery.com/ajax-demo.html&amp;cd=1&amp;hl=en&amp;ct=clnk&amp;gl=us&amp;client=firefox-a"> this page that uses AJAX</a> to provide navigation through photos:</p>
<p>All Google sees is &#8220;loading photos, please wait&#8230;&#8221;</p>
<p><a title="loading by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3575465556/"><img src="http://farm4.static.flickr.com/3630/3575465556_9e0abfcddd.jpg" alt="loading" width="500" height="202" /></a></p>
<p>A <a href="http://www.dynamicdrive.com/dynamicindex17/ajaxtabscontent/">dynamic menu such as the one described here</a> has related, but different issues. Each tab loads content from an external HTML file. Google doesn&#8217;t see that content as part of the page. Rather, it sees the content as part of those external files and <a href="http://www.google.com/search?q=inurl%3Ahttp%3A%2F%2Fwww.dynamicdrive.com%2Fdynamicindex17%2Fajaxtabscontent%2F+%22This+is+the+contents+of+%22external3.htm%22+&amp;ie=utf-8&amp;oe=utf-8&amp;aq=t">indexes them separately</a>.</p>
<p>There are ways to get the content indexed, of course.</p>
<ul>
<li>You can do away with AJAX in this instance and use CSS and divs instead.</li>
<li>You can use the approach taken on the Yahoo home page. The Featured, Entertainment, Sports, and Life tabs load inline when you click on them, but if JavaScript isn&#8217;t enabled, the tabs are shown as links to separate pages. (For instance, the entertainment tab links to http://entertainment.yahoo.com/.)</li>
</ul>
<p style="padding-left: 30px;"><a title="Yahoo Home Page by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3571275153/"><img src="http://farm4.static.flickr.com/3664/3571275153_7c5f5508e3.jpg" alt="Yahoo Home Page" width="500" height="224" /></a></p>
<p style="padding-left: 30px;">You can tell that Google sees the version of the Yahoo page in which the tabs link to separate pages because this is the behavior found in the <a href="http://74.125.155.132/search?hl=en&amp;q=cache%3Ayahoo.com&amp;btnG=Search">Google cache</a>.</p>
<ul>
<li>You can gracefully <a href="http://www.businessol.com/seo-blog/2007/12/making-ajax-seo-friendly.html">degrade your AJAX implementation</a> or use <a href="http://www.sitepoint.com/blogs/2007/02/23/handling-javascript-disabled-browsers/">progressive enhancement techniques.</a></li>
</ul>
<ul>
<li>You can implement a technique that <a href="http://adactio.com/">Jeremy Keith</a> describes as <a href="http://domscripting.com/presentations/xtech2006/">Hijax</a>: return false from the onClick handler and include a crawlable URL in the href as shown below:</li>
</ul>
<pre style="padding-left: 30px;">&lt;a href=”ajax.htm?foo=123” onClick=”navigate('ajax.html#foo=123');
return  false”&gt;123&lt;/a&gt;</pre>
<p style="padding-left: 30px;">Of course, with an implementation like this, similar to the Yahoo home page experience, search engines may index each page individually or as variations of the same page (depending on what the AJAX code is meant to do), rather than as part of a single, comprehensive page  and when visitors enter your site through search, those individual pages will be how they first experience your site.</p>
<p><strong>Cloaking?</strong></p>
<p>None of these JavaScript/AJAX workarounds are <a href="(http://google.com/support/webmasters/bin/answer.py?answer=66355">considered cloaking</a>, which can get a site banned from Google, because they don&#8217;t present content based on user agent (such as Googlebot). Rather, they present &#8220;degraded&#8221; content to any user agent without JavaScript support (including screen readers, some mobile devices, and older browsers) or &#8220;enhanced&#8221; content to any user agent with JavaScript support.</p>
<p><strong>Flash and Flex</strong></p>
<p>Similar issues exist with <a href="http://www2.webmasterradio.fm/office-hours/2009/adobe-flash-improvements">Adobe technologies such as Flash and Fle</a>x. Search engines have historically had trouble crawling Flash. Last year, Adobe made a search crawler version of the Flash player available to Google so it could extract text and links, and more recently launched an <a href="http://www.adobe.com/devnet/seo/">SEO knowledge center</a>, but <a href="http://www.ninebyblue.com/blog/search-friendly-flash/">problems remain</a>.</p>
<p>If the Flash application is built with a single URL (rather than changing URLs for each interaction), then the site visitor coming from search will always enter the site at the home page, and have no way of knowing how to get to the content. Not to mention this style of web application is difficult to share. It&#8217;s much easier to copy and paste a link to a cute pair of shoes than it is to email a friend instructions like, &#8220;go to the home page, then click shoes, then click sandals, then brown, then scroll to the third page and look in the fifth row, second pair over from the left.&#8221; Seriously, no matter how good a friend you are to me, I am not following those instructions.</p>
<p>The Flex framework introduces challenges similar to AJAX. <a href="http://www.insideria.com/2008/09/advanced-flex-deep-linking-wit-1.html?sdid=DNJYZ">It creates new URLs by adding a hash mark (#)</a>. As with AJAX, this can make things faster because new pages aren&#8217;t loading with each click, but also as with AJAX, search engines drop everything in a URL beginning with the #. Depending on your infrastructure, you might be able to <a href="http://grails.org/URL+mapping">remap</a> or rewrite the URLs, but that sort of defeats the purpose of using a coding framework to begin with.</p>
<p><strong>Should Google Search and Google Code Have Coffee?</strong></p>
<p>I (more than many people, having previously worked in Google search) understand that Google is a big company. And I know that both the search and code teams are working hard at their core goals. And believe me, I really love the people working on both teams. I&#8217;ve had the wonderful opportunity to work very closely with many of them, and I can vouch for the fact that any lack of integration between the two is not for lack of caring. Those teams care very much about putting out quality products that their customers are delighted by.</p>
<p>But as can happen in big companies where everyone is working hard, there seems to be a disconnect. Google APIs and other products for developers should be built search-engine friendly right out of the box not only because they&#8217;re being built by Googlers who should therefore know better, but because searchability is vital to for a business to be successful on the web today. Code should be secure; it should function properly; it should be crawlable.</p>
<p>At Google I/O&#8217;s first <a href="http://sites.google.com/a/pressatgoogle.com/googleio2009/keynote-videos">keynote session</a>, Vic Gundotra touted the <a href="http://www.whitehouse.gov/OpenForQuestions/">White House&#8217;s use of Google Moderator</a> earlier this year as an example of how useful Google tools are to developers. Sure, the AppEngine servers running Google Moderator held up, but <a href="http://www.ninebyblue.com/blog/whitehousegov-keeping-searchers-from-asking-questions-inadvertently/">none of the discussion could be crawled or indexed by search engines</a>. Maybe this didn&#8217;t matter to the White House (although I would think that some American citizens might find it helpful to be able to search through the questions people had). But a small business using Google Moderator as its forum or support framework would almost certainly want that content to be found in Google searches.</p>
<p><strong>Google is working on it</strong></p>
<p>The Google Webmaster Central team has been providing a wealth of education around these issues to help developers build search-friendly web sites. For instance:</p>
<ul>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=81766">Search-friendly AJAX</a></li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=139066">Canonicalization</a></li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=83105">Site moves</a></li>
</ul>
<p>At Maile&#8217;s Search-Friendly Development session at Google I/O, Google announced two advances in their ability to crawl and index RIAs. While both of these advances are great efforts, they were driven entirely by the search team. And unfortunately, they don&#8217;t solve the issues with the Google Code APIs. Wouldn&#8217;t it be great if the new <a href="http://google-code-updates.blogspot.com/2009/05/introduce-google-web-elements.html">Web Elements</a> they just announced were search-engine friendly by default?</p>
<p><strong>Flash improvements</strong></p>
<p>Google <a href="http://googlewebmastercentral.blogspot.com/2008/06/improved-flash-indexing.html">made improvements in crawling content in Flash files in July of 2008</a>, but wasn&#8217;t able to extract content from external files (such as resource files). In addition, many sites loaded Flash via a JavaScript link on the home page, and since Google didn&#8217;t follow the JavaScript link, they couldn&#8217;t get to the Flash at all. Both of those things have changed as of this week. Google can now access content in external files and can follow JavaScript links to Flash applications.</p>
<p>Both of these are great improvements, but the <a href="http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299">advice in my original article remains</a>. Don&#8217;t use Flash in situations where you really don&#8217;t need it. Make sure that all text and links are built into the Flash file as text and links (and not, for instance, in images), and create a new URL for each interaction.</p>
<p><strong>JavaScript improvements</strong></p>
<p>Google has also been crawling some JavaScript for a while. Primarily, they&#8217;ve been <a href="http://googlewebmastercentral.blogspot.com/2007/11/spiders-view-of-web-20.html">extracting very simply coded links</a>. As of today, they&#8217;re able execute JavaScript onClick events. They still recommend using progressive enhancement techniques, however, rather than to rely on Googlebot&#8217;s ability to extract from the JavaScript (not just for search engine purposes, but for accessibility reasons as well).</p>
<p>Googlebot is now able to construct much of the page and can access the onClick event contained in most tags. For now, if the onClick event calls a function that then constructs the URL, Googlebot can only interpret it if the function is part of the page (rather than in an external script).</p>
<p>Some examples of code that Googlebot can now execute include:</p>
<ul>
<li>
<pre>&lt;div onclick="document.location.href='http://foo.com/'"&gt;</pre>
</li>
<li>
<pre>&lt;tr onclick="myfunction('index.html')"&gt;&lt;a href="#"
onclick="myfunction()"&gt;new page&lt;/a&gt;</pre>
</li>
<li>
<pre>&lt;a href="javascript:void(0)" onclick="window.open
('welcome.html')"&gt;open new window&lt;/a&gt;</pre>
</li>
</ul>
<p>These links pass both anchor text and PageRank.</p>
<p><strong>The end of progressive enhancement?</strong></p>
<p>Does this mean that web developers no longer need to progressively enhance their JavaScript? I would recommend continuing this practice. Not only does it benefit accessibility and mobile users, but search engines other than Google aren&#8217;t yet crawling JavaScript. And in any case, we will have to watch and see what happens in the Google search results before we know exactly how JavaScript content is handled.</p>
<p>For instance, for that Star Trek example above, the entire body code is as follows:</p>
<pre id="line17"> &lt;<span class="start-tag">body</span>&gt;
    &lt;<span class="start-tag">div</span><span class="attribute-name"> id</span>=<span class="attribute-value">"searchcontrol"</span>&gt;Loading&lt;/<span class="end-tag">div</span>&gt;
  &lt;/<span class="end-tag">body</span>&gt;</pre>
<p>The JavaScript function in the head section of the page loads the Google AJAX API code externally, and even with these latest improvements, Googlebot is still only able to interpret code on the page.</p>
<p><strong>What about all those workarounds to accommodate Googlebot&#8217;s previous lack of support for JavaScript?</strong></p>
<p>One issue that is worth investigating is how is Google handling &lt;noscript&gt; content and content when the onClick handler returns false? If a developer uses the Hijax method described above as a workaround for URLs with hash marks in them, will Google now see only the non-search-friendly version of the URL? I asked Google about these issues and they told me regarding Hijax that &#8220;we try to mimic the browser&#8217;s behavior, but it&#8217;s still possible for us to discover URLs even though the function returns false.&#8221;</p>
<p>As for &lt;noscript&gt;, Google told me that text outside of noscript is best, but they do process information inside of &lt;noscript&gt;. That statement is worth digging into a bit more [and I'll update this post when I have more information]. Does Google prefer text outside of &lt;noscript&gt; because they can now easily crawl all the text inside of JavaScript? Or is it a technique that could be perceived as potentially manipulative? And what does &#8220;process  information&#8221; mean exactly? [<strong>Updated with additional information from Google: </strong>as noted below in their help information, &lt;noscript&gt; is a technique that can be useful for graceful degradation, but HTML text is the best option, if possible. And as noted below, the &lt;noscript&gt; text should match what displays in JavaScript-enabled browsers.]</p>
<p>Conventional wisdom has generally been that using &lt;noscript&gt; to provide graceful degradation for those without JavaScript support was fine by the search engines as long as the text in the &lt;noscript&gt; matched the text in the JavaScript exactly. <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=66355">Google itself recommends it</a> in their guidelines. Although, to be fair, there&#8217;s always <a href="http://www.stonetemple.com/blog/?p=80">been a bit of concern</a> about this method. In Google&#8217;s words (emphasis mine):</p>
<blockquote>If your site contains elements that aren&#8217;t crawlable by search engines (such  as rich media files other than Flash, JavaScript, or images), you shouldn&#8217;t  provide cloaked content to search engines. Rather, you should consider visitors  to your site who are unable to view these elements as well. For instance:</p>
<ul>
<li>Provide alt text that describes images for visitors with screen readers or  images turned off in their browsers.</li>
<li><strong>Provide the textual contents of JavaScript in a noscript tag.</strong></li>
</ul>
<p><strong>Ensure that you provide the same content in both elements (for instance,  provide the same text in the JavaScript as in the noscript tag).</strong> Including  substantially different content in the alternate element may cause Google to  take action on the site.</p>
<p>Sneaky JavaScript redirects</p>
<p>When Googlebot indexes a page containing JavaScript, it will index that page  but it cannot follow or index any links hidden in the JavaScript itself. Use of  JavaScript is an entirely legitimate web practice. However, use of JavaScript  with the intent to deceive search engines is not. <strong>For instance, placing  different text in JavaScript than in a noscript tag violates our <a href="answer.py?answer=35769">webmaster guidelines</a></strong> because it displays  different content for users (who see the JavaScript-based text) than for search  engines (which see the noscript-based text). Along those lines, it violates the  webmaster guidelines to embed a link in JavaScript that redirects the user to a  different page with the intent to show the user a different page than the search  engine sees. When a redirect link is embedded in JavaScript, the search engine  indexes the original page rather than following the link, whereas users are  taken to the redirect target. Like cloaking, this practice is deceptive because  it displays different content to users and to Googlebot, and can take a visitor  somewhere other than where they intended to go.</p>
<p>Note that placement of links within JavaScript is alone not deceptive. When  examining JavaScript on your site to ensure your site adheres to our guidelines,  consider the intent.</p>
<p>Keep in mind that since search engines generally can&#8217;t access the contents of  JavaScript, legitimate links within JavaScript will likely be inaccessible to  them (as well as to visitors without Javascript-enabled browsers). <strong>You might  instead keep links outside of JavaScript or replicate them in a noscript  tag.</strong></blockquote>
<p><strong>What about paid links</strong>?</p>
<p>On the one hand, this is great news for the web. Google can access more content, which means searchers now have easier access as well. One potential issue is that historically, JavaScript was one of the common methods used for coding a <a href="http://searchengineland.com/the-2007-paid-links-war-in-review-13032">link that was paid</a> (an advertising rather than editorial link). <a href="http://google.com/support/webmasters/bin/answer.py?answer=66736">Google has been recommending for some time</a> that site owners either add a nofollow attributes to those links or use a URL redirector and block the redirect via robots.txt, but they haven&#8217;t explicitly said that the JavaScript method was no longer allowed.</p>
<p><a href="http://www.seomoz.org/blog/the-paid-links-debate-rages-on-ses-san-jose-2007">As late as 2007</a>, Google&#8217;s Matt Cutts was <a href="http://www.mattcutts.com/files/paid-links-presentation.ppt">espousing JavaScript as a valid option</a> for indicating links that shouldn&#8217;t pass PageRank because they were paid for. It&#8217;s likely that many sites around the web use this method for &#8220;machine-readable&#8221; disclosure that the links are advertising. In fact, one of the advertising platforms that Matt showcased as coding links properly is <a href="http://quigo.com/index.html">Quigo</a> (now owned by AOL), which appears to still enclose its ad links in JavaScript without adding the extra step of blocking the URL redirect with robots.txt. Look, for instance, at the <a href="http://sportsillustrated.cnn.com/">Sports Illustrated</a> home page:</p>
<p><a title="Quigo Ads by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3572196628/"><img src="http://farm4.static.flickr.com/3416/3572196628_8ab9049319.jpg" alt="Quigo Ads" width="500" height="367" /></a></p>
<p>You can see that the sponsored links are implemented in JavaScript (and when onClick returns false, the link is disabled). The links are redirected through Quigo&#8217;s ad server: redir.adsonar.com, but this subdomain <a href="http://redir.adsonar.com/robots.txt">isn&#8217;t blocked with robots.txt</a>.</p>
<p>What about <a href="http://advertising.microsoft.com/publishersolutions">Microsoft adCenter Content Ads</a>? The example below is from an MSN page.</p>
<p><a title="Microsoft adCenter Content Ads by Search Engine Land, on Flickr" href="http://www.flickr.com/photos/23148333@N06/3572247416/"><img src="http://farm4.static.flickr.com/3647/3572247416_f297b2941b.jpg" alt="Microsoft adCenter Content Ads" width="500" height="481" /></a></p>
<p>The ad links are run through a JavaScript redirect (a variation of r.msn.com, in this case, 0.r.msn.com). But this subdomain has no <a href="http://0.r.msn.com/robots.txt">robots.txt</a>. Is MSN in danger of violating the Google webmaster guidelines? Will be Microsoft adCenter be penalized for selling links? Surely not. Surely Google is working out a way to crawl more of the web, while not inadvertently penalizing large portions of it. (Although this may not be an issue in any case. Both of the examples above use a 302 redirect, which likely satisfies Google&#8217;s guidelines about coding paid links in such a way that they don&#8217;t pass PageRank.)</p>
<p>When I asked Google about this, they told me:</p>
<blockquote>Our onclick processing is becoming more widespread, but keep in mind it&#8217;s still an area where we&#8217;re constantly improving. We already detect many ads generated by onclick events.</p>
<p>To prevent PR [PageRank] flow, it remains a good practice to do things like have the onclick-generated links in an area that&#8217;s blocked from robots, or to use a url redirector that&#8217;s robots.txt disallow&#8217;d. Penalties for spam techniques have been and will continue to be enforced, but as you know, we work extremely hard to minimize false positives.</p>
<p>Webmaster Tools Message Center already sends emails to developers to inform them when we believe that they are inadvertently violating our guidelines. Whether it&#8217;s through our blog or our tools, we&#8217;ll continue to find ways to communicate with webmasters, especially as we further innovate in our crawling capability. Processing onclicks is one step of many! :)</blockquote>
<p>I entirely understand their answer, even if I might not entirely like it. They want to crawl and index more of the web and they have to keep evolving to do that. Their aim is to only penalize those sites that intentionally violate their guidelines, but they&#8217;re not going to give away the secret sauce of how they detect that intention.</p>
<p>But the truth is that most people who have websites haven&#8217;t heard of the <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=140528">Google Webmaster Tools Message Center</a> or <a href="http://googlewebmastercentral.blogspot.com/">Google Webmaster Central blog</a> (or Search Engine Land!). The web contains substantially more site owners than the 52,000 who are subscribed to Google&#8217;s webmaster blog. Most site owners don&#8217;t know what SEO means. One could argue that anyone who has a web site <em>should </em>know about these things, but most small business owners don&#8217;t know really know how to set up accounts payable either, but they&#8217;re doing accounting themselves too because they can&#8217;t afford to get expert help. And while it&#8217;s not Google&#8217;s responsibility to ensure business owners know how to run their businesses properly, it is in Google&#8217;s best interest to index all of the web. And if they change the rules in inadvertently throw innocent businesses out of their search results, they&#8217;re not reaching that goal.</p>
<p>Since Google itself previously recommended JavaScript as a way to block paid links, it seems a bit much for them to now expect the entire web to modify their sites now. Google should take it upon themselves to sort things out. And to be fair, they&#8217;re saying that they will. But I imagine some webmasters will be nervous about leaving things up to Google, when the risk of them getting it wrong is being removed from the index.</p>
<p><strong>This isn&#8217;t a new issue</strong></p>
<p>As technology on the web continues to advance, web developers will continue to confront these issues.  New infrastructure and platforms will move faster than search engines, which after all, were originally built on the concept of HTML-powered, text-based web pages. So developers will have to create workarounds and then dismantle those workarounds as search engines catch up.</p>
<p>This happened, for instance, with dynamic URLs. Originally, search engines had trouble with URLs that contained characters such as question marks (?) and ampersands (&amp;). In fact, Google advised in its guidelines to avoid using &amp;=sid <a href="http://googlewebmastercentral.blogspot.com/2006/10/update-to-our-webmaster-guidelines.html">until mid 2006</a>.</p>
<p>To get around this, some sites encoded their URLs to appear static. This <a href="http://www.sitepoint.com/article/dynamic-site-seo-tips-hints/">Sitepoint article on dynamic URLs</a> in 2002 explained:</p>
<blockquote>For example, the following URL contains both &#8220;?&#8221; and &#8220;&amp;,&#8221; making it non-indexable:</p>
<pre>http://www.planet-source-code.com/vb/scripts/ShowCode.asp?
lngWId=3&amp;txtCodeId=769</pre>
<p>Below, it has been made search engine-friendly (all &#8220;?&#8221; and &#8220;&amp;&#8221; and &#8220;=&#8221; characters replaced with alternate characters):</p>
<pre>http://www.planet-source-code.com/xq/ASP/txtCodeId.769/lngWId.3
/qx/vb/scripts/ShowCode.htm</pre>
</blockquote>
<p>Dennis Goedegebuure of eBay explains  <a href="http://thenextcorner.net/w0qq-double-encoding-legacy/">in his blog</a> that eBay employed this technique:</p>
<blockquote>In 2004 search engines were not smart enough to read dynamic URL&#8217;s. Especially those URL&#8217;s that had a lot of parameters in them to determine sort order or aspects of the product search for shopping sites were a problem to get these indexed. Replacing the dynamic parameters like &amp; or ? with static delimiters was one technique back in the days to make a dynamic URL static for the search engines to crawl.</p>
<p>Now fast forward to 2009, Search Engines have become much smarter and are now able to understand dynamic URL with parameters much better. Last week they even announced their new canonical tag to help website owners to avoid duplicate content issues when it comes to sort order.</blockquote>
<p>In fact, Google is now so good at interpreting dynamic URLs that use traditional patterns that Maile Ohye used eBay as an example of what not to do in a presentation at SMX West:</p>
<blockquote>[At] SMX West a Google rep presented on URL structure. One part of her presentation was about MAVRICK URL&#8217;s, and in particular the long and complicated url&#8217;s you sometimes see on the Interwebs. i.e. used in her presentation:</p>
<p><a href="http://shop.ebay.com/items/_W0QQ_nkwZipodQQ_armrsZ1QQ_fromZR40QQ_mdoZ">http://shop.ebay.com/items/_W0QQ_nkwZipodQQ_armrsZ1QQ_fromZR40QQ_mdoZ</a></blockquote>
<p>Things on the web have evolved so much that an implementation built entirely to ensure that pages could be crawled by search engines is now being used as an example of what not to do if you want pages to be crawled by search engines!</p>
<p><strong>Continuing the conversation</strong></p>
<p>Clearly, Google and the other major search engines want to crawl and index all of the content on the web. That is, after all, why they continue to evolve their crawlers to adapt to changing technology. And clearly, they want to help site owners and web developers build sites that can be easily found in search. And while it may seem like I&#8217;m singling Google out, I&#8217;m focusing on them now only because of their messaging of building a business with Google APIs at Google I/O. In truth, <a href="http://developer.yahoo.com/">Yahoo</a> and <a href="http://msdn.microsoft.com/en-us/default.aspx">Microsoft</a> are also aggressively courting developers and their developer groups seem to be just as disconnected from their search teams. Microsoft&#8217;s IIS versions 5 and 6, for instance, are configured to <a href="http://www.getfoundnow.com/iis_301_redirect.htm">implement redirects as 302s by default</a>, when for search engine value, <a href="http://searchengineland.com/search-illustrated-301-and-302-redirects-explained-13934">redirects should be 301s</a>. (Newer versions provide more search-friendly configurations.)</p>
<p>Two upcoming events will provide opportunities for us to continue the discussion about search-engine friendly web development with Google, Microsoft, Yahoo, and Adobe. Next week at SMX Advanced in Seattle, we&#8217;ve got a <a href="http://www.ninebyblue.com/blog/developers-welcome-at-smx-advanced/">whole set of things for developers</a> including lunch discussion tables and an after-hours Q&amp;A with Adobe, and Matt Cutts will be doing a Q&amp;A session where we can ask him all about the ripple effects of these advances. The following week in San Francisco, <a href="http://janeandrobot.com/">Jane and Robot</a> is holding a <a href="http://janeandrobot.com/events/developer-summit">search developer summit</a>. Again, all the reps will be on hand for some in-depth technical discussion. We&#8217;ll also have beer, which we all just might need.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-io-new-advances-in-the-searchability-of-javascript-and-flash-but-is-it-enough-19881/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>How Does Google Really Handle Flash Content?</title>
		<link>http://searchengineland.com/how-does-google-really-handle-flash-content-15081</link>
		<comments>http://searchengineland.com/how-does-google-really-handle-flash-content-15081#comments</comments>
		<pubDate>Tue, 14 Oct 2008 16:47:55 +0000</pubDate>
		<dc:creator>Matt McGee</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[SEO: Flash]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=15081</guid>
		<description><![CDATA[Much was made earlier this year of Google&#8217;s announcement that they&#8217;ve improved the indexing of Flash content. For years, search marketers have warned against building sites in Flash because they weren&#8217;t able to be indexed by major search engines; an all-Flash site was a kiss of death where SEO is concerned. More than three months [...]]]></description>
			<content:encoded><![CDATA[<p>Much was made earlier this year of Google&#8217;s announcement that they&#8217;ve <a href="http://googlewebmastercentral.blogspot.com/2008/06/improved-flash-indexing.html">improved the indexing of Flash content</a>. For years, search marketers have warned against building sites in Flash because they weren&#8217;t able to be indexed by major search engines; an all-Flash site was a kiss of death where SEO is concerned.</p>
<p>More than three months after Google&#8217;s announcement, has anything really changed? </p>
<p><span id="more-15081"></span>That&#8217;s the question Atlanta-based SEO Brian Ussery recently set out to answer. After conducting several case studies, Brian published the results in an article titled <a href="http://www.beussery.com/blog/index.php/2008/10/google-flash-seo/">2009 Google Flash SEO</a>.</p>
<p>Here&#8217;s a brief summary of Brian&#8217;s tests:</p>
<ul>
<li>Does Google correctly associate Flash content with the right &#8220;parent URL&#8221;?
<li>Can Flash files accrue PageRank?
<li>How does Googlebot handle #anchors in Flash-based link URLs?
<li>Does Google translate text found in Flash files?
</ul>
<p>You&#8217;ll have to <a href="http://www.beussery.com/blog/index.php/2008/10/google-flash-seo/">read Brian&#8217;s article</a> for the answers to those questions. He also shares some Flash SEO tips at the end.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/how-does-google-really-handle-flash-content-15081/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Now Crawling And Indexing Flash Content</title>
		<link>http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299</link>
		<comments>http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299#comments</comments>
		<pubDate>Tue, 01 Jul 2008 04:00:02 +0000</pubDate>
		<dc:creator>Vanessa Fox</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[SEO: Flash]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/google-now-crawling-and-indexing-flash-content-14299.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<p>Historically, search engines have been unable to extract content, such as text and links, from Flash (SWF) files. Subsequently, much of the Flash-based content on the web has been unavailable in search results. This situation has been frustrating for web developers, who have tried to come up with workarounds to help get search engines to index and rank their Flash pages.</p>
<p>This situation hasn&#8217;t been ideal for searchers either, as this limitation has kept them from seeing potentially great matches for their queries because they&#8217;ve been locked away in Flash files.</p>
<p>According to Adobe and Google, <a href="http://googleblog.blogspot.com/2008/06/google-learns-to-crawl-flash.html">all of that is changing</a>. Google is launching what they tell me is a &#8220;deep algorithmic change,&#8221; augmented by <a href="http://www.adobe.com/devnet/flashplayer/articles/swf_searchability.html">Flash reader technology supplied by Adobe</a>, that enables them to &#8220;read&#8221; Flash files and extract text and links from it for better indexing and ranking. This could be great news for both site owners and searchers.</p>
<p>Below, more details about how it all works, as well as some caveats for those who see this development as a Flash panacea and think they no longer have to ensure their Flash applications are search engine friendly.</p>
<p><span id="more-14299"></span>
<b>Google can now crawl and index Flash files</b></p>
<p>Google has been working on improving how they crawl and index rich content (such as Flash and JavaScript) for some time, and in fact have been able to <a href="http://googlewebmastercentral.blogspot.com/2007/07/best-uses-of-flash.html">extract some text and links from Flash files for a while</a>. However, their methods weren&#8217;t perfect, and they tell me that this new technology from Adobe makes Google&#8217;s algorithms &#8220;less error prone&#8221; and enables them to access content created in any version of Flash in a variety of languages.</p>
<p><b>Adobe is providing a Flash player to (some) search engines</b></p>
<p>Adobe says they have developed an optimized Flash player for search engines and are collaborating with both Google and Yahoo!. Yahoo! has not yet implemented the technology, although they stated that “Yahoo! is committed to supporting webmaster needs with plans to support searchable SWF and is working with Adobe to determine the best possible implementation.” Adobe hasn&#8217;t made the technology available to Microsoft&#8217;s Live Search, although they say they are &#8220;exploring ways to make the technology more broadly available&#8221; to &#8220;help make all SWF content more easily searchable.&#8221; Adobe didn&#8217;t comment on whether the fact that Microsoft developing competing Silverlight was a factor in their decision not to collaborate with Microsoft Live Search for this initial announcement.</p>
<p><b>A big step forward</b></p>
<p>Previously, Google&#8217;s help documentation has <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=72746">warned against the use of Flash-only sites</a>:</p>
<blockquote>&#8220;In general, search engines are text based. This means that in order to be crawled and indexed, your content needs to be in text format. This doesn&#8217;t mean that you can&#8217;t include images, Flash files, videos, and other rich media content on your site; it just means that any content you embed in these files should also be available in text format or it won&#8217;t be accessible to search engines.&#8221;</blockquote>
<p>They have suggested using Flash sparingly or using a method such as Scalable Inman Flash Replacement (sIFR) to provide an HTML source that can be rendered as either Flash or non-Flash.</p>
<p>That seems to have changed. The help documentation hasn&#8217;t been updated, but the post on the Google Webmaster blog says that <a href="http://googlewebmastercentral.blogspot.com/2008/06/improved-flash-indexing.html">Googlebot can now extract textual content and links</a> so Google can better crawl, index, and rank the web site.</p>
<p>Both Google and Adobe stressed to me that this is a big win for both site owners and searchers and that it should improve relevancy in search results. They noted that Flash developers don&#8217;t have to do anything in their applications to make this new technology work for their sites.</p>
<p>This is certainly great news for the web, as it&#8217;s a sign that search engines, which are the primary method of navigating the web, are evolving beyond text to take into account newer web technologies.</p>
<p>Just how much will this change impact search relevance? It&#8217;s hard to say until we see the changes, which Google says may take time to percolate through the pipeline. In particular, they note that snippets, the descriptions that display under search results, will be improved. Before, Google often couldn&#8217;t extract any content from a Flash file, so the description for a Flash page was often empty or would consist of the only text available from the file, such as the Flash version or the word &#8220;loading.&#8221;</p>
<p>But although Adobe&#8217;s press release talks about &#8220;dramatic&#8221; improvements in search results and more relevant listings for &#8220;millions of RIAs&#8221; (rich internet applications), neither Adobe nor Google could give me numbers about how many more pages Google was now about to crawl and index and how much this has impacted search results.</p>
<p>A quick look at how SWF files are currently indexed shows that there&#8217;s a lot of room for improvement, so this may indeed be big news for search.</p>
<p><a href="http://www.flickr.com/photos/23148333@N06/2627359192/" title="swfresults1 by Search Engine Land, on Flickr"><img src="http://farm4.static.flickr.com/3144/2627359192_80fd30fdb0.jpg" width="469" height="500" alt="swfresults1" /></a></p>
<p><b>Flash developers should still spend time on Search Engine Optimization</b></p>
<p>However, this isn&#8217;t the perfect solution that it may seem. Adobe assures developers that &#8220;RIA developers and rich Web content producers won’t need to amend existing and future content to make it searchable — they can be confident it can now be found by users around the globe.&#8221; But that&#8217;s not entirely true, particularly for Flash pages that have little textual content.</p>
<p><b>Only text and links are affected</b>
As Danny Sullivan noted last year when word of Google&#8217;s work in this area first came up, <a href="http://searchengineland.com/070327-103805.php">most Flash content isn&#8217;t made up of primarily words</a>. It&#8217;s made up of images, video, and animation, and none of that will be surfaced in search results with this advancement. Google&#8217;s new Flash algorithms extract text and links only. Everything else is still a black box.</p>
<p><b>Associate a unique URL with each unique piece of content</b>
In addition, the searcher experience is better served by Flash implementations that provide a unique URL for each set of content. Some Flash implementations dynamically load text as the user interacts with the application, but the URL remains the same. In this scenario, Googlebot can now follow those interactions (in a limited way) and if the URL doesn&#8217;t change, then all content that is dynamically loaded as the interactions progress is associated with a single URL.</p>
<p>Adobe says the Flash player it is providing to search engines &#8220;allows their search spiders to introspect and navigate through a live SWF application as if they were virtual users. The Flash Player technology, optimized for search spiders, enables the ability to traverse and parse all of the different paths in a SWF-based RIA, similar to traversing multiple pages in a standard web application.&#8221;</p>
<p>This means that if the content that is dynamically loaded into the Flash application from the fifth interaction matches a searcher query, that Flash application may be served in the search results. But when the searcher clicks over to that result, the content won&#8217;t be found on the page. The searcher will have to interact with the application until that content is loaded. Searchers may instead feel frustrated and abandon the page. For the best user experience and higher conversion rates from search, Flash developers should be careful to avoid this situation by creating distinct URLs for each piece of content. This implementation helps the Flash site be more viral as well, as users can email, Digg, and otherwise share the content more easily.</p>
<p>Google acknowledges this scenario may not be an ideal searcher experience, but points out that other non-HTML file formats such as PDFs have the same limitations. When a searcher clicks through the Google search results to a PDF file, the content that matched the query may not be on the first page of that PDF and the searcher has to scroll through the file to find the desired content. Google notes that just as they flag PDF files to alert searchers that the result is non-HTML, they do the same with Flash file, as shown below.</p>
<p><a href="http://www.flickr.com/photos/23148333@N06/2627359194/" title="swfresults2 by Search Engine Land, on Flickr"><img src="http://farm4.static.flickr.com/3185/2627359194_dbde00e1a8.jpg" width="500" height="91" alt="swfresults2" /></a></p>
<p><b>What this means for SEOs</b></p>
<p>Flash has often been a source of frustration for SEOs who argue that text should be in HTML, with Flash used for non-textual content, such as video illustrations. Can SEOs now remove the &#8220;review Flash implementation&#8221; line from their checklists? Probably not. However, it should be easier for SEOs to work with Flash-based sites going forward.</p>
<p>SEOs should keep in mind that these new algorithms don&#8217;t take into account any meta data or formatting markup in the Flash file and, for now, Google&#8217;s cache won&#8217;t show a representation of the extracted text so site owners can&#8217;t verify what is actually being crawled by viewing the cached copy. In addition, since Googlebot doesn&#8217;t execute most JavaScript, Google won&#8217;t crawl or index any Flash executed via JavaScript. Any external sources that the Flash file loads will be indexed separately, rather than as part of the Flash file. And as noted earlier, all non-textual content will remain uncrawled. This new Flash support covers all languages other than bidirectional ones (Hebrew and Arabic) and all versions of Flash.</p>
<p><b>What this means for accessibility and usability</b></p>
<p>Flash developers should continue to think about not only how well their applications can be found in search, but how <a href="http://googlewebmastercentral.blogspot.com/2008/04/webmaster-tips-for-creating-accessible.html">usable and accessible</a> they are.</p>
<p>Eric Wittman, director of platform distribution and business development at Adobe, told me that Flash web sites can be built for usability and accessibility. He noted that 98% of desktop computers have Flash support, although he acknowledged that he didn&#8217;t know how many have Flash blockers installed, and he didn&#8217;t provide numbers on the percentage of mobile devices that don&#8217;t support Flash.</p>
<p>He noted that screen reader support has been available since Flash Player 6 and that the newer Flex framework includes support for accessibility.</p>
<p>At the recent <a href="http://searchmarketingexpo.com/advanced/2008/developer-day.php">Developer Day at SMX Advanced</a>, we talked a lot about <a href="http://searchengineland.com/080411-101105.php">making Flash applications accessible and search-engine friendly</a> using <a href="http://blog.iconara.net/2007/05/08/google-and-flash/">graceful degradation</a>, and both <a href="http://wiki.novemberborn.net/sifr">sFIR</a> and <a href="http://code.google.com/p/swfobject/">SWFObject</a> came up several times as good methods for ensuring this. I find that still to be good advice even in light of this announcement.</p>
<p>Overall, this announcement is great news for both content owners and searchers. Web developers are becoming more focused on <a href="http://janeandrobot.com/">architecting their sites to ensure they can be  found in search engines</a>, as search has become one of the primary acquisition channels online. As web technologies evolve, it&#8217;s important for search engines to evolve as well to ensure they provide the most relevant results for searchers. I&#8217;m eager to see how substantial a change this proves to be, although web developers should continue keeping search-friendly practices in mind when developing sites.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-now-crawling-and-indexing-flash-content-14299/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How Flash Can Be Search Engine Friendly</title>
		<link>http://searchengineland.com/how-flash-can-be-search-engine-friendly-13754</link>
		<comments>http://searchengineland.com/how-flash-can-be-search-engine-friendly-13754#comments</comments>
		<pubDate>Fri, 11 Apr 2008 14:11:05 +0000</pubDate>
		<dc:creator>Eric Enge</dc:creator>
				<category><![CDATA[SEO: Flash]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/how-flash-can-be-search-engine-friendly-13754.php</guid>
		<description><![CDATA[For a long time there has been a tug of war between developers and designers who love to use Flash and SEOs who know how bad Flash can be from a search engine crawler perspective. The designer wants the beauty of an all Flash site. The SEO wants to bring the site traffic. It&#8217;s a [...]]]></description>
			<content:encoded><![CDATA[<p>For a long time there has been a tug of war between developers and designers who love to use Flash and SEOs who know how bad Flash can be from a search engine crawler perspective.  The designer wants the beauty of an all Flash site.  The SEO wants to bring the site traffic.  It&#8217;s a tough situation, but there are ways to implement Flash and still provide something for the crawlers to chew on.</p>
<p><span id="more-13754"></span></p>
<p><strong>Scalable Inman Flash Replacement (sIFR)</strong>
<a href="http://wiki.novemberborn.net/sifr">sIFR</a> is a technique that uses Javascript to read in HTML text and render it in Flash instead. The essential fact to focus on here is that the method guarantees that the HTML content and the Flash content are identical. The intended use for this is to render text in an anti-aliased font.  This can provide a great improvement in the presentation of your site.</p>
<p>Don&#8217;t make too extensive use of the technique.  According to sIFR&#8217;s creator, Mike Davidson, you shouldn&#8217;t try to redo your whole page with sIFR.  He says that &#8220;sIFR is for heacdlines, pull quotes, and other small swaths of text.&#8221;  Overuse the technique and suddenly your page will take a long time to load.</p>
<p>How do search engines view this?  In July of 2007, <a href="http://www.highrankings.com/getting-into-google/">Google&#8217;s Dan Crow went to a Search Engine Marketing New England (SEMNE) event</a>.  Dan, who is head of Google&#8217;s crawl team, said that as long as this technique is used as it is intended, that Google was fine with it.</p>
<p>The key thing that drives this statement is that the HTML version of the text is the input used by sIFR to create its output.  It is not possible for them to be different.  For this reason, this technique is not susceptible to spamming, and there is no reason for a search engine to worry about its implementation.</p>
<p><strong>SWFObject</strong>
The basic underlying concept with <a href="http://code.google.com/p/swfobject/">SWFObject</a> is similar, but the implementation is noticeably different.  SWFObject starts by detecting if a user agent (a browser or crawler) is capable of executing Flash.  If they are not, then the pure HTML version of the given block of content is shown.  However, if the user agent is capable of executing Flash, the Flash version of the content is shown instead.</p>
<p>Unlike sIFR, this method does not guarantee that the HTML and the content in the Flash are the same. SWFObject does not reference the text in the HTML at all. It simply runs a pre-compiled Flash movie in place of the HTML.</p>
<p>This technique could conceivably be used by spammers to show one type of content in Flash and another in HTML.  In the extreme case, you could have information on travel to a Caribbean island in HTML replaced by a Viagra ad in Flash.  This is, of course, not its intended purpose.</p>
<p>At the same SEMNE event, Dan Crow indicated that this technique was &#8220;dangerous.&#8221; Even though this technique could be used for entirely legitimate reasons (e.g., the same purpose as outlined for sIFR above), there is no way for Google to detect that.</p>
<p>In talking with other search engine representatives, I have also heard it suggested that their concept of a clean implementation of SWFObject would depend on the content in the Flash and the HTML being identical.</p>
<p>In practice, it is really difficult for a search engine to ban or punish a site for using such a technique without human review.  I am not saying that sites don&#8217;t get hurt by the use of this technique, but there is no way that the search engines could simply punish sites based on the degree of use of SWFObject without creating a great outcry.</p>
<p>Simply put, it&#8217;s a legitimate technique for providing accessibility for those with poor eyesight, and providing content to users who have Flash disabled.  I think one of the bigger challenges is that you end up with the obligation to craft well thought out implementations in two different mediums at the same time, one in Flash, and one in HTML.</p>
<p>Since the mediums are quite different, you would be tempted to do things quite differently.  As noted above, the search engines indicate that they want your HTML content and Flash content to be identical.  However, Flash will sometimes lead you down a path to do things quite differently.</p>
<p>This is a good thing, in terms of creating the best user experience with such a site.  Search engine comments aside, my own opinion is that a search engine would be hard pressed to justify implementing a penalty on a site if the content of the HTML and the content of the Flash were <strong>substantially</strong> similar (as opposed to identical).</p>
<p>Note, however, that my opinion notwithstanding, Google has not come out in support of implementing similar but not identical content, and in fact warns that if sites present text in the HTML that is not present in the Flash, the site could risk violating the Google webmaster guidelines.</p>
<p><strong>No Need For A Flash War After All</strong></p>
<p>For a long time SEOs have stood against Flash as a technique.  With sIFR and SWFObject, you can implement Flash on your site and still be search engine friendly.  Certainly, this brings some additional challenges due to the need to implement the content twice when you use SWFObject, and a certain amount of search engine risk.</p>
<p>Even with SWFObject, I think your risk is low if your implementation is truly clean, but the risk is not zero.  The best thing to do to minimize that risk is make your HTML implementation as close to the Flash implementation as possible.</p>
<p><i>Eric Enge is the president of <a href="http://www.stonetemple.com">Stone Temple Consulting</a>, an SEO consultancy outside of Boston. Eric is also co-founder of Moving Traffic Inc., the publisher of <a href="http://www.customsearchguide.com">Custom Search Guide</a></i></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/how-flash-can-be-search-engine-friendly-13754/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Search Engine Unfriendliness Of Web 2.0</title>
		<link>http://searchengineland.com/the-search-engine-unfriendliness-of-web-20-12465</link>
		<comments>http://searchengineland.com/the-search-engine-unfriendliness-of-web-20-12465#comments</comments>
		<pubDate>Thu, 18 Oct 2007 11:46:10 +0000</pubDate>
		<dc:creator>Stephan Spencer</dc:creator>
				<category><![CDATA[All Things SEO]]></category>
		<category><![CDATA[SEO: Flash]]></category>
		<category><![CDATA[SEO: General]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/the-search-engine-unfriendliness-of-web-20-12465.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<p><a href="http://searchengineland.com/lands/100-organic.php">
<img border="0" src="http://searchengineland.com/images/organic100.jpg" alt="100% Organic - A Column From Search Engine Land" align="left" hspace="5" vspace="3" width="100" height="100"></a> Wouldn&#8217;t it be great if all those whiz-bang Web 2.0 interactive elements based on AJAX (Asynchronous JavaScript and XML) and Flash&mdash;such as widgets and gadgets and Google Maps mashups&mdash; were search engine optimal?</p>
<p>Unfortunately, that&#8217;s not the case. In fact, these technologies are inherently unfriendly to search engine spiders. So, if you intend to harness Web 2.0 technologies for wider syndication, increased conversion, improved usability and greater customer engagement, you&#8217;d better read on or you&#8217;ll end up missing the boat when it comes to better search engine rankings.</p>
<p><span id="more-12465"></span>
When it comes to AJAX and Flash, the onus is on you to render them search engine friendly. The major search engines just can&#8217;t cope with these Web 2.0 technologies very well at all.</p>
<p><b>Flash not friendly</b></p>
<p>Let&#8217;s start with Flash, a technology with which many of us already are familiar. Some search engines, including Google, have rudimentary means of extracting content and links from Flash. Nonetheless, any content or navigation embedded within Flash will, at best, rank poorly in comparison to a static, HTML-based counterpart, and at worst, not even make it into the search engine&#8217;s index.</p>
<p>Google&#8217;s view on Flash is that it doesn&#8217;t provide a user-friendly experience. Flash is wholly inaccessible to the vision-impaired, unrenderable on devices such as mobile phones and PDAs, and can&#8217;t be accessed without broadband connectivity.</p>
<p>In particular, Google frowns on navigational elements presented exclusively in Flash. Given this stance, Google isn&#8217;t likely to make big improvements on how it crawls, indexes and ranks Flash files anytime soon. So, it&#8217;s in your hands to either replace those Flash elements with a more accessible alternative like CSS/DHTML or to employ a Web design approach known as &#8220;progressive enhancement,&#8221; whereby designs are layered in a concatenated manner to provide an alternative experience for non-Flash users. This way, all users, including search engine spiders, will be able to access your content and functionality.</p>
<p>An example of progressive enhancement in action can be found at Amazon.com&#8217;s &#8220;<a href="http://www.amazon.com/gp/cyo/cyo-state-manager.html/103-4733660-0889417?ie=UTF8&#038;sequenceStep=step1&#038;pipelineID=cyor&#038;sequenceID=sequence1">Create Your Own Ring</a>&#8221; on the Web. Simply turn off the JavaScript capabilities in your browser and build your ring&mdash;with or without the Flash interaction. All customers are equally served.</p>
<p><b>Problems with AJAX</b></p>
<p>AJAX poses similar problems to spiders as Flash does. That&#8217;s because AJAX also relies on JavaScript&mdash;that&#8217;s what the &#8220;J&#8221; in AJAX stands for, after all (the complete acronym stands for Asynchronous JavaScript and XML). Search engine spiders can&#8217;t execute JavaScript commands (or Java either, for that matter). AJAX can be used to pull data seamlessly in the background onto an already loaded Web page, sparing the user from the &#8220;click-and-wait&#8221; frustrations associated with more conventional Web sites. It&#8217;s a great timesaver for users, but the additional content that&#8217;s pulled in via AJAX is invisible to the spiders unless it&#8217;s preloaded into the page&#8217;s HTML and simply hidden from the user via CSS.</p>
<p>Here, progressive enhancement renders a non-JavaScript version of the AJAX application for spiders and JavaScript-incapable browsers. A low-tech alternative to progressive enhancement is to place an HTML version of your AJAX application within noscript tags (see <a href="http://www.thecleanermovie.com">TheCleanerMovie.com</a> for an example).</p>
<p>Other options include rendering static HTML pages from product searches, as the vertical shopping engine Become.com does. Google&#8217;s guidelines warn that your search result pages must provide value to users to warrant inclusion in its index. So, extra care must be taken if employing this approach.</p>
<p>Widgets, the mini applications webmasters are encouraged to place on their sites to pull data from an external source, are in most cases inaccessible to search engine spiders. Most widgets are built in search engine unfriendly Flash or AJAX.</p>
<p>A well-loved widget in the blogosphere is <a href="http://www.eurekster.com">Eurekster&#8217;s</a> &#8220;<a href="http://swicki.eurekster.com">Swicki</a>,&#8221; with its &#8220;What&#8217;s Popular&#8221; buzzcloud, which you may have seen in the sidebars of popular blogs like <a href="http://www.techcrunch.com">TechCrunch</a>. The &#8220;What&#8217;s Popular&#8221; buzzcloud. Under our tutelage, Eurekster made its widget more search engine friendly and reaped the benefits with a huge influx of search-referred traffic.</p>
<p>As you head down the road of Web 2.0, just remember that user-friendly doesn&#8217;t readily translate into search engine friendly without some assistance. But know that help is available, particularly in the form of progressive enhancement,  when you embrace this brave new World Wide Web.</p>
<p><i>Stephan Spencer is founder and president of <a href="http://www.netconcepts.com/">Netconcepts</a>, a 12-year-old web agency specializing in search engine optimized ecommerce. He writes for several publications and blogs at the <a href="http://www.naturalsearchblog.com/">Natural Search Blog</a>. The <a href="http://searchengineland.com/lands/100-organic.php">100% Organic</a> column appears Thursdays at <a href="http://searchengineland.com">Search Engine Land</a>.</i></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/the-search-engine-unfriendliness-of-web-20-12465/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Meta Keywords Tag 101: How To &#8220;Legally&#8221; Hide Words On Your Pages For Search Engines</title>
		<link>http://searchengineland.com/meta-keywords-tag-101-how-to-legally-hide-words-on-your-pages-for-search-engines-12099</link>
		<comments>http://searchengineland.com/meta-keywords-tag-101-how-to-legally-hide-words-on-your-pages-for-search-engines-12099#comments</comments>
		<pubDate>Wed, 05 Sep 2007 23:42:21 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Ask: SEO]]></category>
		<category><![CDATA[Features: General]]></category>
		<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[How To: SEO]]></category>
		<category><![CDATA[Microsoft: Bing SEO]]></category>
		<category><![CDATA[SEO: Flash]]></category>
		<category><![CDATA[SEO: General]]></category>
		<category><![CDATA[SEO: Tagging]]></category>
		<category><![CDATA[SEO: Writing & Body Copy]]></category>
		<category><![CDATA[Yahoo: SEO]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/meta-keywords-tag-101-how-to-legally-hide-words-on-your-pages-for-search-engines-12099.php</guid>
		<description><![CDATA[If there&#8217;s anything I particularly hate when it comes to SEO, it&#8217;s the meta keywords tag. I so wish it had never been invented. It&#8217;s practically useless, yet people still obsess over it. In this article, I&#8217;ll explain more about why you shouldn&#8217;t worry about it except perhaps for misspellings, as well as which search [...]]]></description>
			<content:encoded><![CDATA[<p>If there&#8217;s anything I particularly hate when it comes to SEO, it&#8217;s the meta keywords tag. I so wish it had never been invented. It&#8217;s practically useless, yet people still obsess over it. In this article, I&#8217;ll explain more about why you shouldn&#8217;t worry about it except perhaps for misspellings, as well as which search engines support it.</p>
<p>The meta keywords tag is one of several of meta tags that you can insert into your web pages to provide search engines with information about your pages that isn&#8217;t visible on the page itself. For example, my <a href="http://searchengineland.com/070305-204850.php">Meta Robots Tag 101: Blocking Spiders, Cached Pages &amp; More</a> article covers how you can use a different meta tag &#8212; the meta robots tag &#8212; to block pages from being indexed. Users don&#8217;t see this information (unless they look at your source code), but search engines do.</p>
<p><strong>Meta Tags &amp; Your Header</strong></p>
<p>Meta tags go within the header area of your web pages. A typical head might look like this:</p>
<blockquote>&lt;head&gt;
&lt;title&gt;Welcome To Shoe Central!&lt;/title&gt;
&lt;meta name=&#8221;description&#8221; content=&#8221;All the best prices on shoes!&#8221; /&gt;
&lt;meta name=&#8221;robots&#8221; content=&#8221;noodp&#8221; /&gt;
&lt;meta name=&#8221;keywords&#8221; content=&#8221;shoe, shoes, shoee, shos, footwear&#8221; /&gt;
&lt;/head&gt;</blockquote>
<p>The header is the section that begins &lt;head&gt; and ends &lt;/head&gt;. Between those elements, in our example, you have these tags:</p>
<ul>
<li><strong>Title: </strong>The text here becomes the title that is shown in search engine listings, in most cases.</li>
<li><strong>Description:</strong> The text here is text that search engines sometimes use as a description for your web page when listing it (a meta tag lesson for another time).</li>
<li><strong>Robots:</strong> This particular tag is configured to ensure that the page isn&#8217;t described using the a description that the Open Directory might have for it (<a href="http://searchengineland.com/070305-204850.php">Meta Robots Tag 101</a> explains this more).</li>
<li><strong>Keywords: </strong>This tag is the topic of this article, so read on!</li>
</ul>
<p><strong>History Of Meta Keywords</strong></p>
<p>I&#8217;ve long written about search engines and meta tags, but I have never been able to pin down exactly who created the meta keywords tag. There&#8217;s a December 1995 internet draft memo that&#8217;s the earliest and most authoritative mention of the tag I know of. <a href="http://tools.ietf.org/html/draft-musella-html-metatag-01">It says</a>:</p>
<blockquote>&lt;META HTTP-EQUIV= &#8220;Keywords&#8221; CONTENT= &#8220;Italy Product, Italy Tourism&#8221;&gt;</p>
<p>The spaces between a comma and a word or vice versa are ignored&#8230;.</p>
<p>These &#8216;keywords&#8217; were specifically conceived for exhaustively and completely catalogue the HTML document. This allows the software agents to index at best your own document. To do a preliminary indexing, it&#8217;s important to use at least the http-equiv meta-tag &#8220;keywords&#8221;.</blockquote>
<p>Sounds good, right? Like this is designed for the search engines to use? The issue is that HTML specs like these (especially drafts) are not necessarily used by the search engines. They can use them, ignore them or build upon them as they see fit.</p>
<p>As it turns out, several of the major search engines <a href="http://www.w3.org/Search/9605-Indexing-Workshop/ReportOutcomes/Spidering.txt"> got together</a> in May 1996 to talk about meta data. That meeting gave birth to a common standard for the meta robots and the meta description tags. As for the meta keywords tag, it was discussed, but no specification emerged.</p>
<p>Despite no specification, both Infoseek (later Go.com, these days no longer crawling the web) and AltaVista (now owned and powered by Yahoo) offered support for the meta keywords tag in 1996. If you looked at their help files at the time, they encouraged site owners to use the tag. Inktomi (now owned by Yahoo) also provided support when it began operations later in 1996, and Lycos (no longer crawling the web) added support in 1997.</p>
<p>That year &#8212; 1997 &#8212; was the last year that the meta keywords tag enjoyed support among the majority of major crawlers out there (4 out of 7 &#8211; Excite, WebCrawler and Northern Light, also crawling the web that year, did not support it).</p>
<p><strong>Support Dies Off</strong></p>
<p>When new search engines emerged in 1998, such as Google and FAST, they didn&#8217;t support the tag. The reason was simple. By that time, search engines had learned that some webmasters would &#8220;stuff&#8221; the same word over and over into the meta keywords tag, as a way of trying to rank better. At the time, search engines didn&#8217;t rely so heavily on link analysis, so page stuffing like this was more effective. Alternatively, some site owners would insert words that they weren&#8217;t relevant for.</p>
<p>In July 2002, AltaVista dropped its support of the tag. That left Inktomi as the only major crawler still supporting it, causing me to somewhat famously in the SEO world to <a href="http://searchenginewatch.com/showPage.html?page=2165061">declare</a> the tag dead, since it was no longer a major ranking factor for even Inktomi:</p>
<blockquote>Traffick.com&#8217;s Andrew Goodman wrote recently in an essay about meta tags, &#8220;If somebody would just declare the end of the metatag era, full stop, it would make it easier on everyone.&#8221;</p>
<p>I&#8217;m happy to oblige, at least in the case of the meta keywords tag. Now supported by only one major crawler-based search engine &#8212; Inktomi &#8212; the value of adding meta keywords tags to pages seems little worth the time. In my opinion, the meta keywords tag is dead, dead, dead. And like Andrew, good riddance, I say!</blockquote>
<p>Since that time, Inktomi was rolled up into Yahoo, which continues to support the meta keywords tag as part of its Yahoo search engine. Or does it?</p>
<p><strong>Search Engine Rep Confusion</strong></p>
<p>Last month, I moderated a panel of search reps when that perennial favorite question came up during the session. Who supports the meta keywords tag?</p>
<p>Sigh. But if this question still coming up wasn&#8217;t depressing enough, then the search engine reps starting responding with a load of confusion. To paraphrase:</p>
<blockquote>No, we don&#8217;t support it. Well, we read it. We read it, but it doesn&#8217;t matter. Actually, maybe we don&#8217;t read it.</blockquote>
<p>Even Evan Roseman from Google said at one point that Google reads the meta keywords tag, suggesting no doubt to some that Google uses the tag.</p>
<p>To be clear, Google doesn&#8217;t. I&#8217;ll prove it further below, but it doesn&#8217;t, OK?</p>
<p>I gave Evan (hopefully) some good humored hassle afterward for saying this. He&#8217;s at least the second Google rep to declare this on panels I&#8217;ve moderated in as many years, and the problem is that the engineers (from any of the search engines) often take the question too literally.</p>
<p><strong>Indexing Versus Retrieval Versus Ranking</strong></p>
<p>To understand, let me talk about three different things a search engine does when it crawls and lists your page:</p>
<ul>
<li><strong>Indexing: </strong>This is where the search engine effectively makes a copy of your page. The search engine is going to read and store the HTML content it finds &#8212; all of it. Evan was right when he said that the meta keyword tag is indexed by Google. Google knows that the tag exists and has recorded what&#8217;s in it. <strong>But that doesn&#8217;t mean it does anything else with it.</strong></li>
<li><strong>Retrieval: </strong>This is where the search engine finds all the matching documents relevant for what you searched for. Most of those documents will actually have the words you searched for on them, in the sections that the search engine searches against (there are some exceptions, such as when anchor text is used to find pages. <a href="http://searchengineland.com/070315-221747.php">Google Now Reporting Anchor Text Phrases</a>, <a href="http://searchengineland.com/070125-230048.php">Google Kills Bush&#8217;s Miserable Failure Search &amp; Other Google Bombs</a> and <a href="http://searchengineland.com/070420-121152.php">Google Declares Stephen Colbert As Greatest Living American</a> explain more about this). While the search engine has recorded the entire page, it won&#8217;t search against everything indexed for retrieval. In other words, Google will look to see if words you searched for appear in the body area of a document, but it will NOT look in the meta keywords tag for matching words. The keywords tag, while indexed, is not used for retrieval at Google. At Yahoo, it is.</li>
<li><strong>Ranking: </strong>This is where the search engine looks at all those documents retrieved for a search and puts them in order of most importance, according to its algorithm. Retrieval (or what information research professionals call &#8220;recall&#8221;) is about finding everything). Ranking (or what the IR folks call &#8220;precision&#8221; &#8212; see Tim Bray&#8217;s excellent <a href="http://www.tbray.org/ongoing/When/200x/2003/06/22/PandR">On Search: Precision and Recall</a> document) is about getting the best stuff up to the top. Yahoo, while using the tag for retrieval, really doesn&#8217;t assign much weight to it for ranking.</li>
</ul>
<p><strong>Testing For Retrieval</strong></p>
<p>Back to my panel experience. Since the reps were unclear, I declared to the audience that I&#8217;d just go out and test it again myself. It&#8217;s literally been about five years since I&#8217;ve last tested the tag, because I (<a href="http://www.seomoz.org/article/search-ranking-factors#f5">and many others</a>) feel it is so useless. There are better things to do with our time. But since that question needs a big old stake to the heart, I rolled up my sleeves and got cracking.</p>
<p>On the <a href="http://searchengineland.com/">Search Engine Land home page</a>, I inserted this meta keywords tag:</p>
<blockquote>&lt;meta name=&#8221;keywords&#8221; content=&#8221;qiskodslajdmnkd, ddakaieciuaj jkdalladpaoaw, wdaopeqndlkakljad&#8221; /&gt;</blockquote>
<p>I had searched for all of these words on the four major search engines of Google, Yahoo, Microsoft and Ask and found no pages that matched. If these search engines made use of the meta keywords tag, I&#8217;d know in short order, if my page started coming up.</p>
<p>The tag went up on August 28. I then needed to wait until I could see each search engine had the most current version of my page (<a href="http://searchengineland.com/070227-154718.php">Squeezing The Search Loaf: Finding Search Engine Freshness &amp; Crawl Dates</a> explains more on how to do this).</p>
<p><strong>Google: No</strong></p>
<p>It took two days, until August 30, for Google to show the latest version of my page in its index. I searched for each of the words, and my home page didn&#8217;t come up. The meta keyword tag was not used for retrieval and thus not supported.</p>
<p><strong>Microsoft Live: No</strong></p>
<p>It took five days, until September 2, for Microsoft to show a version of my page with the meta keywords tag on it. As an aside, Microsoft is kind of annoying. It will say something like this in the cached copy of the page:</p>
<blockquote>This is a version of http://searchengineland.com/ as it looked when our crawler examined the site on 9/2/2007. The page you see below is the version in our index that was used to rank this page in the results to your recent query. This is not necessarily the most recent version of the page &#8211; to see the most recent version of this page, visit the page on the web.</blockquote>
<p>If you glance quickly at the date, you might think the page has been revisited fairly recently. But as the text explains, it might be older. Indeed, when I looked on September 2 (as is the case today), the copy of the page in the index was as of August 30, as I could tell from the stories shown.</p>
<p>As with Google, I searched for each of the words, and my page didn&#8217;t come up. The meta keyword tag was NOT used for retrieval and thus not supported.</p>
<p><strong>Yahoo: Yes</strong></p>
<p>It took two days, until August 30, for Yahoo to have my latest page. Searches there did bring up the home page for all words. So the meta keywords tag IS used for retrieval.</p>
<p><strong>Ask: Yes</strong></p>
<p>Ask took the longest to show the most current version of my page, not reflecting the changes until today. Actually, when I look at the <a href="http://www.askcache.com/webcp?q=http://searchengineland.com&amp;t=http+searchengineland-com&amp;r=http%3A%2F%2Fsearchengineland.com&amp;cache=00*2h5reafh0o6h8&amp;qlang=3&amp;url=http://searchengineland.com/&amp;page=1&amp;o=0&amp;l=dir&amp;ws=1&amp;ax=1"> cached copy</a> even now, it says that the page is from August 13 and uses a redirection URL rather than my <a href="http://searchengineland.com"> http://searchengineland.com</a> address.</p>
<p>Still, I can tell Ask has a version with the meta keywords tag on it since I&#8217;m getting back my home page when searching for words in that tag. As with Yahoo, the meta keywords tag IS used for retrieval.</p>
<p><strong>Should You Use It? Sure, For Misspellings</strong></p>
<p>So there you have it &#8212; half of the major crawlers (Yahoo &amp; Ask.com) DO support the tag. Should you begin using it? My advice would be only for misspellings and really unusual words.</p>
<p>As explained, the tag can help with retrieval. A word in the tag is treated as if it were a word visible on the page itself. Now that&#8217;s handy for misspellings. For example, say you&#8217;re writing about Basset hounds. You suspect some people might misspell the name as Bassett hounds, adding an extra T. You could misspell the word yourself on the visible page, but that makes you look bad. You could insert the word and then try to hide it using CSS styles or putting it in the same color as the page background. But this type of &#8220;hidden&#8221; text is generally against search engine guidelines.</p>
<p>Enter the meta keywords tag. Just do this:</p>
<blockquote>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett&#8221; /&gt;</blockquote>
<p>Now you&#8217;ve got the misspelling on your page in a &#8220;legal&#8221; means that will be read by Yahoo and Ask. You&#8217;re still out of luck for Google and Live.com, but two out four ain&#8217;t bad.</p>
<p><strong>But I Want To Rank!</strong></p>
<p>What about ranking better with the tag. I mentioned already that many experienced SEOs don&#8217;t find it useful. Believe me, if just putting a single word into that tag was going to rank your page better, everyone would be doing it. Instead, search for anything on Yahoo or Ask. You&#8217;ll see plenty of pages ranking well for words without those words appearing in the meta keywords tag. And if you do see the words in the tag, it&#8217;s more due to coincidence &#8212; the words also appear in the body copy, in the title tag and often in links pointing at the page. The words in the meta keywords tag aren&#8217;t the primary reason the page is ranking well. Promise.</p>
<p>Back to our Basset Hound example. Sure, you can add the correct spelling to your meta keywords tag. Go ahead, if you want. Just understand that it is not likely to make you rank any better than if you didn&#8217;t include it at all. Moreover, beginners are especially likely to spend far too long worrying about getting the &#8220;right&#8221; words in the meta keywords tag rather than just writing good body copy.</p>
<p><strong>Comma Conundrum</strong></p>
<p>One of the most common questions I used to get way back in the old days was over using commas in the meta keywords tag. Consider these options:</p>
<ol>
<li>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett, hound, hounds, basset&#8221; /&gt;</li>
<li>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett,hound,hounds,basset&#8221; /&gt;</li>
<li>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett hound, bassett hounds, basset hound, basset hound&#8221; /&gt;</li>
<li>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett hound,bassett hounds,basset hound,basset hound&#8221; /&gt;</li>
<li>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett hound bassett hounds basset hound basset hound&#8221; /&gt;</li>
<li>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett hound basset hounds&#8221; /&gt;</li>
</ol>
<p>Sigh. See why I hate this tag so much, when I&#8217;ve had to deal with people wondering about commas and spaces and variations like this. Let&#8217;s take it from the top, as to the motivations behind these versions:</p>
<ol>
<li>This is someone who thinks that each word should be on its own, separated by a comma and with a space in front of the next word.</li>
<li>This is someone who thinks that getting rid of the spaces means they can squeeze in more words.</li>
<li>This is someone who thinks that if there are particular phrases they want to be found for, those phrases should be together and set off by commas.</li>
<li>As with three, but losing the spaces to squeeze in more words.</li>
<li>Similar to three but thinking you don&#8217;t need commas at all.</li>
<li>This is Mr. or Ms. Paranoid. They&#8217;re concerned about saying any word too often. So they lose the commas, restrict repetition and hope that proximity will help (IE, put &#8220;basset&#8221; behind &#8220;hound&#8221; rather than in front and maybe you&#8217;ll still show up for &#8220;basset hound.&#8221;</li>
</ol>
<p>Which way should you go? I&#8217;d suggest number three, for these reasons:</p>
<ul>
<li>Yahoo has long recommended using commas and in particular supported them as a way to separate out distinct terms for those in their paid inclusion <a href="http://searchmarketing.yahoo.com/srchsb/index.php">programs</a>. I&#8217;ll update this page with the latest advice, but commas still seem to make sense.</li>
<li>Spaces just make things look nicer, and you shouldn&#8217;t be shoving a ton of terms in the tag anyway. How long is too long? No idea! In the past, the search engines just wouldn&#8217;t index content beyond around 250 to 1,000 characters. Maybe I&#8217;ll test this in the future.</li>
<li>You do want phrases kept together. &#8220;bassett, hound&#8221; is probably going to be seen as &#8220;bassett hound&#8221; anyway, but why risk it?</li>
</ul>
<p><strong>Other Uses</strong></p>
<p>I mentioned that misspellings were a key use for the tag. You could also use it for synonyms. For example, if you have a page all about shoes and you never say &#8220;footwear,&#8221; you could put that word in your tag. However, it&#8217;s far better if you just find a way to make use of the word in the body copy itself. That text is retrieved by all the major search engines, not just some.</p>
<p>Aside from synonyms, perhaps you have a page that&#8217;s all Flash or all images. Use the meta keywords tag to describe the page. Just remember that you&#8217;re still not likely to rank better than other pages that have textual information. Search engines are textual creatures. Give them what they want.</p>
<p><strong>Some Official Guidelines</strong></p>
<p>The W3C has <a href="http://www.w3.org/TR/html4/appendix/notes.html#h-B.4"> guidelines</a> (and <a href="http://www.w3.org/TR/html4/struct/global.html#h-7.4.4.2">here</a>) in HTML 4.0 about meta data and search engines, while the XHTML specs don&#8217;t get into it at all. Ignore the specs. YES, IGNORE THE SPECS. Some of them are wrong; some are outdated. The only thing I can see that they <a href="http://www.w3.org/TR/xhtml1/#h-4.6">explain</a> is the difference between these:</p>
<ul>
<li>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett&#8221;&gt;</li>
<li>&lt;meta name=&#8221;keywords&#8221; content=&#8221;bassett&#8221; /&gt;</li>
</ul>
<p>See how the second tag ends /&gt; rather than &gt; in the first? As best I can tell, this is because a meta tag is an &#8220;empty element&#8221; in XHTML, where there&#8217;s not a &#8220;start&#8221; and a &#8220;finish&#8221; (as with a paragraph element: &lt;p&gt; is the beginning, with &lt;/p&gt; the end). Empty elements in XHTML need that /&gt; format.</p>
<p>I haven&#8217;t tested things without the /&gt;, but there are so many (so very, very many) pages out there not following that syntax that it is virtually certain Yahoo and Ask will read the tag either way. Doing it fresh? Do it /&gt; style. But don&#8217;t go back and start changing things.</p>
<p>Aside from that, if you want to know how a search engine deals with meta data officially, you go to the search engine itself. <a href="http://about.ask.com/en/docs/about/webmasters.shtml">Ask&#8217;s webmaster guidelines</a> don&#8217;t mention the meta keywords tag, so that leaves Yahoo:</p>
<ul>
<li><a href="http://help.yahoo.com/l/us/yahoo/search/basics/basics-18.html"> Yahoo Quality Guidelines</a>: &#8220;Metadata (including title and description) that accurately describes the contents of a web page.&#8221; This is telling you don&#8217;t lie with your keywords. Don&#8217;t insert words that aren&#8217;t somehow related to the topic of your page.</li>
<li><a href="http://help.yahoo.com/l/us/yahoo/search/ranking/ranking-02.html"> How do I improve the ranking of my web site in the search results?</a>: &#8220;Use a &#8216;keyword&#8217; meta-tag to list key words for the document. Use a distinct list of keywords that relate to the specific page on your site instead of using one broad set of keywords for every page.&#8221; Note that it doesn&#8217;t say you&#8217;ll automatically rank better by doing this. Also, unique words for each page would be my advice, as well &#8212; but do NOT worry if you decide to use the same set of key terms on each of your pages. It isn&#8217;t that big of a deal.</li>
</ul>
<p>Looking for the exact format that you should use for the meta keywords tag from Yahoo? You know, commas, spaces and all that. Sorry &#8212; they don&#8217;t provide it, which is another sign you&#8217;re probably worrying too much about it.</p>
<p><strong>Freaked? Skip It</strong></p>
<p>Overall, here&#8217;s the best advice I can offer anyone dealing with this tag. If you begin to feel confused, concern, tired or uncertain when pondering it, SKIP THE TAG ENTIRELY. It&#8217;s not going to hurt you to not have it, and it&#8217;s not worth the time fretting about it.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/meta-keywords-tag-101-how-to-legally-hide-words-on-your-pages-for-search-engines-12099/feed</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Don&#8217;t Expect A Magic Flash Solution For Search Engines</title>
		<link>http://searchengineland.com/dont-expect-a-magic-flash-solution-for-search-engines-10826</link>
		<comments>http://searchengineland.com/dont-expect-a-magic-flash-solution-for-search-engines-10826#comments</comments>
		<pubDate>Tue, 27 Mar 2007 14:38:05 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[SEO: Flash]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/dont-expect-a-magic-flash-solution-for-search-engines-10826.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<p>Call me Mr. Rain On The Parade, but after reading an interview in The
Guardian with Google product manager of crawl services Dan Crow, I can&#8217;t help
but feel people are going to come away with the mistaken idea that Google will magically make
their Flash content suddenly more accessible to Google&#8217;s crawlers. That ain&#8217;t
going to happen.</p>
<p><span id="more-10826"></span></p>
<p>
<a href="http://blogs.guardian.co.uk/technology/archives/2007/03/27/interview_googles_dan_crow.html">
Interview: Google&#8217;s Dan Crow</a> is the article, and in it, we&#8217;re told:</p>
<blockquote>
<p>If we hit a web page with a Flash movie on it, we just extract the text out
of it and index that text. But a Flash movie is much richer than that. So one
of the projects I&#8217;m working on is to try and improve our Flash processing -
that&#8217;s an example of an area where we could be better. But it&#8217;s not unique to
us.</p>
<p>At the moment our advice is that webmasters need to give us a lot of help &#8211; a
Flash movie is basically a set of virtual pages. If you used HTML for the links,
then we&#8217;d be able to see the overall structure. The content will still be in
Flash, but we can at least get some of it. Ultimately we&#8217;d like to be smart
enough to look inside the Flash movie. We&#8217;re not quite there today.</p>
</blockquote>
<p>Not quite there? Not quite there? Try nowhere, and that&#8217;s not really the
fault of Google or any search engine.</p>
<p>Look, most Flash content I&#8217;ve seen is not made up of words. It&#8217;s video. It&#8217;s
images. It&#8217;s non-textual content that cannot be indexed by a search engine
designed to process text. There are no words to index.</p>
<p>I have a long-running way I try to explain this to people who build in Flash
and who get frustrated that search engines can&#8217;t handle their content. I say
it&#8217;s like taking a TV commercial into a radio station, then getting mad that the
radio station won&#8217;t play the pictures in the commercial. It can&#8217;t. It&#8217;s radio.
Radio plays only sound. If you want images, that&#8217;s TV!</p>
<p>Search engines don&#8217;t understand images. They understand text. Show them
images, and it&#8217;s like trying to play those images on the radio.</p>
<p>Yes, someday in the far, far future, search engines will understand what&#8217;s in
an image. They&#8217;ll see an image and be able to describe it. But that&#8217;s not now.
That&#8217;s not anytime soon. And I say this because
<a href="http://searchengineland.com/061219-091641.php">after a decade</a> of us
being told image recognition is coming, we&#8217;re still no significantly further
along.</p>
<p>Even when they do, it still won&#8217;t be that helpful for general purpose,
textually driven web search. I mean, they&#8217;ll see an image and be able to tell
you &quot;Now the company logo is flying horizontally. Now it is flying vertically.
Now it&#8217;s getting all big. Now it&#8217;s getting all small.&quot; How descriptive is that?</p>
<p>Search engines ARE better at finding actual text that can be embedded in
Flash movies. So if you have a lot of text in there, then maybe they&#8217;ll extract
that text, and your page will do better. Search engines, including Google, have
been doing that for several years now (and
<a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35267">
here&#8217;s</a> Google&#8217;s help page about that). But if you have that much text in
Flash &#8212; why are you putting it in Flash? Put in regular HTML, and save the
Flash for pictures and images.</p>
<p>Overall, don&#8217;t expect that things are going to change soon. They haven&#8217;t for
years, and even suggestions of secret talks with Adobe in this article (&quot;I can&#8217;t
talk about that&quot;) aren&#8217;t going to make some dramatic change. It will be terrible
if those finally getting the message that Flash means problems with search
engines suddenly seize on this interview as hope that a solution is coming. It&#8217;s
not. You want a solution &#8212; try the workarounds
<a href="http://blog.360.yahoo.com/blog-5pHDFJo_erJZfePu.0dv7Hk-?cq=1&#038;p=104">
you&#8217;ll find here</a>, for a start.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/dont-expect-a-magic-flash-solution-for-search-engines-10826/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.480 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2012-05-25 23:40:33 -->
<!-- Compression = gzip -->
