<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>searchengineland.com &#187; Legal: Crawling &amp; Indexing</title>
	<atom:link href="http://searchengineland.com/library/legal/legal-crawling-indexing/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: Must Read News About Search Marketing &#38; Search Engines</description>
	<lastBuildDate>Sun, 22 Nov 2009 16:39:14 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>NAR Changes Its Mind: Google Is Not A Scraper Site</title>
		<link>http://searchengineland.com/nar-google-is-not-scraper-site-30105</link>
		<comments>http://searchengineland.com/nar-google-is-not-scraper-site-30105#comments</comments>
		<pubDate>Wed, 18 Nov 2009 19:17:47 +0000</pubDate>
		<dc:creator>Matt McGee</dc:creator>
				<category><![CDATA[Google: Critics]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=30105</guid>
		<description><![CDATA[Backtracking on a controversial decision earlier this year, the National Association of REALTORS® (NAR) has adopted a new policy that allows real estate professionals to have their sites &#8212; including home listings that belong to others &#8212; indexed by search engines.
The controversy reached a peak this year when the NAR agreed with a local decision [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fnar-google-is-not-scraper-site-30105"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fnar-google-is-not-scraper-site-30105" height="61" width="51" /></a></div><p>Backtracking on a <a href="http://searchengineland.com/google-is-scraper-says-national-association-of-realtors-19046">controversial decision</a> earlier this year, the National Association of REALTORS® (NAR) has adopted a new policy that allows real estate professionals to have their sites &#8212; including home listings that belong to others &#8212; indexed by search engines.</p>
<p>The controversy reached a peak this year when the NAR agreed with a local decision in Indianapolis that said real estate agents couldn&#8217;t let Google and other search engines index the property listings on their sites if those listings belonged to other brokers/agents. In March, the Indianapolis board sent a letter to some agents that essentially called search engines &#8220;scraper&#8221; sites:</p>
<blockquote><p>&#8220;&#8230;the National Association of REALTORS® (NAR) is in agreement with our interpretation of the policy that the above described practice of &#8216;indexing your Web site&#8217; as you have called it, is a method of scraping or reproducing the data&#8221;</p></blockquote>
<p>At their meeting this week, the NAR Board of Directors <a href="http://www.realtor.org/inis.nsf/HTMLNewstest/SpecialINS11162009">revised</a> its policy on home listings and search engines to say that participants &#8220;are not required to prevent indexing of their Web sites by recognized search engines.&#8221; </p>
<p>That Realtor.org link is also interesting for its comments about the development of REALTORS® Property Resource, which many say is essentially a national search engine for property listings.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/nar-google-is-not-scraper-site-30105/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Twitter Not Giving Access To Private Tweets</title>
		<link>http://searchengineland.com/twitter-not-giving-access-to-private-tweets-28122</link>
		<comments>http://searchengineland.com/twitter-not-giving-access-to-private-tweets-28122#comments</comments>
		<pubDate>Tue, 20 Oct 2009 19:44:34 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: General]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Legal: Privacy]]></category>
		<category><![CDATA[Top News]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=28122</guid>
		<description><![CDATA[Is Twitter allowing search engines access to protected tweets or not? Not,  Twitter tells me, though the company probably needs to do a bit more to prevent  this type of confusion in the future.
The LA Times reported  yesterday about a &#8220;Twitter hole&#8221; that it believed allowed Google special  access to protected [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Ftwitter-not-giving-access-to-private-tweets-28122"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Ftwitter-not-giving-access-to-private-tweets-28122" height="61" width="51" /></a></div><p>Is Twitter allowing search engines access to protected tweets or not? Not,  Twitter tells me, though the company probably needs to do a bit more to prevent  this type of confusion in the future.</p>
<p>The LA Times <a href="http://latimesblogs.latimes.com/technology/2009/10/twitter-see-protected-tweets.html">reported  yesterday</a> about a &#8220;Twitter hole&#8221; that it believed allowed Google special  access to <a href="http://help.twitter.com/forums/10711/entries/14016">protected  tweets</a>, tweets made from Twitter accounts where owners have deliberately  chosen not to have their tweets be made public.</p>
<p><a href="http://www.techcrunch.com/2009/10/19/the-new-twitter-hole-that-probably-isnt/">Not  so</a>, said TechCrunch. The so-called protected tweets that the LA Times was  finding in Google looked to be those made from before particular account holders  locked down their accounts.</p>
<p>I checked with Twitter and got back the official word from their press  office:</p>
<blockquote><p>The TechCrunch article seems to sum up the confusion pretty well. It seems  that the LA Times piece references tweets that were public but later the user  protected the account, thus all subsequent tweets are private along with the  profile. The tweets prior to that time <strong>cannot</strong> be un-cached.</p>
<p>Google has not been given a key to the castle&#8230;so to speak.</p></blockquote>
<p>I&#8217;m good with this answer except for the word I&#8217;ve bolded &#8212; that formerly  public tweets cannot be uncached. That&#8217;s incorrect.</p>
<p>Let&#8217;s take an example. Let&#8217;s assume you started your Twitter account in  March. You started tweeting publicly, then in July decided to be private.  Twitter doesn&#8217;t try to protect any of your past tweets. In fact, it&#8217;s pretty  clear about this in its help page <a href="http://help.twitter.com/forums/10711/entries/14016">on the topic</a>:</p>
<blockquote><p>If you have a public account and you protect it, all updates after the time  of protection will be protected. Your profile will only be visible to approved  followers, and existing followers will not be affected.</p>
<p>Please note that tweets from protected profiles will not appear in search  results. People will still be able to find your account using the Find People  search tool but only people you&#8217;ve approved to follow your account will be able  to see your tweets. Also note that any tweets posted while your profile is  private will remain private indefinitely, and tweets posted while your account  is public will remain public indefinitely</p></blockquote>
<p>But Twitter could try to protect those formerly public tweets. As best I can  tell, if you lock down an account, Twitter does make ALL tweets (formerly public  or not) inaccessible to everyone accept those the account holder has authorized  to see them. That includes search engines like <a href="http://www.google.com/">Google</a> or a Twitter-specific search engine  like <a href="http://topsy.com/">Topsy</a>.</p>
<p>Well, if Google can&#8217;t get in to tweets after an account has been protected,  why does it show some? And why does Twitter say this will happen?</p>
<p>Google seems to rely on the last information for a tweet that it could see.  So you tweeted something in March. Google sees the tweet and records it. If in  August, you protect your account. Google tries to revisit your tweets as it does  with any web page, to make sure it has fresh information. It can&#8217;t get to any of  your tweets now.</p>
<p>The ones from August, it never saw them, since they were never  public &#8212; so it doesn&#8217;t list them.</p>
<p>That tweet in March? It keeps showing the  information from the last time it saw it. And apparently, it will keep doing  this for weeks or months.</p>
<p>Google didn&#8217;t send me a comment about this (I did ask, and I might get one later today). But that&#8217;s  just how I know Google works and can see it specifically working with some  protected tweets I investigated today.</p>
<p>As for Topsy, they <a href="http://twitter.com/Topsy/statuses/5023777459">told me</a>:</p>
<blockquote><p>Topsy only displays tweets that were once public. The refresh button will  make them vanish if the account is now private.</p></blockquote>
<p>Back to Google. Eventually it should update its old copy of the tweet with  what it currently shows to non-authorized visitors, a message that says &#8220;This  person has protected their tweets&#8221; (you can see this for millions of people <a href="http://www.google.com/#hl=en&amp;safe=off&amp;q=site%3Atwitter.com+%22This+person+has+protected+their+tweets%22">on  Google now</a>).</p>
<p>Twitter could speed that process along by explicitly  blocking tweets from a protected account with a <a href="../../meta-robots-tag-101-blocking-spiders-cached-pages-more-10665">meta  robots tag</a> configured to remove the page from the index entirely and from  cached copies being allowed (the NOINDEX, NOARCHIVE commands).</p>
<p>That wouldn&#8217;t guarantee that formerly public tweets are all taken private, of  course. Once something&#8217;s put out on the public web, it&#8217;s very difficult to pull  it back. But it could help and seems an easy enough change to do.</p>
<p>If you have a protected account, also keep in mind that those who follow you  might retweet what you tweet to the world. If you&#8217;re that worried, make sure you  pick your followers carefully and regularly keep them informed that you don&#8217;t  want things retweeted. Otherwise, be prepared for your private tweets to leak  out.</p>
<p>For more about search and tweets, see my <a href="../../what-is-real-time-search-definitions-players-22172">What  Is Real Time Search? Definitions &amp; Players</a> post which cover some ways to  make use of Google and its <a href="../../up-close-with-google-search-options-26985">search  options</a> feature to drill-down into tweets.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/twitter-not-giving-access-to-private-tweets-28122/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yahoo Wins In Lawsuit Over Search Descriptions</title>
		<link>http://searchengineland.com/yahoo-wins-in-lawsuit-over-search-descriptions-25473</link>
		<comments>http://searchengineland.com/yahoo-wins-in-lawsuit-over-search-descriptions-25473#comments</comments>
		<pubDate>Wed, 09 Sep 2009 16:20:06 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Legal: General]]></category>
		<category><![CDATA[Yahoo: Legal]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=25473</guid>
		<description><![CDATA[Eric Goldman updated us on a case where Yahoo was sued for showing spam and porn pages for a search on a person&#8217;s name.  Beverly Stayart was upset that when you searched for her name on Yahoo, Yahoo returned some results that were not to her liking.  So she sued Yahoo but the [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fyahoo-wins-in-lawsuit-over-search-descriptions-25473"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fyahoo-wins-in-lawsuit-over-search-descriptions-25473" height="61" width="51" /></a></div><p>Eric Goldman <a href="http://blog.ericgoldman.org/archives/2009/09/yahoos_search_r.htm">updated</a> us on a case where <A href="http://searchengineland.com/yahoo-sued-for-showing-spam-pages-about-beverly-16601">Yahoo was sued</a> for showing spam and porn pages for a search on a person&#8217;s name.  Beverly Stayart was upset that when you searched for her name on Yahoo, Yahoo returned some results that were not to her liking.  So she sued Yahoo but the court wouldn&#8217;t have it.</p>
<p>Goldman explained that the court rejected her suit on three different levels.   She sued Yahoo for Lanham Act false endorsement and it was denied by the court, as expected.  The reason it was rejected? Goldman sums it up as follows:</p>
<blockquote><p>The court denies Various&#8217; 230 defense because its association with the banner ad was unclear. Having dismissed the federal Lanham Act claims completely, the court then declines supplemental jurisdiction over the state law claims. The court also rejects Stayart&#8217;s guffaw-inducing request for sanctions against the defendants for having the temerity of moving to dismiss her complaint.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/yahoo-wins-in-lawsuit-over-search-descriptions-25473/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>US Justice Dept. Formally Confirms Google Books Inquiry</title>
		<link>http://searchengineland.com/us-justice-dept-confirms-google-books-inquiry-21958</link>
		<comments>http://searchengineland.com/us-justice-dept-confirms-google-books-inquiry-21958#comments</comments>
		<pubDate>Fri, 03 Jul 2009 12:34:38 +0000</pubDate>
		<dc:creator>Greg Sterling</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: Book Search]]></category>
		<category><![CDATA[Google: Critics]]></category>
		<category><![CDATA[Google: Legal]]></category>
		<category><![CDATA[Legal: Copyright]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=21958</guid>
		<description><![CDATA[It&#8217;s not really a surprise or even news that the US Department of Justice (DOJ) confirmed yesterday it was formally investigating the terms of the Google Book Search settlement. This was known as far back as April and essentially confirmed last month by various publications reporting that formal requests (called &#8220;civil investigative demands”) had been [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fus-justice-dept-confirms-google-books-inquiry-21958"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fus-justice-dept-confirms-google-books-inquiry-21958" height="61" width="51" /></a></div><p>It&#8217;s not really a surprise or even news that the US Department of Justice (DOJ) <a href="http://www.techmeme.com/090703/p3#a090703p3">confirmed yesterday</a> it was formally investigating the terms of the Google Book Search <a href="http://searchengineland.com/google-settles-copyright-litigation-for-125-million-paves-way-for-novel-services-15282">settlement</a>. This was known <a href="http://searchengineland.com/reports-us-dept-of-justice-looking-at-antitrust-issues-over-google-books-lawsuit-18238">as far back as April</a> and essentially confirmed last month by various publications <a href="http://searchengineland.com/doj-increases-scrutiny-of-google-book-settlement-20795">reporting</a> that formal requests (called &#8220;civil investigative demands”) had been issued to book publishers by the DOJ.</p>
<p>What happened yesterday was that the DOJ provided procedural notification to the court that the US is exploring the potential anti-trust dimensions of the Google settlement. The so-called &#8220;fairness hearing&#8221; to finalize the terms of the settlement will happen on October 7 and the US has until September 19 to &#8220;present its views in writing.&#8221; That&#8217;s not a lot of time.</p>
<p>What did the DOJ say in its <a href="http://www.scribd.com/doc/17045068/SDNY-Order-DOJ-Letter">two paragraph letter</a> notifying the court of its formal investigation? Here&#8217;s the body of the letter:</p>
<p><img class="alignnone size-full wp-image-21959" title="picture-18" src="http://searchengineland.com/figz/wp-content/seloads/2009/07/picture-18.png" alt="picture-18" width="470" height="443" /></p>
<p>The <a href="http://www.america.gov/st/educ-english/2008/April/20080423212813eaifas0.42149.html">Sherman Anti-Trust Act</a> is mentioned in the first paragraph of the government&#8217;s letter. The Sherman Act (and related subsequent legislation) is intended to promote competition and prevent consolidation of power and control over markets. For example, Microsoft was <a href="http://news.bbc.co.uk/2/hi/business/700084.stm">found</a> in 2000 to have violated the Sherman Anti-Trust Act for too closely tying Internet Explorer to the Windows operating system.</p>
<p>One of the central questions the DOJ is exploring in its review of the Google Book Search settlement is whether Google gains exclusive rights or benefits to which other parties don&#8217;t have access. Some legal scholars <a href="http://www.slideshare.net/naypinya/reflections-on-the-google-book-search-settlement-by-pamela-samuelson">have argued</a> that Google does gain what amounts to a monopoly over so-called &#8220;orphan books,&#8221; books that are out of print but not yet in the public domain and where copyright owners are effectively impossible to locate.</p>
<p>Google has repeatedly said that the agreement is non-exclusive but has not discussed that contention in precise detail. Questions in the anti-trust investigation will include how much control Google has over the &#8220;corpus&#8221; (material scanned), over pricing of book sales and to what degree third parties can gain access to the material. (New York University is <a href="http://searchengineland.com/google-book-search-lawsuit-gets-conference-treatment-21746">holding a conference</a> on the Book Search Settlement for interested parties on October 8, a day after the fairness hearing.)</p>
<p>Unlike the Google-Yahoo <a href="http://searchengineland.com/as-expected-yahoo-announces-10-year-google-paid-search-deal-14198">paid search deal</a> that Google <a href="http://searchengineland.com/citing-risk-google-ends-yahoo-paid-search-deal-15375">walked away from to avoid a legal battle with the DOJ</a>, the Book Search Settlement is different. Because of the underlying lawsuit Google cannot simply walk away here. Putting aside some of the less substantive issues (e.g., whether the attorneys fees are too high) as a basic matter there are essentially only two potential outcomes: approval of the existing settlement or modification of selected terms. If I had to bet I would bet that we will see some sort of modification further expanding third party access to the scanned material.</p>
<p>You can read the <a href="http://books.google.com/googlebooks/agreement/press.html">settlement itself</a> or take a look at the public-facing Google Book Settlement <a href="http://www.googlebooksettlement.com/">website</a> for rights holders and publishers. For additional context and background here&#8217;s our &#8220;library&#8221; of <a href="http://searchengineland.com/library/google/google-book-search">Google Book Search stories</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/us-justice-dept-confirms-google-books-inquiry-21958/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Website Guilty Over Google&#8217;s Automated Snippet</title>
		<link>http://searchengineland.com/website-guilty-over-googles-automated-snippet-20347</link>
		<comments>http://searchengineland.com/website-guilty-over-googles-automated-snippet-20347#comments</comments>
		<pubDate>Tue, 02 Jun 2009 13:46:16 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Google: Legal]]></category>
		<category><![CDATA[Google: Web Search]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=20347</guid>
		<description><![CDATA[The Register reports a Dutch court has ruled against a Dutch web site, where the web site was sued over the automated snippet used in Google for a keyword phrase. 
When someone searched for [Zwartepoorte] and [bankrupt], Google showed Miljoenhuizen.nl and the snippet in the Google search results for Miljoenhuizen.nl was:
Complete name: Zwartepoorte Specialiteit: BMW&#8230;This [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fwebsite-guilty-over-googles-automated-snippet-20347"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fwebsite-guilty-over-googles-automated-snippet-20347" height="61" width="51" /></a></div><p>The Register <a href="http://www.theregister.co.uk/2009/06/02/google_snippet/">reports</a> a Dutch court has ruled against a Dutch web site, where the web site was sued over the automated snippet used in Google for a keyword phrase. </p>
<p>When someone searched for [Zwartepoorte] and [bankrupt], Google showed Miljoenhuizen.nl and the snippet in the Google search results for Miljoenhuizen.nl was:</p>
<blockquote><p>Complete name: Zwartepoorte Specialiteit: BMW&#8230;This company has been declared bankrupt, it has been acquired by the motordealer I have worked for Boat Rialto&#8230;</p></blockquote>
<p>What is the issue?  Well, Zwartepoorte did not go bankrupt and was upset when the Google snippet showed they did.  So Zwartepoorte sued Miljoenhuizen.nl, not Google for this error, and won.  The site owner had to change their web page so Google would remove the snippet.</p>
<p>This is pretty crazy if you ask me.  Google algorithmically makes the snippets.  No where on the page did the site say the company went bankrupt.  In any event, the court required the web site to modify the page so Google would remove the automated snippet.  </p>
<p>&#8220;The Court argues that it might be true that the website had no control over the functioning of Google but suggests that these questions about the opacity of Google’s functioning should be addressed in a broader context&#8230; then concludes that defendant had its own responsibility,&#8221; says van Hoboken, a PhD candidate at the Institute for Information Law at the University of Amsterdam.</p>
<p>Hat tip to <a href="http://www.blogstorm.co.uk/court-rules-website-liable-for-google-snippet/">Blogstorm</a> for spotting this case.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/website-guilty-over-googles-automated-snippet-20347/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Open Letter To Google &amp; The AP: Reveal The Licensing Terms</title>
		<link>http://searchengineland.com/open-letter-to-google-the-ap-reveal-the-licensing-terms-20229</link>
		<comments>http://searchengineland.com/open-letter-to-google-the-ap-reveal-the-licensing-terms-20229#comments</comments>
		<pubDate>Fri, 29 May 2009 18:42:30 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: Business Issues]]></category>
		<category><![CDATA[Google: News]]></category>
		<category><![CDATA[Legal: Copyright]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=20229</guid>
		<description><![CDATA[Discussions between Google and the Associated Press about renewing their content licensing  deal continue, I assume, but all&#8217;s quiet recently on the negotiation front. I  want to disrupt that. It would be wrong in this particular case for both parties  to reach a deal where &#8220;terms are not disclosed.&#8221; The future of [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fopen-letter-to-google-the-ap-reveal-the-licensing-terms-20229"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fopen-letter-to-google-the-ap-reveal-the-licensing-terms-20229" height="61" width="51" /></a></div><p><a href="../../sorry-tom-curley-no-google-ranking-boost-for-ap-18402">Discussions</a> between Google and the Associated Press about renewing their content licensing  deal continue, I assume, but all&#8217;s quiet recently on the negotiation front. I  want to disrupt that. It would be wrong in this particular case for both parties  to reach a deal where &#8220;terms are not disclosed.&#8221; The future of journalism, as  well as Google&#8217;s own reputation, deserves for things to open up.</p>
<p>After threatening a lawsuit against Google several years ago, AP <a href="../../google-news-now-hosting-wire-stories-promises-better-variety-in-results-12064">won</a> its first licensing deal with Google in 2006. Google was at pains to stress this  wasn&#8217;t a deal designed to gain the rights to merely list AP stories. It was  supposed to cover more &#8220;new&#8221; and &#8220;extensive&#8221; uses of AP material.</p>
<p>Bull. Today, you can still search at Google and find that it fails to list just one &#8220;originating&#8221; AP story in many cases. Sure, Google hosts AP stories, but it  wasn&#8217;t like Google set out trying for that goal. It was just as happy as  pointing outward. The bottom line was that the deal was a nice wrapper to go  around getting the AP off Google&#8217;s back.</p>
<p>Now the AP&#8217;s been on again that without the right deal, it&#8217;ll pull content  from Google. Technically, how they&#8217;ll do that is absurd. Will the AP robots.txt out  its entire site? And ensure that all of its members do the same for any AP  story? More likely, it would fall back to the lawsuit front.</p>
<p>But what&#8217;s the right deal? What&#8217;s Google paying the AP now, and what happened  in three years that this amount wasn&#8217;t enough? Since the terms were never  disclosed, the public can&#8217;t judge.</p>
<p>And the public needs to judge, especially as newspapers have <a href="http://correspondents.theatlantic.com/james_warren/2009/05/shhhh_newspaper_publishers_are_quietly_holding_a_very_very_important_conclave_today_will_you_soon_be.php">secret  stealth meetings</a> (which involve AP CEO Tom Curley) which involve sessions  like:</p>
<blockquote><p>Journalism Online: Presentation on proposed service to charge for access to  newspaper content and to license that content that (sic) online  aggregators</p></blockquote>
<p>(For more on the stealth meeting, also see the Nieman Journalism Lab <a href="http://www.niemanlab.org/2009/05/newspaper-execs-treading-carefully-on-antitrust-laws/">report</a> and <a href="http://www.techmeme.com/090528/p53#a090528p53">Techmeme</a>).</p>
<p>See, some in the newspaper industry persist in the assumption that merely  listing headlines and summarizing stories is a copyright offense. The AP itself wants to  charge for this and already has guidelines that make some people think they  might be violating fair use laws when they aren&#8217;t (see <a href="http://daggle.com/do-newspapers-owe-google-fees-for-researching-stories-611">Do  Newspapers Owe Google “Fair Share” Fees For Researching Stories?</a>).</p>
<p>In other quarters, we have people like Forbes.com CEO Jim Spanfeller <a href="../../forbes-spanfeller-attacks-google-stumbles-into-cesspool-18654">pulling  $60 million dollar figures out of the air</a>, about what he thinks Google owes  him and suggesting that Google is helping to destroy &#8220;one of the core building  blocks of our democracy.&#8221;</p>
<p>Since so much is at stake here &#8212; democracy itself! &#8212; it doesn&#8217;t seem right  that Google and the AP will reach a deal that no one gets the details too.  Shouldn&#8217;t all journalistic enterprises, be they traditional or not, understand  exactly the value Google&#8217;s willing to pay to one with a big mouth and a staff of  lawyers?</p>
<p>Moreover, shouldn&#8217;t Google be taking a stand on behalf of the majority of  publishers who I&#8217;d argue have no problem with aggregators or search engines  listing their content. A small number of publishers with inflated opinions about  their importance are pushing for (and winning) <a href="http://daggle.com/newspaper-tax-break-626">concessions</a> to compensate  for their dying business models and lack of foresight. Aren&#8217;t the wrong people  being rewarded?</p>
<p>You have to appreciate the rich irony. I can&#8217;t get the AP to talk with me at  all (<a href="http://daggle.com/do-newspapers-owe-google-fees-for-researching-stories-611">their  execs are all busy, I&#8217;ve been told</a>). When their CEO attends a stealth  meeting, it&#8217;s not even the AP that first reports about it despite the AP being  so convinced that they originate stories. Instead, the AP story <a href="http://www.sfgate.com/cgi-bin/article.cgi?f=/n/a/2009/05/28/financial/f150308D57.DTL">points</a> to an Atlantic blog <a href="http://correspondents.theatlantic.com/james_warren/2009/05/shhhh_newspaper_publishers_are_quietly_holding_a_very_very_important_conclave_today_will_you_soon_be.php">post</a> that broke the news. And Google, which <a href="../../google-as-open-as-it-wants-to-be-ie-when-its-convenient-12624">touts  being open any time it is convenient</a>, is being hush-mouthed here.</p>
<p>Google needs to step up. Defend the &#8220;fair use&#8221; rights it believes it  has on its own behalf and those of the greater web ecosystem. Alternatively, negotiate a deal  to solve its AP problem but do it openly, so everyone else understands what  exactly is being given up.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/open-letter-to-google-the-ap-reveal-the-licensing-terms-20229/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google Is A Scraper Site, Says National Association Of Realtors</title>
		<link>http://searchengineland.com/google-is-scraper-says-national-association-of-realtors-19046</link>
		<comments>http://searchengineland.com/google-is-scraper-says-national-association-of-realtors-19046#comments</comments>
		<pubDate>Tue, 12 May 2009 18:09:50 +0000</pubDate>
		<dc:creator>Matt McGee</dc:creator>
				<category><![CDATA[Features: General]]></category>
		<category><![CDATA[Google: Critics]]></category>
		<category><![CDATA[Google: Web Search]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Top News]]></category>
		<category><![CDATA[Yahoo: Real Estate]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=19046</guid>
		<description><![CDATA[With support from the National Association of REALTORS® (NAR), the Indianapolis Metropolitan Board of REALTORS® (MIBOR) has forced some of its members to stop allowing certain MLS listings to be crawled and indexed by Google because Google (and other search engines) is considered a &#8220;scraper&#8221; site.
This is the latest episode in a long-running battle over [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-is-scraper-says-national-association-of-realtors-19046"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fgoogle-is-scraper-says-national-association-of-realtors-19046" height="61" width="51" /></a></div><p>With support from the National Association of REALTORS® (NAR), the Indianapolis Metropolitan Board of REALTORS® (MIBOR) has forced some of its members to stop allowing certain MLS listings to be crawled and indexed by Google because Google (and other search engines) is considered a &#8220;scraper&#8221; site.</p>
<p>This is the latest episode in a long-running battle over who controls home listings that are part of the Multiple Listing Service, or MLS. Some of the affected real estate professionals plan to ask the NAR to change its opinion on search engines at its national convention this week.</p>
<p><a href="http://www.hometoindy.com/">Paula Henry</a> is one of about 15 real estate agents affected by the NAR/MIBOR decision. Like many agents/brokers, she shows not only her own listings on her web site, but also listings from other agents and brokers who participate in a data sharing agreement through the Internet Data Exchange (IDX) system. There are strict rules over how listings from other agents may be displayed. In a <a href="http://agentgenius.com/g-rants-insanity-more/real-estate/did-google-scrape-my-website-you-be-the-judge/">post on AgentGenius.com</a>, Paula explains the one that has led to Google&#8217;s classification as a scraper site:</p>
<blockquote><p>Section 15.2.2  &#8211; participants must protect IDX information from misappropriation by employing reasonable efforts to monitor and prevent &#8217;scraping&#8217; or other unauthorized accessing, reproduction, or use of the BLC database&#8221;</p></blockquote>
<p><em>(BLC is the Indianapolis version of MLS.)</em></p>
<p>After apparently getting a complaint from another agent, MIBOR consulted with the NAR and the NAR confirmed that the rule above applies to search engines. On March 27, MIBOR sent Red Door Real Estate (where Paula Henry does business) a cease-and-desist letter detailing two issues:</p>
<ol>
<li>&#8220;&#8230;the National Association of REALTORS® (NAR) is in agreement with our interpretation of the policy that the above described practice of &#8216;indexing your Web site&#8217; as you have called it, is a method of scraping or reproducing the data&#8221;</li>
<li>&#8220;Under IDX policy &#8230; participants have no authority to advertise those listings [from other participants] in any other way, including Internet search engines&#8221;</li>
</ol>
<p>The C&amp;D letter asks Red Door to either use robots.txt (actually referred to as &#8220;rebot.txt&#8221; in the letter) to block search engines from crawling the listings on their sites that belong to other agents/brokers, or to remove the non-Red Door listings altogether. In her <a href="http://agentgenius.com/g-rants-insanity-more/real-estate/did-google-scrape-my-website-you-be-the-judge/">blog post</a>, Paula writes that additional demands were made in April requiring Red Door to remove the MLS number, street address, and other listing data from the Page Titles and Meta Description tags of pages on their web sites.</p>
<p>Not mentioned in all of this is the fact that Google is practically running its own national MLS database, with the same kind of search and sort options (Price, Beds, Baths, Area, etc.) in Google Maps that you&#8217;d expect to find on full-fledged real estate sites.</p>
<p><img title="goog-real-estate" src="http://searchengineland.com/figz/wp-content/seloads/2009/05/goog-real-estate.jpg" alt="goog-real-estate" width="540" height="380" /></p>
<p>Real estate brokers and agents submit their own listings to Google (or they have a third party do it), so the rules about how other agents&#8217; listings are displayed don&#8217;t apply. And it&#8217;s not just about Google; <a href="http://realestate.yahoo.com/">Yahoo Real Estate</a> also gets listings from agents and brokers, and places them on highly-optimized pages that, in my experience, often rank well for popular real estate search terms.</p>
<p>As for Google being a scraper site, Paula Henry will be one of two real estate agents speaking to the NAR on Thursday. She says they&#8217;ll ask the organization to review and change its policy so that Google (and other search engines) is allowed to index all real estate listings from the IDX system, no matter whose web site they appear on.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/google-is-scraper-says-national-association-of-realtors-19046/feed</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Forbes.com CEO Spanfeller Attacks Google, Stumbles Into His Own Cesspool</title>
		<link>http://searchengineland.com/forbes-spanfeller-attacks-google-stumbles-into-cesspool-18654</link>
		<comments>http://searchengineland.com/forbes-spanfeller-attacks-google-stumbles-into-cesspool-18654#comments</comments>
		<pubDate>Wed, 06 May 2009 03:24:33 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: AdWords]]></category>
		<category><![CDATA[Google: Business Issues]]></category>
		<category><![CDATA[Google: Critics]]></category>
		<category><![CDATA[Google: News]]></category>
		<category><![CDATA[Google: Web Search]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Legal: Trademarks]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=18654</guid>
		<description><![CDATA[Another publisher is complaining that Google isn&#8217;t giving them their &#8220;fair  share&#8221; &#8212; this time Forbes CEO Jim Spanfeller. Google makes $60 million off the Forbes brand, he claims (with no proof), and boosting &#8220;quality publishers&#8221; like  Forbes would help Eric Schmidt&#8217;s web sewage problem. This is from the publication  that sells [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fforbes-spanfeller-attacks-google-stumbles-into-cesspool-18654"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fforbes-spanfeller-attacks-google-stumbles-into-cesspool-18654" height="61" width="51" /></a></div><p>Another publisher is complaining that Google isn&#8217;t giving them their &#8220;fair  share&#8221; &#8212; this time Forbes CEO Jim Spanfeller. Google makes $60 million off the Forbes brand, he claims (with no proof), and boosting &#8220;quality publishers&#8221; like  Forbes would help Eric Schmidt&#8217;s <a href="http://searchengineland.com/amid-tensions-googles-eric-schmidt-addresses-newspaper-conference-17237">web sewage problem</a>. This is from the publication  that sells paid links that helps the sewage rise in Google?</p>
<p>I&#8217;ve already dissected some of the misconceptions that Spanfeller and other  publishers have about Google in my <a href="../../sorry-tom-curley-no-google-ranking-boost-for-ap-18402">Sorry,  Tom Curley: Don’t Expect A Google Ranking Boost For The AP</a> post from last  week. This time, I&#8217;ll do a counterpoint to points in Spanfeller&#8217;s <a href="http://www.paidcontent.org/entry/419-google-and-newspapers/">article</a> over at PaidContent.org.</p>
<blockquote><p><strong>Spanfeller: </strong>For some time there have been murmurings about the  relative value generated by Google  vs. the parasitical nature of its business  model. In short, is Google being disproportionally compensated for what is  fundamentally other people’s work?</p></blockquote>
<p><strong>Counterpoint:</strong> Google sends Forbes and other publishers millions of  visits for free. Usually smaller publishers complain if for some reason they  lose that traffic due to a ranking change. Newspapers and magazine publishers  seem unique in being upset that getting all those free visitors simply isn&#8217;t  enough. Perhaps Google itself isn&#8217;t being properly compensated?</p>
<blockquote><p><strong>Spanfeller: </strong>There is a strong case to be made that Google is indeed  getting a bigger piece of the pie than it deserves. It certainly feels that way  to content-producing companies when the advertising cycle is in a trough (as it  is now) and the advertising lifeblood for branded professional journalism seems  to be shrinking by the day. But is there substance to this feeling beyond the  pain of lower ad dollars?</p></blockquote>
<p><strong>Counterpoint:</strong> Print publications have had 15 years to find a business  model in a world that was moving digital. They haven&#8217;t gotten there. Google  didn&#8217;t even exist when they first started having these pains, but it&#8217;s their  bane now? These publications also seemed more than happy to fritter away the  past three years or so of rising online ad revenues without planning for the  inevitable downturns in ad cycles.</p>
<blockquote><p><strong>Spanfeller:</strong> Let’s consider some basic issues, all of which have been  discussed in the industry for some time. First off, does the last click get too  much credit? Just about everyone I talk with these days agrees that it does.  Google is by far the biggest winner in this ill-conceived metric, and by selling  branded keywords to the highest bidder, the company is, in fact, working hard to  maximize it.</p></blockquote>
<p><strong>Counterpoint: </strong>Somehow I suspect the vast majority of people who search  for your brand, Forbes, aren&#8217;t clicking on paid ads that show up. Most of them  instead are almost certainly clicking on your free listing. Isn&#8217;t Forbes by far  the winner in those last clicks?</p>
<p>How about an illustration:</p>
<p><a title="Forbes.com On Google Search by search-engine-land, on Flickr" href="http://www.flickr.com/photos/searchengineland/3505748073/"><img src="http://farm4.static.flickr.com/3585/3505748073_892a053c8f.jpg" border="0" alt="Forbes.com On Google Search" width="500" height="246" /></a></p>
<p>Here&#8217;s the rundown on each item I&#8217;ve numbered:</p>
<p><strong>(1) </strong>This is the free listing that you have on Google for your own  name. Right there at the top, which survey after survey will tell you is where  the majority of people click. How much did you pay for that? Right, nothing.</p>
<p><strong>(2)</strong> This is someone selling your freaking magazine. You know, the  thing that if its circulation rises, you do better?</p>
<p>Now I can agree, this is annoying if you prefer to sell direct. You might not  want to have some third-party competing with you on your own name. Usually smart  publishers concerned about this ensure that their official affiliates are barred  from doing so. If it&#8217;s not an official affiliate, then sorry, the law in the US  <a href="http://searchengineland.com/forbes-spanfeller-attacks-google-stumbles-into-cesspool-18654">still seems to allow</a> ads to be triggered from branded terms. You just can&#8217;t use  the brand name in the text (and these folks are trying to get around that with  the whole &#8220;Frbs&#8221; thing.</p>
<p>Maybe Google should make nice with you and other brand holders by disallowing  branded terms from triggering ads, despite legal cases. Might be a good PR move.  But then again, you&#8217;re not the boss of the word &#8220;Forbes.&#8221; Sorry, you&#8217;re not. You  have protected trademark rights relating to it. But if someone&#8217;s not passing  themselves off as you &#8212; or if someone else out there has a &#8220;Forbes&#8221; brand &#8212;  they have rights too.</p>
<p>Hey, perhaps Google should never allow the word &#8220;iPhone&#8221; to be a trigger word  for ads that aren&#8217;t from Apple. Then the next time my iPhone breaks, and I want  someone to fix it, Apple can be the sole advertiser listed &#8212; even if others might do the job better and cheaper.</p>
<p>Maybe I have an old Forbes magazine collection that I want to sell. Forbes  should have the right to prevent me from advertising this fact?</p>
<p>Or hey, you know sometimes people don&#8217;t like companies like McDonald&#8217;s. We  should prevent them from running ads that might point to content explaining  their issues with those companies. Let&#8217;s make it a big huge brandfest, shall we?</p>
<p>By the way, you really want to poke at Google about branded terms. Pick a  better target &#8212; complain that there&#8217;s an essential unfairness that Google  allows brand terms as trigger words except for Google itself.</p>
<p><strong>(3) </strong>Hey, you got me. No idea why these folks think running an ad  against &#8220;Forbes&#8221; makes sense. But who exactly were you losing to the Forbes web  site that decided instead that Atlanta real estate was for them? This is also a  good time to mention that not everyone who enters &#8220;Forbes&#8221; actually wants to go  to your web site (though most of them probably do &#8212; and probably do get there,  as I said)</p>
<p><strong>(4)</strong> That&#8217;s you! It&#8217;s your own ad, for your newsletters. Now let me  understand this. Google&#8217;s sending you tons of traffic for free, but that&#8217;s not  enough &#8212; so you&#8217;re buying ads at Google. If you&#8217;re buying ads on Google, then  hopefully you&#8217;re making money off that traffic. And if you are, I gotta believe  you&#8217;re making money off all those free people Google sends to you each day. How  much, would you say? Oh, and free SEO tip. Perhaps if you spent a little time  ensuring pages in your online store had even the basics of SEO applied, <a href="http://www.google.com/search?hl=en&amp;q=site:www.newsletters.forbes.com">like</a> unique page titles, Google probably would send  you more traffic directly there for free.</p>
<blockquote><p><strong>Spanfeller: </strong>For the most part, a marketer’s top-producing search terms  incorporate its brand name &#8230; where value chains are muddied beyond  recognition—is when Google makes marketers buy their own brand name so that  their competitors will not &#8230; All of the value of “brand advertising” is  discounted as consumers take action on that advertising by using their search  bars to navigate to the marketer’s web site. I have been told that in many cases  over 90% of search spending by large-brand advertisers is targeted at keywords  that are themselves (or have incorporated within them) the actual name of the  brand.</p></blockquote>
<p><strong>Counterpoint:</strong> I&#8217;m sorry, did a horde of heavily armed Googlers break  into the Forbes ivory tower and force you at gunpoint to buy your brand name? I don&#8217;t  think so. And looking today, just who are your competitors showing up that  you&#8217;re so concerned about?</p>
<p>Let&#8217;s stick with real life rather than the theoretical debate:</p>
<ul>
<li>How many visitors per day make it to your web site searching for &#8220;Forbes&#8221;  and other terms incorporating that word? Hundreds? Thousands? Hundreds of  thousands? You know the answer &#8212; let&#8217;s have it.</li>
<li>How many people does it take until you&#8217;d feel you were getting a fair  shake?</li>
<li>What do your own search marketers tell you why they are buying ads for your  own brand. Is it really just because they worry about competitors, or might it  be that because Forbes wants to push particular revenue-generating assets in  front of Google&#8217;s audience &#8212; and also because search marketers have long <a href="../../ppc-and-your-good-name-sales-from-brand-searches-arent-incremental-10825">reported</a> that a paid ad for your name in conjunction with that nice free link Google  gives you makes for better results.</li>
</ul>
<p>By the way, if your brand is really so big, why are all those consumers so  stupid as to not simply enter your URL into the address bar of their  browser?</p>
<blockquote><p><strong>Spanfeller: </strong>Search is not really all that great at the moment, a  comment repeated time and again by much more astute folks then me. This is  especially true when looking for high-quality professionally created content.  This is not to say that user-generated content or ecommerce options or product  specs should not be returned in search results, simply that there is clearly a  better way to showcase the different paths an end user might be pursuing. The  idea that everyone is forced into trying to “game” the system so that they get  their “fair” (or sometimes not so fair) share is testament to how terribly wrong  this entire process has become.</p>
<p>For all the discussion about the vaunted search algorithms, is there any  consideration for paid journalism over or even separate from ecommerce options  or user-generated blogs and the like?</p></blockquote>
<p><strong>Counterpoint: </strong>Yes, let&#8217;s go with the anecdotes. I suppose the economy  is picking up now, because I&#8217;ve heard a bunch of people tell me they&#8217;re feeling  better about it. Why don&#8217;t you cite that in Forbes?</p>
<p>The reality is that we have no dependable stats on whether search has  improved or gotten worse. But I do have my suspicions when a publisher who  believes he has &#8220;high-quality professional created content&#8221; tells me that it&#8217;s  &#8220;especially true&#8221; that their publications aren&#8217;t being well served.</p>
<p>As for that separate place you want, my jaw dropped when I read that. You  have it. Google has all that high-quality professionally created content in a  super-secret place, but I&#8217;ll reveal its location now. It&#8217;s called <a href="http://news.google.com/">Google News</a>. Only you and about 5,000 or so  other publications get in. For free. Special privileges (see <a href="http://daggle.com/googles-love-for-newspapers-how-little-they-appreciate-it-443">Google’s  Love For Newspapers &amp; How Little They Appreciate It</a>). Did you send  Google the thank you note yet?</p>
<p>As for different paths to showcase different type of content, here&#8217;s a  newsflash. Google has what&#8217;s called <a href="../../google-20-google-universal-search-11232">universal  search</a>, where they actually segment things into groups like news, products  and user-generated content (blog). Sort of what you&#8217;re wishing for, except that  wish came true back in 2007.</p>
<blockquote><p><strong>Spanfeller: </strong>When a colleague in the industry recently famously went to  find information about the not-too-distant hostilities in the Gaza Strip, and  did not get an actual professionally created journalistic result until the third  page, something is wrong.</p></blockquote>
<p><strong>Counterpoint:</strong> Gosh, did that colleague get any searches right? Maybe  most of the time? Or are they commonly having this issue?</p>
<blockquote><p><strong>Spanfeller: </strong>At Forbes.com, we have estimated that Google makes roughly $60  million a year directing folks to our site. And by the way, 40 percent of those  dollars are derived from the search terms of Forbes, Forbes.com or Forbes  Magazine—simple navigation. Seems like a very nice chunk of change for simply  being there.</p></blockquote>
<p>Seriously, $60 million? Based on what? How on earth did you estimate this?  Did you examine how many ads run against any search involving &#8220;Forbes?&#8221; Did you  then magically discover which ones actually got clicks? When you did that, how  did you figure out the cost of those ads, when each price is calculated in  real-time based on what the advertiser is willing to spend as well as their  account history. You want to put a figure out there like that, make it more than  a throwaway talking point. Publish it in Forbes itself, as an investigative  piece. Because that&#8217;s what it would take to get a really reliable number, if  such a thing could be arrived it at all from outside Google.</p>
<p>Oh, after you calculate that &#8212; then use the average price paid per click,  multiple that by all the branded terms that send Forbes traffic for free from  Google. Then tell me which is higher. My guess? The free traffic far outweighs  the paid traffic you think Google has sucked from you.</p>
<blockquote><p><strong>Spanfeller: </strong>If this inequity of support continues along these lines,  we will see a continuing destruction of our journalistic enterprises—enterprises  that are one of the core building blocks of our democracy.</p></blockquote>
<p><strong>Counterpoint: </strong>Again, print publications where in trouble before we had  Google, but now you&#8217;ve got scapegoat to blame for more than a  decade of failing to grapple with your fundamental business problems. And let&#8217;s  add a dash of Google undermining democracy to stoke the fires.</p>
<p>By the way, why is it just Google? I mean, Yahoo sends sites a lot of  traffic, too. Yahoo News has a bigger audience than Google News. Yahoo operates  pretty much the same as Google. And Microsoft wants to do the same. Why don&#8217;t  you have an Axis Of Evil Search Engines?</p>
<blockquote><p><strong>Spanfeller: </strong>Last year, while addressing the magazine publishers and  editors of the MPA at the Google Campus, Eric Schmidt suggested that the web was  a “cesspool” and that it was up to the major journalistic brands to clean it up.  Well Eric, in a great many ways, Google has helped to create that cesspool, and  as such I would hope that it can be part of the solution.</p></blockquote>
<p><strong>Counterpoint:</strong> Let&#8217;s all help, starting with Forbes itself. Forbes has long been  known in the SEO world for doing things against <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=35769">Google&#8217;s webmaster guidelines</a>, which are  explicitly meant to help clean that cesspool. To put it bluntly, Forbes has been  pissing in the pool. Some light reading for you, documenting how Forbes has sold  links or hosted less than high quality content on its web site:</p>
<ul>
<li><a href="../../googles-pagerank-update-goes-after-paid-links-12523">Google’s  PageRank Update Goes After Paid Links?</a></li>
<li><a href="http://www.seobook.com/seo-dark-arts">Shining a Light on The Dark  Arts of Search Engine Optimization</a></li>
<li><a href="http://askjasonmarketing.blogspot.com/2008/06/update-forbes-takes-down-their-paid.html">Update:  Forbes Takes Down Their Paid Links</a></li>
<li><a rel="bookmark" href="http://www.gregboser.com/forbes-handjob/">Forbes.com  Gets a Handjob? </a></li>
<li><a rel="nofollow" href="http://www.forbes.com/wrongful_death_attorney.html">Forbes former &#8220;wrongful death attorney&#8221; page</a></li>
</ul>
<p>But a picture&#8217;s worth a thousand words they say:</p>
<p>Look here on your wine <a href="http://www.forbes.com/lifestyle/wine/">page</a>:</p>
<p><a title="Paid Links On Forbes.com? by search-engine-land, on Flickr" href="http://www.flickr.com/photos/searchengineland/3505748161/"><img src="http://farm4.static.flickr.com/3392/3505748161_4b8df7c044_o.jpg" border="0" alt="Paid Links On Forbes.com?" width="351" height="137" /></a></p>
<p>Did Forbes have any of its professional journalists review these &#8220;resources&#8221;  before providing <a href="../../google-now-reporting-anchor-text-phrases-10744">anchor  text-rich links</a> to them? I&#8217;m guessing not, and that&#8217;s an issue. But first,  another example:</p>
<p><a title="Paid Links On Forbes.com by search-engine-land, on Flickr" href="http://www.flickr.com/photos/searchengineland/3505748117/"><img src="http://farm4.static.flickr.com/3397/3505748117_e58d1f71a9_o.jpg" border="0" alt="Paid Links On Forbes.com" width="343" height="81" /></a></p>
<p>I assume these links are sold, since they&#8217;re labeled &#8220;Advertisement.&#8221; These  are &#8220;straight links,&#8221; as are the ones in the first example. This means they aren&#8217;t delivered in JavaScript or in some  other way that would prevent them from passing along credit from your quality  site to theirs. Effectively, they are a way someone has bought a vote on your  site that these two links are relevant for the words &#8220;World Currencies&#8221; and  &#8220;Forex.&#8221;</p>
<p>Now in the future, if you search on Google for <a href="http://www.google.com/search?q=world%20currencies">world currencies</a>, and  you get <a href="http://www.business.com/directory/financial_services/investment_banking_and_brokerage/sales_and_trading/forex/major_currencies/">this</a> page full of ads from Business.com ranking tops, please don&#8217;t whine that Google  should trust in the big brands to sort out the cesspool. We in the SEO space  know well that some of those same big brands use the authority of their web  sites to enrich themselves, rather than to save democracy.</p>
<p>For more, see <a href="http://www.techmeme.com/090505/p82#a090505p82">related  discussion</a> on Techmeme.</p>
<p><strong>Postscript: </strong>I&#8217;m aware that in doing the counterpoints, I probably come off as a Google fanboy. I&#8217;m not. I just like balance and accuracy, and I didn&#8217;t feel I got either in Spanfeller&#8217;s piece.</p>
<p>Google is far from perfect and has plenty of problems, which I&#8217;ve also covered (see today&#8217;s <a title="May 4, 2009" rel="bookmark" href="../../why-hasnt-google-cleared-fired-suspended-employee-18560">Why Hasn’t Google Cleared, Fired Or Suspended Accused AdWords Employee?</a> or my <a href="../../google-master-of-closing-the-loop-10921">Google: Master Of Closing The Loop?</a> as just two examples of this.</p>
<p>Unfortunately, watching the newspaper and magazine publishers attack Google for their woes is like watching a Keystones Cops movie. They bumble about, puff themselves up with arrogance that they occupy some lofty position while simultaneously put their hands out for a Google bailout.</p>
<p>If they&#8217;re going to attack Google, then I want an attack that&#8217;s organized, that can&#8217;t be so easily shot full of holes and which warrants serious attention. Or I want them to stop attacking Google so it can be attacked, when it deserves it, on far more serious issues without such distraction.</p>
<p>More important, I love journalism and want to see the publishers waste less time looking for someone to blame for the industry&#8217;s problems and instead finding new solutions. And no, I hardly see Google as the main reason they&#8217;re having issues, though given the rhetoric over the past few weeks, it seems to be the chief boogeyman.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/forbes-spanfeller-attacks-google-stumbles-into-cesspool-18654/feed</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Sorry, Tom Curley: Don&#8217;t Expect A Google Ranking Boost For The AP</title>
		<link>http://searchengineland.com/sorry-tom-curley-no-google-ranking-boost-for-ap-18402</link>
		<comments>http://searchengineland.com/sorry-tom-curley-no-google-ranking-boost-for-ap-18402#comments</comments>
		<pubDate>Fri, 01 May 2009 18:14:50 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Features: Analysis]]></category>
		<category><![CDATA[Google: News]]></category>
		<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Legal: Copyright]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=18402</guid>
		<description><![CDATA[Apparently talks between Google and the Associated Press aren&#8217;t going well,  or so says  Forbes today, with AP chief executive Tom Curley threatening to take his  content and play elsewhere.
Where that will be is hard to say. Part of the AP&#8217;s original issue with  Google was that AP&#8217;s own member publications [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fsorry-tom-curley-no-google-ranking-boost-for-ap-18402"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fsorry-tom-curley-no-google-ranking-boost-for-ap-18402" height="61" width="51" /></a></div><p>Apparently talks between Google and the Associated Press aren&#8217;t going well,  or so <a href="http://www.forbes.com/2009/04/30/associated-press-google-business-media-apee.html">says  Forbes today</a>, with AP chief executive Tom Curley threatening to take his  content and play elsewhere.</p>
<p>Where that will be is hard to say. Part of the AP&#8217;s original issue with  Google was that AP&#8217;s own member publications would reprint AP material (which  they&#8217;re allowed to do as members), causing there to be no single source that  could benefit from AP traffic.</p>
<p><a href="../../google-news-now-hosting-wire-stories-promises-better-variety-in-results-12064">Google  News Now Hosting Wire Stories &amp; Promises Better Variety In Results</a> covers how after a 2006 deal was struck, AP stories started being hosted on  Google itself, as AP wanted (and has long hosted similarly at Yahoo). That would  have seen to have solved the AP&#8217;s concerns, but apparently not.</p>
<p>The AP wants more money, it seems. How much more is uncertain. Terms of the original deal  were never disclosed, and the best I&#8217;ve ever seen was Google CEO Eric Schmidt <a href="../../amid-tensions-googles-eric-schmidt-addresses-newspaper-conference-17237">saying</a> for the first time in April that it was a &#8220;multi-million deal.&#8221;</p>
<p>Part of AP&#8217;s current plans are to create its own news portal, as both <a href="http://www.paidcontent.org/entry/419-ap-launching-newspaper-industry-campaign-to-protect-news-content/">PaidContent.org</a> and <a href="http://mediamemo.allthingsd.com/20090410/ap-exec-to-the-untrained-eye-it-looks-like-were-stupid/">AllThingsD</a> have covered. From AllThingsD:</p>
<blockquote><p>This has been construed in some quarters as a plan to create a search engine  or news portal. But it’s really just an attempt to upgrade the AP’s search  engine optimization strategy–that is, trying to get its stuff to show up higher  on Google’s (GOOG) search results. It will do that via “search pages,” or “topic  pages,” which are par for the course in the Web world. Check out this New York  Times (NYT) page  on <a href="http://topics.nytimes.com/top/reference/timestopics/subjects/p/piracy_at_sea/index.html">Somali pirates</a>, or this Huffington Post page on <a href="http://www.huffingtonpost.com/news/newspapers">newspapers</a>, and you’ll get an idea of where the AP is going.</p>
<p>If the search page plan  works, the pages will be generating plenty of page views when people land on  them, and it’s possible that the AP will sell ads on that inventory, Kennedy  says. But their real function is to shuttle searchers to the original source  material from the AP’s members.</p></blockquote>
<p>I <a href="http://twitter.com/dannysullivan/statuses/1493351324">joked</a> on  Twitter that if you&#8217;re hoping to suck in Google traffic, it&#8217;s generally not a  good idea to advertise that fact. Reason? As some point, Google will review its  listings, recognized that listing 10 &#8220;topic&#8221; pages from all the publishers out  there probably isn&#8217;t providing a diverse results set and change up the  algorithm.</p>
<p>More important, Google&#8217;s web search quality team &#8212; which has nothing to do  with Google&#8217;s business folks &#8212; generally does not take well to people  suggesting they&#8217;re somehow going to own the search results. AP content probably  will start ranking well for some things, but if it started showing up  Wikipedia-style for everything, people outside the AP would start complaining  about favoritism.</p>
<p>That&#8217;s what makes the Forbes piece so puzzling. AP chief executive Tom Curley  (who the AP told me was &#8220;unavailable&#8221; to talk; nor after nearly two hours, does anyone else seem available) sounds naive enough to believe he  can force Google into a deal that would give AP preferential treatment in  regular search results:</p>
<blockquote><p><strong>Search rankings on Google News give priority to recognizable news brands  like the AP. But Google applies no such algorithmic discretion to general  searches. </strong>The broader search rankings spread AP content out across the Web,  says Curley, encouraging misappropriation by other sites. Curley wants Google to  &#8220;protect content from unauthorized use and pay us for the longtail.&#8221; By  &#8220;longtail,&#8221; Curley refers to the thousands of small sites that collectively  drive vast herds of traffic using AP content.</p></blockquote>
<p>I&#8217;ve bolded the key part. Look, I don&#8217;t know who &#8212; if anyone &#8212; is doing SEO  for the AP or providing them with advice. Judging from <a href="http://www.google.com/search?hl=en&amp;as_q=&amp;as_epq=&amp;as_oq=&amp;as_eq=&amp;num=100&amp;lr=&amp;as_filetype=&amp;ft=i&amp;as_sitesearch=ap.org&amp;as_qdr=all&amp;as_rights=&amp;as_occt=any&amp;cr=&amp;as_nlo=&amp;as_nhi=&amp;safe=images">page  titles alone</a>, like <a href="http://www.ap.org/pages/about/pressreleases/pr_040609a.html">this</a> key  press release about its content enforcement initiative that has no title (a key search ranking factor), they could use some  help. And from the statement above, Curley either doesn&#8217;t understand how Google  works or gets it and is purposely misspeaking. Or perhaps he was misinterpreted.  If he&#8217;s ever available to talk, I&#8217;ll ask him myself.</p>
<p>Google News doesn&#8217;t give &#8220;recognizable news brands&#8221; a boost. I&#8217;ve never seen  them say this, nor have I seen it actually happen in real life. Google News  includes large and small news sites and <a href="http://www.nytimes.com/2008/06/24/technology/24google.html?_r=1">lists</a> a diverse collection of stories. I know lesser-known news sites do well because  I run one of those. At times, I can have a headline story that beats the AP or  other mainstream outlets in Google News.</p>
<p>As for the web, actually there&#8217;s a considerable debate on whether Google HAS  given a brand boost recently (see <a href="../../google-searchs-vince-change-google-says-not-brand-push-16803">Google’s  Vince Update Produces Big Brand Rankings; Google Calls It A Trust “Change”</a>).  Certainly Google CEO Eric Schmidt has been stirring up the pot by repeatedly <a href="../../amid-tensions-googles-eric-schmidt-addresses-newspaper-conference-17237">remarking</a> what a &#8220;sewer&#8221; the internet is and how &#8220;brands&#8221; will sort it out.</p>
<p>Certainly if Google starts ranking brands better than other content, they&#8217;ll  have issues. Brands do not equal trust. Enron had a brand; AIG has a brand &#8212;  being a brand doesn&#8217;t mean that you are more trustworthy or deserve an automatic  ranking boost. From my perspective, Google&#8217;s algorithm has continued to change  over the past few years to reward trusted sites. Many brands have sites that  Google has decided are trustworthy, but some don&#8217;t.</p>
<p>Curley is foolish if he thinks he&#8217;ll browbeat Google into somehow changing  its algorithm in web search to reward AP as part of this deal. Google&#8217;s search  quality engineers wouldn&#8217;t stand for that, any more than a journalist would  stand for a newspaper CEO marching into a newsroom and demanding that certain  advertisers get favorable stories written about them.</p>
<p>If the AP wants traffic to its own site, actually hosting content there is a  good first step (and something I wrote about back last year, in <a href="http://daggle.com/hey-ap-how-about-running-a-real-news-web-site-377">Hey  AP! How About Running A Real News Web Site?</a>). If they&#8217;d done that years ago,  they&#8217;ve have already earned trust in Google&#8217;s algorithms. Instead, they&#8217;re going  to have to earn that from scratch just like everyone &#8212; by having content that lots of people  point at with links.</p>
<p>Therein lies another challenge they face. People aren&#8217;t going to link to your  content when you&#8217;re threatening them with your own made-up rules of what you  consider fair use, as the AP has been doing and recently became more <a href="../../ap-becomes-bad-cop-to-protect-news-from-misappropriation-17227">aggressive  about</a>.</p>
<p>In the end, perhaps the AP will walk away from Google. It&#8217;s always had the  option to block off of its content from being listed. The issue remains that its  member publications might continue to take AP stories and post them to the web.  AP might try to sue Google for listing this content, but I&#8217;d expect legally, it  would be the member publications that would be held accountable.</p>
<p>For more, see <a href="http://www.techmeme.com/090501/p45#a090501p45">related  discussion on Techmeme</a>.</p>
<p>For a longer look from me at recent  newspaper-Google issues, see my <a href="http://daggle.com/googles-love-for-newspapers-how-little-they-appreciate-it-443">Google’s  Love For Newspapers &amp; How Little They Appreciate It</a> post.</p>
<p><strong>Postscript:</strong> Be sure to watch Tom Curley <a href="http://www.charlierose.com/view/interview/10208">in this interview</a> with Charlie Rose, where both he and Arianna Huffington talk about the future of news:</p>
<p><object width="400" height="326" data="http://video.google.com/googleplayer.swf?showShareButtons=true&amp;docId=5441302334843085385%3A904000%3A1066000&amp;hl=en" type="application/x-shockwave-flash"><param name="src" value="http://video.google.com/googleplayer.swf?showShareButtons=true&amp;docId=5441302334843085385%3A904000%3A1066000&amp;hl=en" /><param name="allowfullscreen" value="true" /></object></p>
<p>It&#8217;s remarkable on several fronts:</p>
<p>1) The AP believes that it can divide news content into headlines, lead paragraphs and full stories, each of which can be licensed for use. The essence of being listed in any search engine &#8212; not just Google and not just Google News &#8212; has been to use the title tag (usually also the headline) of a web page.</p>
<p>Does AP really believe it might strike deals to explicitly give search engines permission list their pages using title tags, when this has been commonly done for over a decade and without any serious legal challenge? And will it put the RSS genie back in the bottle, when RSS feeds typically explicitly offer titles and summaries to the public?</p>
<p>2) The AP wants to create a &#8220;newsmap&#8221; that takes people to the &#8220;authorative source&#8221; of a story &#8212; to show &#8220;who broke the news.&#8221; The implication here is that only the AP or its members, or other mainstream publications that AP wants to enlist, originate news.</p>
<p>Much news is originated from non-mainstream publications (yes, such as blogs), that the mainstream press works off of. There are plenty of examples where these non-mainstream publications don&#8217;t get credit as originating sources. Do they get included in the newsmap? If not, is the AP effectively doing the same type of robbing that it complains happens to it?</p>
<p>3) Curley asks who is going to pay for the &#8220;hard work&#8221; and Freedom Of Information Act requests and other types of in-depth journalism. That&#8217;s a good question, and I have my own concerns about that (as my <a href="http://daggle.com/blogs-mainstream-media-we-can-do-get-along-344">Blogs &amp; Mainstream Media: We Can &amp; Do Get Along</a> post covers). But again, it also suggests that only the mainstream media is equipped to do this.</p>
<p>They aren&#8217;t, nor are they the only source of hard work journalism. They do, however, often get better access than non-mainstream publications based on what they built over the years.</p>
<p>Perhaps the mainstream publications should give up some of their reserved seats in the White House press room to bloggers who cover politics to wider audiences than some newspapers currently have?</p>
<p>Perhaps with that better access, those bloggers might earn even more traffic, more revenue and be able to spend more time on further hard work journalism, not hindered by the fact that their old business models don&#8217;t fit the new publishing world.</p>
<p>Perhaps we might even see some online publications win Pulitzers? See my <a href="http://daggle.com/time-google-fund-online-pulitzers-558">Time For Google To Fund An Online-Only Version Of The Pulitzers?</a> post for further thoughts on this, as well as how when it comes to online news breaking, often various publications work together to undercover a story, rather than it having to be a huge solo effort.</p>
<p>4) I came away left with the feeling that Curley has no real idea what bloggers and online journalists do. There&#8217;s a stereotype the AP seems to have formed, and it&#8217;s not an attractive one. I imagine the AP view is that we&#8217;re all sitting in basements somewhere in our pajamas scanning Google News for stories from the AP that we merrily reprint without adding value or original content to (I&#8217;m actually in sweats today; normally if it&#8217;s warmer, I&#8217;m in shorts. I rarely wear pajamas to my home office, which isn&#8217;t in a basement).</p>
<p>I understand the serious business issues they face and the problems with content theft (which has happened to online publications for over a decade, by the way). But I desperately feel like various AP executives need to get out of their offices, get out of the executive meeting rooms at their member publications and instead roll up their sleeves and visit some actual bloggers at online publications in actions. I&#8217;ve been in newsrooms. I came from a newsroom. I know the work that goes on there. I don&#8217;t think the AP folks have any clue of the real honest hard work that goes out outside the newsroom. They should get that education, if they really want to form an educated business model for the future.</p>
<p>5) Finally, what exactly is the AP getting from its current deal with Google? We&#8217;re hearing it wasn&#8217;t a fair shake. OK, cough up, AP. Publish the terms. Tell us exactly what you agreed to in the first place. You&#8217;re a news organization; you&#8217;re making this an issue, so it&#8217;s odd we don&#8217;t know your current terms, in order to judge them.</p>
<p>By the way, if you are from a newspaper, you might also want to see my <a href="http://daggle.com/quick-tips-for-newspapers-seo-409">Quick Tips For Newspapers &amp; SEO</a> post.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/sorry-tom-curley-no-google-ranking-boost-for-ap-18402/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>AP Becomes &#8220;Bad Cop&#8221; To &#8220;Protect&#8221; Newspaper Content Against &#8220;Misappropriation&#8221;</title>
		<link>http://searchengineland.com/ap-becomes-bad-cop-to-protect-news-from-misappropriation-17227</link>
		<comments>http://searchengineland.com/ap-becomes-bad-cop-to-protect-news-from-misappropriation-17227#comments</comments>
		<pubDate>Tue, 07 Apr 2009 14:09:42 +0000</pubDate>
		<dc:creator>Greg Sterling</dc:creator>
				<category><![CDATA[Google: SEO]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Legal: General]]></category>
		<category><![CDATA[Search Engines: News Search Engines]]></category>
		<category><![CDATA[Top News]]></category>

		<guid isPermaLink="false">http://searchengineland.com/?p=17227</guid>
		<description><![CDATA[The Associated Press announced an aggressive new policy whereby it is going to try and get more money for newspaper content online or take legal action against &#8220;copyright violators&#8221; and those that won&#8217;t pay. From the AP&#8217;s own press release:
The Associated Press Board of Directors today announced it would launch an industry initiative to protect [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fap-becomes-bad-cop-to-protect-news-from-misappropriation-17227"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fap-becomes-bad-cop-to-protect-news-from-misappropriation-17227" height="61" width="51" /></a></div><p>The Associated Press <a href="http://www.ap.org/pages/about/pressreleases/pr_040609a.html">announced</a> an aggressive new policy whereby it is going to try and get more money for newspaper content online or take legal action against &#8220;copyright violators&#8221; and those that won&#8217;t pay. From the AP&#8217;s own <a href="http://www.ap.org/pages/about/pressreleases/pr_040609a.html">press release</a>:</p>
<blockquote><p><em>The Associated Press Board of Directors today announced it would launch an industry initiative to protect news content from misappropriation online.</em></p>
<p><em>AP Chairman Dean Singleton said the news cooperative would                   work with portals and other partners who properly license content – and                   would pursue legal and legislative actions against those who                   don‘t.</em></p>
<p><em>“We can no longer stand by and watch others walk off with our work under misguided legal theories,“ Singleton said at the AP annual meeting, in San Diego. </em></p></blockquote>
<p>The quote at the end is vaguely reminiscent of the rogue US general in the 1964 Cold War satire <a href="http://en.wikipedia.org/wiki/Dr._Strangelove">Dr. Strangelove</a> who will no longer allow the &#8220;commies&#8221; to &#8220;pollute our precious bolidly fluids.&#8221;</p>
<p>What&#8217;s unclear in the new AP policy is whether the organization would now want money for what I&#8217;ve just done: linking to a piece of AP content and quoting three sentences. It might.</p>
<p>While some of the interviews and discussions surrounding AP&#8217;s announcement suggest that the news association is trying to shut down wholesale or substantial copying of its articles without permission or attribution, there&#8217;s reason to believe that they will be going after the kind of use in this article as well. If they don&#8217;t want money, perhaps they&#8217;ll want advance permission to use the content.</p>
<p>Danny has written a very <a href="http://daggle.com/090406-225638.html">long and detailed discussion on his personal blog</a> of why the AP policy is misguided, especially with regard to search engines (read: Google). Google has also responded on its <a href="http://googlepublicpolicy.blogspot.com/2009/04/some-questions-related-to-google-news.html">Public Policy Blog</a>:</p>
<blockquote><p><em>We show snippets and links under the doctrine of <a id="gxx0" title="fair use" href="http://www.law.cornell.edu/uscode/17/107.shtml">fair use</a> enshrined in the United States Copyright Act.  The fair use doctrine protects transformative uses of content, such as <a id="ttsj" title="indexing to make it easier to find" href="http://www.eff.org/files/filenode/Kelly_v_Arriba_Soft/20030707_9th_revised_ruling.pdf">indexing to make it easier to find</a> [pdf]. Even though the Copyright Act does not grant a copyright owner a veto over such uses, it is our policy to allow any rightsholder, in this case newspaper or wire service, to remove their content from our index &#8212; all they have to do is ask us or implement simple technical standards such as <a id="iokd" style="color: #551a8b;" title="robots.txt and metatags" href="http://googleblog.blogspot.com/2007/01/controlling-how-search-engines-access.html">robots.txt or metatags</a>. </em></p></blockquote>
<p>The <a href="http://www.copyright.gov/title17/92chap1.html#107">Fair Use Doctrine</a> says the following:</p>
<blockquote><p><em>[F]air use of a copyrighted      work, including such use by reproduction in copies or phonorecords or by any      other means specified by that section, for purposes such as criticism, comment,      news reporting, teaching (including multiple copies for classroom use), scholarship,      or research, is not an infringement of copyright. In determining whether the      use made of a work in any particular case is a fair use the factors to be      considered shall include —</em></p>
<p class="secondlevel"><em>(1) the purpose and character of the use, including whether      such use is of a commercial nature or is for nonprofit educational purposes;</em></p>
<p class="secondlevel"><em>(2) the nature of the copyrighted work;</em></p>
<p class="secondlevel"><em>(3) the amount and substantiality of the portion used      in relation to the copyrighted work as a whole; and</em></p>
<p class="secondlevel"><em>(4) the effect of the use upon the potential market for      or value of the copyrighted work.</em></p>
</blockquote>
<p class="secondlevel">The notion of what is &#8220;fair use&#8221; is somewhat ambiguous when applied to the Internet. It&#8217;s also at the center of this brewing controversy and may eventually get litigated by the AP. This would be a big risk for AP and newspapers more generally because a decision could go against them. To my incomplete knowledge the doctrine has not been tested in a web-related context around linking and excerpting content online. The AP <a href="http://www.nytimes.com/2009/04/07/business/media/07paper.html?scp=1&amp;sq=ap%20google&amp;st=cse">maintains</a>, however, this isn&#8217;t a &#8220;fair use&#8221; issue:</p>
<blockquote><p><em>News aggregators and search companies have long asserted that collecting snippets of articles — usually headlines and a sentence or two — is allowed under the legal doctrine of “fair use.” News organizations have been reluctant to test that idea in court, and it is still not clear whether The A.P. is willing to test the fair use doctrine. </em></p>
<p><em>“This is not about defining fair use,” said Sue A. Cross, a senior vice president of the group, who added several times during an interview that news organizations want to work with the aggregators, not against them. “There’s a bigger economic issue at stake here that we’re trying to tackle.”</em></p></blockquote>
<p>PaidContent&#8217;s <a href="http://www.paidcontent.org/entry/419-interview-dean-singleton-chairman-ap-ceo-medianews-setting-the-rules-of/">interview</a> with Dean Singleton sheds some light on the thinking behind the decision and announcement:</p>
<blockquote><p><em>“I think our industry has been very timid about protecting our content, probably because we’ve done so well in the past few years that we didn’t recognize that misappropriation is as serious an issue as it is. As we’re now relooking at business models, it’s become clear that we must protect the rights of our content. &#8230; We perhaps have been timid about enforcing [those rights]. No more. We own the content but we’ve let those who spend very little, if any, get the most advantage from it.” What can they get from it at this point? “I am very confident that we will develop new models that help us get more.”</em></p></blockquote>
<p>Newspapers want to exert more exclusive control over their content because they have lost faith in their ability to make money from advertising (read: search engine traffic) and will soon be trying to charge for access to it. Subscription and micro-payments models are being evaluated across the industry. Added Singleton:</p>
<blockquote><p><em>“What’s becoming clear is that for many, many decades, advertising has supported our news mission and it’s not supporting it today. &#8230; If we don’t solve these issues, then survival is not assured. And we expect to solve these issues.”</em></p></blockquote>
<p>So where do I start?</p>
<p>In fairness to the newspapers and AP there has been a lot of lifting of their content without permission, attribution or payment. Copyright holders do need to vigorously protect their interests or risk losing them. Having said that, the newspapers are partly hiding behind the copyright issue when, in reality, they have failed to compete successfully online. While they do create great content in many instances, they have lagged in innovating around business models and user experiences.</p>
<p>Danny suggests that the newspapers create &#8220;their own Hulu.&#8221; It&#8217;s a good suggestion. Long ago I also <a href="http://gesterling.wordpress.com/2006/03/01/newspapers-should-move-faster-part-ii/">suggested</a> that newspapers become news aggregators themselves. Why didn&#8217;t the newspapers invent Digg or Yelp or Trulia? Though there were and are <a href="http://gesterling.wordpress.com/2008/07/02/newspapers-and-social-nets/">some interesting experiments</a> going on across the industry, there&#8217;s something in the culture of newspapers, it would appear, that has effectively inhibited them from fully embracing the Internet and new thinking about their content &#8212; beyond putting it up online. In fairness, there&#8217;s also a resource issue too.</p>
<p>While I acknowledge the right of newspapers to control their content, there&#8217;s a way in which this policy flies in the face of the culture of the Internet. It&#8217;s consistent with the worldview and approach that has landed newspapers in the predicament they&#8217;re in today. On some level it seeks to turn back the clock.</p>
<p><span>And while it&#8217;s true hypothetically that if there never was a Google News or Yahoo News or <span>Topix</span> (itself owned by newspapers) perhaps people would be </span>going directly to newspapers sites to get their news and other local content. Yet the nature of the Internet made these aggregator sites all but inevitable. Why didn&#8217;t the newspapers develop them first?</p>
<p>I acknowledge the dire predicament newspapers are in and that they need to &#8220;do something.&#8221; (I&#8217;ve devoted <a href="http://gesterling.wordpress.com/?s=newspapers">lots of writing on my blog</a> to newspapers.) And honestly I wouldn&#8217;t have any magic solution if I were being asked to devise a strategy to &#8220;save the industry&#8221; at this point. However they should seek to exploit distribution &#8212; including mobile and its new opportunity to charge for content &#8212; rather than limit it.</p>
<p>There&#8217;s also a way in which this newly announced policy is a giant PR blunder for AP and by extension newspapers. They might have consulted with Google and other &#8220;aggregators&#8221; and negotiated a more online-friendly policy before going public with an approach that feels both hostile and desperate.</p>
<p>___</p>
<p><strong>Postscript</strong>: Google&#8217;s Eric Schmidt addresses the NAA (newspaper) conference (<a href="http://newshare.typepad.com/newshare/2009/04/audio-google-ceo-eric-schmidt-addresses-the-naa.html">audio</a>).</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/ap-becomes-bad-cop-to-protect-news-from-misappropriation-17227/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
