<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Search Engine Land &#187; Microsoft: Live Search Academic</title>
	<atom:link href="http://searchengineland.com/library/microsoft/microsoft-live-search-academic/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: News On Search Engines, Search Engine Optimization (SEO) &#38; Search Engine Marketing (SEM)</description>
	<lastBuildDate>Fri, 25 May 2012 23:34:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Microsoft Burns Book Search &#8211; Lacks &#8220;High Commercial Intent&#8221;</title>
		<link>http://searchengineland.com/microsoft-burns-book-search-lacks-high-consumer-intent-14066</link>
		<comments>http://searchengineland.com/microsoft-burns-book-search-lacks-high-consumer-intent-14066#comments</comments>
		<pubDate>Fri, 23 May 2008 15:21:01 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Microsoft: Live Search Academic]]></category>
		<category><![CDATA[Microsoft: Live Search Books]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/microsoft-burns-book-search-lacks-high-consumer-intent-14066.php</guid>
		<description><![CDATA[Wow. Earlier this month, I heard Microsoft execs highlight Live Search Books as a key advance they were doing in search. Today, news that the site along with Live Search Academic are being closed next week, since they lack the &#8220;high consumer intent&#8221; that Microsoft is now focusing on. So much for trying to seem [...]]]></description>
			<content:encoded><![CDATA[<p>Wow. Earlier this month, I
<a href="http://searchengineland.com/080501-063136.php">heard</a> Microsoft
execs highlight <a href="http://search.live.com/results.aspx?q=&amp;scope=books">
Live Search Books</a> as a key advance they were doing in search. Today,
<a href="http://blogs.msdn.com/livesearch/archive/2008/05/23/book-search-winding-down.aspx">
news</a> that the site along with
<a href="http://search.live.com/results.aspx?scope=academic&amp;q=">Live Search
Academic</a> are being closed next week, since they lack the &#8220;high consumer
intent&#8221; that Microsoft is now focusing on. So much for trying to seem like the
much-needed &#8220;good&#8221; alternative to Google. From Microsoft&#8217;s blog post:</p>
<p><span id="more-14066"></span></p>
<blockquote>Today we informed our partners that we are ending the Live Search Books and
Live Search Academic projects and that both sites will be taken down next
week. Books and scholarly publications will continue to be integrated into our
Search results, but not through separate indexes.</p>
<p>This also means that we are winding down our digitization initiatives,
including our library scanning and our in-copyright book programs. We
recognize that this decision comes as disappointing news to our partners, the
publishing and academic communities, and Live Search users.</blockquote>
<p>Amazing. And disappointing. Microsoft got mileage out of the idea it was
working with the <a href="http://www.opencontentalliance.org/">Open Content
Alliance</a> as the &#8220;good&#8221; book search partner not encumbered by controversy
that the <a href="http://books.google.com/">Google Book Search</a> service has
encountered. For background on that, see:</p>
<ul>
<li><a href="http://searchengineland.com/061220-110934.php">Battle For Books:
Evil Google Versus The Altruistic Open Content Alliance</a>, Dec. 2006</li>
<li><a href="http://searchengineland.com/070306-101103.php">Dissecting
Microsoft Slams At Google As Copyright Infringer</a>, March 2007</li>
<li><a href="http://searchengineland.com/071106-102435.php">Google: As Open As
It Wants To Be (i.e., When It&#8217;s Convenient)</a>, Nov. 2007</li>
</ul>
<p>While I had wanted to see the two rival book scanning projects cooperate more
with each other, it was still nice to have both major companies involved. But
apparently Microsoft figures going after the consumer searchers (as with the
<a href="http://searchengineland.com/080521-133000.php">Live Search Cashback
program</a>) means there&#8217;s no time for important but less-revenue generating
projects like these. Writes Microsoft:</p>
<blockquote>Given the evolution of the Web and our strategy, we believe the next
generation of search is about the development of an underlying, sustainable
business model for the search engine, consumer, and content partner. For
example, this past Wednesday we announced our strategy to focus on verticals
with high commercial intent, such as travel, and offer users cash back on
their purchases from our advertisers.</blockquote>
<p>Gosh, Google somehow seems to be able to run a sustainable business model and
devote some energy and resources into indexing books and scholarly information,
even if those generate little to no revenue. They do it in part because they
think it&#8217;s good business to provide all types of searches, not just those that
will earn them money.</p>
<p>In the middle of a search war, I can understand that a &#8220;distraction&#8221; like
book and academic search might seem like something to Microsoft that has to go.
However, Microsoft&#8217;s not hurting for cash to keep it up, if it wanted. Dropping
it makes Google seem less like the evil giant working for its own benefit that
Microsoft would hope people view it as.</p>
<p>Oh well. For the record,
<a href="http://search.live.com/results.aspx?q=&amp;scope=books">Live Search Books</a>
launched in December 2006 (versus <a href="http://books.google.com/">Google Book
Search</a>, launched in December 2003).
<a href="http://search.live.com/results.aspx?scope=academic&amp;q=">Live Search
Academic</a> launched in April 2006 (versus
<a href="http://scholar.google.com">
Google Scholar</a>, launched back in November 2004). Both Google programs
continue.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/microsoft-burns-book-search-lacks-high-consumer-intent-14066/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft On Poprank And Indexing Objects For Vertical &amp; Web Search</title>
		<link>http://searchengineland.com/microsoft-on-poprank-and-indexing-objects-for-vertical-web-search-10847</link>
		<comments>http://searchengineland.com/microsoft-on-poprank-and-indexing-objects-for-vertical-web-search-10847#comments</comments>
		<pubDate>Thu, 29 Mar 2007 06:06:47 +0000</pubDate>
		<dc:creator>Bill Slawski</dc:creator>
				<category><![CDATA[Microsoft: Bing]]></category>
		<category><![CDATA[Microsoft: Bing SEO]]></category>
		<category><![CDATA[Microsoft: Live Search Academic]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/microsoft-on-poprank-and-indexing-objects-for-vertical-web-search-10847.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<p>A new paper from Microsoft Research, <a href="http://research.microsoft.com/users/jrwen/jrwen_files/publications/Web%20Object%20Retrieval.pdf">Web Object Retrieval</a> (pdf), discusses an approach towards Web indexing that changes focus from page level, to that of indexing objects found upon pages.</p>
<p>OK, so that does that mean? It&#8217;s easiest to show you first, rather than tell you&#8230;</p>
<p><span id="more-10847"></span>
<strong>Microsoft Product Search</strong></p>
<p>Take a look at Microsoft&#8217;s <a href="http://products.live.com/">Products Search</a> (http://products.live.com/).  Brian Smith went into a lot of detail on the Microsoft&#8217;s product search last May in <a href="http://www.comparisonengines.com/2006/05/19/ecommerce-msn-style/">eCommerce, Microsoft Style</a>.  Microsoft&#8217;s Live Product Search allows people to <a href="http://productupload.live.com/">upload product information</a> into their database, but it also crawls the Web, and extracts information about products.</p>
<p><strong>Libra Academic Search</strong></p>
<p>Another example of indexing on the object level from Microsoft Research Asia, <a href="http://libra.msra.cn/">Libra Academic Search</a>, is a computer science bibliography search engine.   The page &#8220;<a href="http://libra.msra.cn/about.htm">About the academic search</a>&#8221; includes links to a number of papers upon object level retrieval, including an earlier technical report version of the Web Object Retrieval paper.</p>
<p><strong>More than Products and Papers</strong></p>
<p>The product search and the paper search are narrow vertical searches that focus upon crawling Web pages, and finding information that fits within those areas.  The academic paper search not only tries to find the names of papers, but also authors, conferences, journals, and research communities.  The <em>Web Object Retrieval</em> paper focuses upon extracting that information from pages.  The goal of the research extends beyond products and papers.  As the authors tell us:</p>
<blockquote>We believe object-level Web search is particularly necessary in building vertical Web search engines such as product search, people search, scientific Web search, job search, community search, and so on.</blockquote>
<p><strong>Incorporation of Object Indexing into Live Search</strong></p>
<p>The product search and the academic paper search are useful, but how well would they do as part of the Web search that Microsoft offers?  According to a news article from Microsoft, <a href="http://research.microsoft.com/displayarticle.aspx?id=1492">Search Objective Gets a Refined Approach</a>, those searches have already been integrated into Windows Live:</p>
<blockquote>The “vertical” in Object-Level Vertical Search refers to a specific domain, such as academic search or product search, both of which have been incorporated into Windows Live™. The “object” is an item embedded in Web pages or Web databases, such as a product, a person, a paper, or an organization.</blockquote>
<p><strong>The Object-Level Vertical Search Process</strong></p>
<p>The news article also describes the process of extracting and indexing objects in a nice summary:</p>
<p>The first three steps are:</p>
<ul>
<li>Web Crawling: to collect relevant information on the Web efficiently </li>
<li>Classification: Does a page contain information on products, papers, people, or some other desired category? </li>
<li>Extraction: pulling specific information about the search query from the relevant Web pages. For a product, for instance, that could mean product name, brand, image, description, and price. </li>
</ul>
<p>In other words, after finding the information, and understanding that it relates to a specific category, they are putting it into a structured format so that, for instance, products can be compared to one another.  There&#8217;s more to the process, though:</p>
<ul>
<li>Integration: Combining the gathered object information into a concise whole. This includes resolving Web-page idiosyncrasies and naming conventions and making sure that similarly named objects are integrated only if they relate to the actual object being sought. </li>
<li>Ranking: There are two types of ranking. One, static rank, is handled well by the PopRank algorithm. The second, relevance, is trickier, because an object might be popular, but irrelevant to the query at hand. Because the object description is integrated from multiple Web pages, developing a ranking mechanism is a challenge.</li>
</ul>
<p>As they note in the article, this method could be used for job searches, for restaurant searches, and even for blog searches.</p>
<p><strong>Ranking Objects by Link Analysis, or PopRank</strong></p>
<p>The last item in the list above talks about ranking objects, and discusses two different parts to that ranking.  One is a matter of relevance.  The other is a query independent ranking, which they refer to as Poprank.  They state that ranking objects may be especially difficult because the object descriptions may come from more than one Web page.  So, what is this Poprank?</p>
<p>The answer to that question is likely in another Microsoft paper, <a href="http://www2005.org/cdrom/docs/p567.pdf">Object-Level Ranking: Bringing Order to Web Objects</a> (pdf):</p>
<blockquote>Because it is clear that the more popular the objects are, the more likely the user will be interested in them. So a natural question is: could the popularity of Web objects be effectively computed by also applying link analysis techniques? This paper targets to answer this question. Our answer to the question is yes, but quite different technologies are required because of the unique characteristics of object graph.</blockquote>
<p>To see Poprank in action, try out the Libra Academic Search linked to above.</p>
<p><strong>Ranking for Relevance</strong></p>
<p>Another Microsoft paper that provides an overview of this object extraction and indexing process, <a href="http://www-db.cs.wisc.edu/cidr/cidr2007/papers/cidr07p26.pdf">Object-level Vertical Search</a> (pdf), introduces the concept of relevancy ranking in its last section, but doesn&#8217;t go into much detail on the topic.</p>
<p>Our <a href="http://research.microsoft.com/users/jrwen/jrwen_files/publications/Web%20Object%20Retrieval.pdf">newest paper</a> (pdf), referred to at the top of this post, does explain how Microsoft might use different language models to estimate the relevance between an object and a query.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/microsoft-on-poprank-and-indexing-objects-for-vertical-web-search-10847/feed</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Dissecting Microsoft Slams At Google As Copyright Infringer</title>
		<link>http://searchengineland.com/dissecting-microsoft-slams-at-google-as-copyright-infringer-10670</link>
		<comments>http://searchengineland.com/dissecting-microsoft-slams-at-google-as-copyright-infringer-10670#comments</comments>
		<pubDate>Tue, 06 Mar 2007 14:11:03 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Google: Book Search]]></category>
		<category><![CDATA[Google: Critics]]></category>
		<category><![CDATA[Google: General]]></category>
		<category><![CDATA[Legal: Copyright]]></category>
		<category><![CDATA[Legal: Crawling & Indexing]]></category>
		<category><![CDATA[Microsoft: General]]></category>
		<category><![CDATA[Microsoft: Live Search Academic]]></category>
		<category><![CDATA[Microsoft: Live Search Books]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/dissecting-microsoft-slams-at-google-as-copyright-infringer-10670.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<p>Last October, Microsoft&#8217;s Steve Ballmer
gave us a taste of how Microsoft was going to position Google as a copyright
leech. Today, Microsoft launched a full-out assault on the company. Google
deserves some of this, no doubt. But the idea as Microsoft as some altruistic
copyright savior deserves some critical analysis, as well. Below, I&#8217;ll dissect
Microsoft&#8217;s slams against Google, pointing out where they can be redirected back
at Microsoft itself. But overall, I remain in agreement that Google should shift
book search to an opt-in basis when dealing with copyrighted works.</p>
<p>As noted, last October
<a href="http://www.businessweek.com/technology/content/oct2006/tc20061011_940241.htm">
in BusinessWeek</a>, Microsoft&#8217;s Steve Ballmer positioned Google as
&quot;transferring the wealth out of the hands of rights holders.&quot; In December,
Google came <a href="http://searchengineland.com/061220-110934.php">under more
fire</a> from the Open Content Alliance, of which Microsoft is part of, as
trying to create a closed book search system for only its benefit.</p>
<p>Those were only warm-ups to today&#8217;s broadside. Tom Rubin, associate general counsel for Microsoft, delivered a speech  to the Association Of American Publishers painting Google as a copyright
infringer not to mention dastardly evil ad seller. You&#8217;ll find the entire speech
<a href="http://www.microsoft.com/presspass/exec/trubin/03-05-07AmericanPublishers.mspx">
here</a>. Coverage also comes from the
<a href="http://www.ft.com/cms/s/3109938c-cb61-11db-b436-000b5df10621.html">
Financial Times</a>, the
<a href="http://hosted.ap.org/dynamic/stories/W/WA_MICROSOFT_GOOGLE_WAOL-?SITE=AP&#038;SECTION=HOME&#038;TEMPLATE=DEFAULT">
Associated Press</a> plus also see roundup coverage from Techmeme
<a href="http://www.techmeme.com/070306/p2#a070306p2">here</a> and
<a href="http://www.techmeme.com/070306/p23#a070306p23">here</a>. Below, I&#8217;ll go through the speech with his references to
Google and give you my running commentary on the accusations.</p>
<p><span id="more-10670"></span></p>
<p><b>Google Creates Nothing</b></p>
<blockquote>
<p><i>The second reason I’m pleased to be here is because we have much in
common. I recognize, of course, that the works that you help create and
publish, and the works we create at Microsoft, seem very different. Still, we
share a common understanding of the creative process. We both understand the
time and commitment it takes to develop the first germ of an idea into a
finished work. More importantly, we both understand the risk it involves –
that despite all of our best efforts, a book or software product can still
fall flat in the market. I suspect we share many of the same values when it
comes to preserving incentives for creativity, so that people will continue to
invest in creating works of the very highest quality, not just today, but long
into the future.</i></p>
<p><i>Remaining true to these values is particularly important as content
moves online. I think we can all agree that using the Internet to enhance the
market for works is a crucial endeavor and that doing so creates tremendous
new opportunities to reach customers. However, the reality, as many of you
know, is that authors and publishers often find it difficult just to cover
their costs, let alone make a profit, in this new online world. At the same
time, companies that create no content of their own, and make money solely on
the backs of other people’s content, are raking in billions through
advertising revenue and IPOs.</i></p>
</blockquote>
<p>This is the first slam. Google is suggested (you&#8217;ll see it IS Google later
on) as making no content and just cashing in on others. In contrast, Microsoft
creates content (IE, software). That&#8217;s not entirely so. If software is content,
Google makes plenty of it, such as:</p>
<ul>
<li>
<a href="http://gmail.google.com/">
Gmail</a></li>
<li><a href="http://www.google.com/calendar">Google Calendar</a></li>
<li>
<a href="http://desktop.google.com/">Google Desktop</a></li>
<li>
<a href="https://www.google.com/accounts/ServiceLogin?service=writely&#038;passive=true&#038;continue=http://docs.google.com/&#038;ltmpl=WR_tmp_2_lfty&#038;nui=1&#038;utm_campaign=en&#038;utm_source=en-et-more&#038;utm_medium=more">Google Docs
&amp; Spreadsheets</a></li>
<li><a href="http://earth.google.com/">Google Earth </a></li>
<li><a href="http://www.google.com/talk">Google Talk</a></li>
<li>
<a href="http://toolbar.google.com/">Google Toolbar</a></li>
<li><a href="http://picasa.google.com/">Picasa</a></li>
</ul>
<p>Aside from that, assembling information from others IS content. It&#8217;s not easy
to do, and it is a real benefit to both consumers and content owners. When
content czar Lloyd Braun at Yahoo (now no longer there) &quot;fumed&quot;
<a href="http://www.nytimes.com/2005/09/24/technology/24yahoo.html?ex=1285214400&#038;en=baa0c95217958be5&#038;ei=5090&#038;partner=rssuserland&#038;emc=rss">
that</a> Yahoo didn&#8217;t have its &quot;own&quot; content on Yahoo News about the Discovery
space shuttle mission in 2005, I
<a href="http://blog.searchenginewatch.com/blog/050926-085249">fumed</a> that he
was missing the point:</p>
<blockquote>
<p>Create your own programming [as a search engine], and people may not trust
you&#8217;re going to point them elsewhere. Indeed, that type of pointing IS
programming and worked to bring people to Google in droves back when search
engines became portals and decided they needed to have &quot;channels&quot; and their
own content.</p>
</blockquote>
<p><b>Google&#8217;s Not Innovative</b></p>
<blockquote>
<p><i>Microsoft, I’m pleased to say, has chosen the former path. At its soul,
Microsoft is an innovation company, and we’ve been working hard for many years
to develop innovative technologies that allow readers to experience books
online in new and exciting ways. </i></p>
</blockquote>
<p>After this statement, Rubin then goes on to talk about the two main book
projects Microsoft has, both of which are reactions to preexisting and
long-standing products from Google. If the suggestion is Google&#8217;s not innovative
&#8211; as I take it &#8212; the reality is Microsoft is following in footsteps here. For
the record, the projects are:</p>
<ul>
<li><font face="Verdana, Helvetica, sans-serif">
<a href="http://search.live.com/results.aspx?q=&#038;scope=books">Live Search Books</a>,
launched December 2006 (versus <a href="http://books.google.com/">Google Book
Search</a>, launched back in December 2003)</font></li>
<li><a href="http://search.live.com/results.aspx?scope=academic&#038;q=">Live
Search Academic</a>, launched April 2006 (versus
<font face="Verdana, Helvetica, sans-serif">
<a onclick="s_objectID=&quot;http://scholar.google.com/_1&quot;;return this.s_oc?this.s_oc(e):true" href="http://scholar.google.com">
Google Scholar</a>, launched back in November 2004)</font></li>
</ul>
<p><b>Watch For Google Ads In Books!</b></p>
<blockquote>
<p><i>What I find exciting about all of these initiatives is that they use
great advances in technology to dramatically expand access to works, yet in a
way that respects copyright. We believe this is the right path and is one that
adheres to the three principles I already mentioned. We also think this
distinguishes our approach from one that all of you are familiar with: the
Google Book Search project.</i></p>
<p><i>The stated goal of Google’s Book Search project is to make a copy of
every book ever published and bring it within Google’s vast database of
indexed content. While Google says that it doesn’t currently intend to place
ads next to book search results, Google’s broader business model is
straightforward – attract as many users as possible to its site by providing
what it considers to be “free” content, then monetize that content by selling
ads. I think Pat Schroeder put it best when she said Google has “a hell of a
business model – they’re going to take everything you create, for free, and
sell advertising around it.” </i></p>
</blockquote>
<p>All of which can be entirely applied to Microsoft&#8217;s own projects. They don&#8217;t
&quot;currently&quot; carry ads either. But I don&#8217;t think Microsoft has ever ruled that
out. Remember, Microsoft has no problem selling
<a href="http://flickr.com/photos/dannysullivan/290902490/in/set-72157594281310826/">
virtual billboards</a> in 3D representations in its mapping world &#8212; ads where
no ads actually exist. So let&#8217;s not pretend that somehow books would be
considered off limits, unless we see a pledge like that.</p>
<p><b>Tricking Libraries &amp; Getting &quot;Several&quot; Publishers To Cooperate</b></p>
<blockquote>
<p><i>To accomplish its book search goals, Google persuaded several libraries
to give it unfettered access to their collections, both copyrighted and public
domain works. It also entered into agreements with several publishers to
acquire rights to certain of their copyrighted books. Despite such deals, in
late 2004 Google basically turned its back on its partners. Concocting a novel
“fair use” theory, Google bestowed upon itself the unilateral right to make
entire copies of copyrighted books not covered by these publisher agreements
without first obtaining the copyright holder’s permission. </i></p>
</blockquote>
<p>&quot;Persuaded&quot; suggests that libraries somehow can&#8217;t think for themselves. To my
knowledge, nothing prevents these libraries from also working with others. In
fact, I believe the University Of California, to name one, is working with both
Microsoft and Google.</p>
<p>Before this part of his speech, Rubin talked a lot about Microsoft&#8217;s
publisher program, the way it gets copyrighted content into its system:</p>
<blockquote>
<p><i>The second source is our Publisher program, under which we receive books
still under copyright from publishers with their express permission, either in
digital form directly from the publisher, or scanned from hard copy.
Participating publishers have access to an online site – or dashboard – that
enables them to manage their publications on Live Search Books. They can
choose the amount of text that a reader may preview, create click-to-buy links
next to their books, edit metadata, and so on. Several major publishers have
signed on to the Publisher Program. </i></p>
</blockquote>
<p>Google has a long-standing similar program, and that&#8217;s downplayed. Agreements
with &quot;several&quot; publishers for &quot;certain&quot; copyrighted books as Rubin describes is
a far cry from hundreds (I think perhaps thousands) of publishers that
voluntarily have contributed thousands of books to the Google program. But I
think a picture is worth a thousands words:</p>
<p>
<a href="http://www.flickr.com/photos/dannysullivan/262511284/" title="Photo Sharing">
<img src="http://static.flickr.com/103/262511284_92d4a83ddb.jpg" alt="Google Booth At Frankfurth Book Fair" border="0" width="500" height="375"></a></p>
<p>That&#8217;s the Google both at the Frankfurt Book Fair that I
<a href="http://daggle.com/061113-045255.html">attended</a> last September. I
believe the fair is the largest gathering of book publishers in the world. There
was massive building after massive building filled with publishers.</p>
<p>Notice the picket signs? Notice the angry publishers storming the Google
booth? Publishers are upset with Google, yes. But some of those same publishers
ironically are also partners in the program. And plenty are partners in the
program without being angry at Google. Google is actively involved in the
publishing community and has far more contributions than Rubin&#8217;s speech
suggests.</p>
<p><b>Microsoft Self-Rules On Copyright</b></p>
<blockquote>
<p><i>Google’s chosen path would no doubt allow it to make more books
searchable online more quickly and more cheaply than others, and in the short
term this will benefit Google and its users. But the question is, at what
long-term cost? In my view, Google has chosen the wrong path for the longer
term, because it systematically violates copyright and deprives authors and
publishers of an important avenue for monetizing their works. </i></p>
</blockquote>
<p>As a reminder, we have no idea if scanning books to make them searchable
(which is different from reprinting them online) is a copyright violation.
That&#8217;s the point of the current lawsuits against Google. For all we know, what
Google is doing is perfectly legal. As for the &quot;deprive&quot; argument, we&#8217;ve also
yet to see how this has happened. Google simply does NOT reprint books that are
in copyright online unless they have permission to do this.</p>
<p><b>Opt-In, Not Opt-Out</b></p>
<blockquote>
<p><i>Rather than delve into this arcane legal issue, what we really should be
asking is whether it would be possible for Google to provide its Book Search
service in a way that respects copyright. The answer to this question is: of
course there is. How am I so sure? Well, because we at Microsoft are doing it.
And not just Microsoft. We and others are working on search-driven projects
that are proceeding with the express permission and support of copyright
owners. And then there’s Google’s own Publisher Partner program, which makes
book content available online only after obtaining the necessary
authorization. </i></p>
<p><i>So, what we really have here are two fundamentally different paths.
Google takes the position that everything may be freely copied unless the
copyright owner notifies Google and tells it to stop. Microsoft and most other
companies, by contrast, take the position that they should get the copyright
owner’s consent before they copy. The Copyright Act, in our view, supports
this approach. It’s hard to see any justification for exempting Google from
its requirements.</i></p>
</blockquote>
<p>First, Google&#8217;s position is that everything may be freely copied to make a
searchable index. That&#8217;s different, as I&#8217;ve said (and read
<a href="http://daggle.com/061113-045255.html">in-depth here</a>), from putting
the actual books online. Rubin at least acknowledges that &quot;book content&quot; is only
put online (IE &#8212; reprinted) by Google with authorization.</p>
<p>But let&#8217;s be clear. In general, Microsoft does exactly what Google does in
terms of taking content and indexing it without permission. That&#8217;s how most of
its search services operate. They index pages unless site owners specifically
opt-out. Microsoft doesn&#8217;t call up a site owner and say &quot;Hey, we want to spider
your pages. OK?&quot; It just does that. And over in Belgium, that group that sued
Google over indexing news content? Yeah, don&#8217;t forget they
<a href="http://searchengineland.com/070118-085917.php">went after</a> Microsoft
as well.</p>
<p><b>Books Are Different!</b></p>
<p>Of course, I&#8217;ve long argued that any search engine asks for permission to
index content through things like the robots.txt blocking mechanism. Until
recently, I also struggled to understand why book publishers think they should
be so special. Time after time, I&#8217;d look at the site of a book publisher  that was upset
with Google for &quot;infringing copyright&quot; by indexing books. Those same publishers
don&#8217;t block Google from spidering their copyright-protected web pages. So why&#8217;s it OK to copy a web page
for indexing purposes but not a book? What makes my content online somehow free
game but more protected just because it&#8217;s printed on dead trees?</p>
<p><a href="http://daggle.com/061113-045255.html">Search Engines, Permissions &amp;
Moving Forward In Copyright Battles</a> is a post I did last year where I
finally switched tothe side of book publishers against Google. I felt they were
special because unlike online, there&#8217;s no automatic way for them to opt-out:</p>
<blockquote>
<p>Still, Google shouldn&#8217;t be scanning them, not the in copyright books, not
without permission. First and foremost, this is because unlike with the web,
there&#8217;s no automated way to ask permission. I fully support web indexing, but
I support it because there&#8217;s an easy way to get permission. That&#8217;s not the
case with books in copyright. Google can&#8217;t ask if indexing is OK. Since they
can&#8217;t ask, I don&#8217;t think they should do it.</p>
<p>Similar to with cached pages, I think Google should back down. Google
briefly paused scanning once before. I think they should again, say they feel
they&#8217;re on solid legal ground but again to be a good corporate citizen,
they&#8217;re putting things on hold until they can either work out an automated way
to seek permission or until they negotiate deals.</p>
</blockquote>
<p>As you can see, I&#8217;m with Rubin here. Moreover, I&#8217;m with the idea that if
Google hadn&#8217;t started with what&#8217;s often seen as its typical arrogant &quot;let&#8217;s just
do it&quot; attitude and instead  talked more with publishers, it might not be set up
as the copyright bad guy now. Having said that, any search player making such
accusations walks a dangerous line, because it&#8217;s easy to turn the focus back on
them and find similar examples &#8212; as we shall see.</p>
<p><b>Google&#8217;s Bad At Copyright Protection</b></p>
<blockquote>
<p><i>From the perspective of your business, Google’s approach is troubling
for another reason. It assumes, in effect, that Google is the only game in
town. Google argues that authors and publishers should simply notify Google if
they want to preserve their rights in their works. But what if, as is
inevitable, other companies around the world start taking the same approach?
Should copyright owners be obligated to track down everyone engaging in
unauthorized copying in order to preserve their exclusive rights in their
works? Presumably, the desire to preserve these rights is why they asserted
copyright in the first place. This approach would be absolutely unworkable in
practice, which is probably why Congress in enacting the Copyright Act placed
the burden on those who want to copy to get the express consent of the
copyright owner, rather than the other way around.</i></p>
<p><i>In essence, Google is saying to you and to other copyright owners:
“Trust us &#8211; you’re protected. We’ll keep the digital copies secure, we’ll only
show snippets, we won’t harm you, we’ll promote you.” But Google’s track
record of protecting copyrights in other parts of its business is weak at
best. Anyone who visits YouTube, which Google purchased last year, will
immediately recognize that it follows a similar cavalier approach to
copyright. Since YouTube’s inception, television companies, movie studios and
record labels have all complained that the site knowingly tolerates piracy. In
the face of YouTube’s refusal to take any effective action, copyright owners
have now been forced to resort to litigation. And Google has yet to come up
with a plan to restrain the massive infringements on YouTube. </i></p>
</blockquote>
<p>Google deserve huge slams over the issue with YouTube. At the same time, we
also know it inherited much of that mess, rather than created it. Had Microsoft
bought YouTube, it would be mum here.</p>
<p>But more important, does Rubin really want people to start searching on
Windows Live to see what copyright infringing content the search engine links
to? At least in the US, Microsoft &#8212; just like Google &#8212; isn&#8217;t required to pull
down links to such content until the content owners themselves raise issues. A
big problem here is that copyright law needs to change. But Microsoft is hardly
going to be as innocent as it sounds. No search engine will be. But their
nature, search engines can link to content that violates copyright without even
knowing.</p>
<p><b>Google Helping Copyright Infringers</b></p>
<blockquote>
<p><i>Another example is equally disturbing. Microsoft was surprised to learn
recently that Google employees have actively encouraged advertisers to build
advertising programs around key words referring to pirated software, including
pirated Microsoft software. And we weren’t the only victims – Google also
encouraged the use of keywords and advertising text referring to illegal
copies of music and movies. These actions bolstered websites dedicated to
piracy and reportedly netted Google around $800,000 in advertising revenues
from just four such pirate sites. These are not the actions of a company that
has the interests of copyright owners as one of its priorities.</i></p>
</blockquote>
<p>Ouch! And Google has that coming. My jaw certainly dropped to
<a href="http://searchengineland.com/070212-091048.php">discover</a> it had some
ad account people actively suggesting that some sites accused of offering
pirated movies should buy terms like:</p>
<ul>
<li>bootleg movie download</li>
<li>pirated</li>
<li>download harry potter movie</li>
</ul>
<p>For balance, this sounds more like some bad reps rather than a Google company
policy. But still, Google deserves the slam. Then again, how does Microsoft do
on those terms? Let&#8217;s do
<a href="http://search.live.com/results.aspx?q=bootleg+movie+download&#038;mkt=en-US&#038;form=QBRE&#038;go.x=0&#038;go.y=0&#038;go=Search">
bootleg movie download</a>:</p>
<p><a href="http://www.flickr.com/photos/dannysullivan/412616034/" title="Photo Sharing">
<img src="http://farm1.static.flickr.com/184/412616034_518de26089.jpg" width="500" height="285" alt="Bootleg Ads On Live Search" border="0" /></a></p>
<p>Hmm. Microsoft&#8217;s making money off the term. That&#8217;s not so bad, if the sites
are all legal. Are they? I suspect at least one of them might be iffy. What
about
<a href="http://search.live.com/results.aspx?q=pirated+software&#038;mkt=en-US&#038;form=QBRE&#038;go.x=16&#038;go.y=15&#038;go=Search">
pirated software</a>:</p>
<p><a href="http://www.flickr.com/photos/dannysullivan/412615951/" title="Photo Sharing">
<img src="http://farm1.static.flickr.com/180/412615951_445b5f0366.jpg" width="500" height="256" alt="Pirated On Live Search" border="0" /></a></p>
<p>I have to chuckle at that ad. Really, it makes me laugh. See, that ad is
making Microsoft money on a per click basis. When you click, you end up on a
page that is simply more ads from &#8212; Google! That page also contains scraped
search results for that search,
<a href="http://search.live.com/results.aspx?q=pirated+software&#038;mkt=en-US&#038;form=QBRE&#038;go.x=16&#038;go.y=15&#038;go=Search">
pirated software</a>, from Microsoft&#8217;s own search engine. Potentially, that&#8217;s a
copyright violation. So Microsoft is making money off someone making money off
Google with content that possibly is infringed from Microsoft.</p>
<p><b>Conclusion</b></p>
<p>Overall, I have to say it&#8217;s disappointing seeing Microsoft come out on an
attack stance rather than be positive about what it is doing. Google deserves
slams, and I wish they&#8217;d change to an opt-in policy for copyrighted books. But
for me, with perspective, Microsoft comes across as someone trying to play
catch-up and willing to be negative to do it. I don&#8217;t like that in political
campaigns, and I guess I don&#8217;t like it any more in the search wars. But most
important, it&#8217;s a dangerous game to play. The more Microsoft paints itself as
some type of pure protector of copyright, the harder it will fall as people find
examples where it fails to meet expectations.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/dissecting-microsoft-slams-at-google-as-copyright-infringer-10670/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Microsoft Releases Live Search Books Beta</title>
		<link>http://searchengineland.com/microsoft-releases-live-search-books-beta-9990</link>
		<comments>http://searchengineland.com/microsoft-releases-live-search-books-beta-9990#comments</comments>
		<pubDate>Wed, 06 Dec 2006 12:29:06 +0000</pubDate>
		<dc:creator>Barry Schwartz</dc:creator>
				<category><![CDATA[Microsoft: Live Search Academic]]></category>
		<category><![CDATA[Microsoft: Live Search Books]]></category>
		<category><![CDATA[Search Engines: Academic Search Engines]]></category>
		<category><![CDATA[Search Engines: Book Search Engines]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/microsoft-releases-live-search-books-beta-9990.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<p>The Live Search Blog <a href="http://blogs.msdn.com/livesearch/archive/2006/12/05/live-search-books-beta-release.aspx">announced</a> that Microsoft released a beta named <a href="http://books.live.com/">Live Search Books</a> this morning.  Plus they enhanced <a href="http://academic.live.com/">Live Search Academic</a> by adding millions of new articles, plus indexing theses, dissertations, and books within these disciplines.</p>
<p><span id="more-9990"></span>
The details on Live Search Books Beta:</p>
<blockquote>The U.S. beta launch of Live Search Books is a big step forward in advancing the way people discover information through the integration of content that has been “off-limits</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/microsoft-releases-live-search-books-beta-9990/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.250 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2012-05-25 20:25:14 -->
<!-- Compression = gzip -->
