<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>searchengineland.com &#187; Q&amp;A Land</title>
	<atom:link href="http://searchengineland.com/library/columns/qa-land/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: Must Read News About Search Marketing &#38; Search Engines</description>
	<lastBuildDate>Sat, 21 Nov 2009 03:30:01 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Karen Wickre: Mother Of The Google Blog On Google&#8217;s Official Blogging</title>
		<link>http://searchengineland.com/karen-wickre-mother-of-the-google-blog-on-googles-official-blogging-12462</link>
		<comments>http://searchengineland.com/karen-wickre-mother-of-the-google-blog-on-googles-official-blogging-12462#comments</comments>
		<pubDate>Thu, 18 Oct 2007 10:20:59 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Google: Marketing]]></category>
		<category><![CDATA[Q&A Land]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/karen-wickre-mother-of-the-google-blog-on-googles-official-blogging-12462.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fkaren-wickre-mother-of-the-google-blog-on-googles-official-blogging-12462"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fkaren-wickre-mother-of-the-google-blog-on-googles-official-blogging-12462" height="61" width="51" /></a></div><p><a href="http://www.flickr.com/photos/dannysullivan/1612052795/" title="Photo Sharing">
<img src="http://farm3.static.flickr.com/2235/1612052795_5ffdf8a934.jpg" width="500" height="249" alt="Karen Wickre" border="0" /></a></p>
<p>When Google launched the <a href="http://googleblog.blogspot.com/">Official
Google Blog</a> back in 2004, it started as sort of a ho-hum event. There wasn&#8217;t
anything particularly gripping, and some wondered if the company should be more
edgy with its posts. Since then, Google has launched more than 70 additional
official blogs over the years. Some have gotten edgy; a few even have comments, but most
important, they&#8217;ve turned into an essential communications vehicle for the
company.</p>
<p>A few weeks ago, I sat down with Karen Wickre, who aside from her
formal title at Google of senior manager, global communications &amp; public
affairs, is who I&#8217;d best describe as &quot;mother of the blogs.&quot; Karen is ultimately
responsible for how they all work, and she shared some insight  on how Google makes use of blogging.
Will the big three of Google &#8212; Larry, Sergey and Eric &#8212; ever blog? Will comments come to the main Google blog? Will Google&#8217;s blogging replace press releases? Read on&#8230;.</p>
<p><span id="more-12462"></span></p>
<p><b>Danny: </b>How would you characterize your role in relation to the blogs?</p>
<blockquote>
<p><b>Karen: </b>I shepherd them. I call myself the managing editor, and I&#8217;m
the gatekeeper for the new blogs. I&#8217;ve been with Google a long time and have
worked in publishing a long time, so I can&#8217;t help but pay attention to the
words &#8212; to the ways in which Google is communicating with the world.</p>
</blockquote>
<p><b>Danny: </b>I&#8217;ve tended to think of you as mother of the main blog and the
various blogs in general. Fair to say?</p>
<blockquote>
<p><b>Karen: </b>Mother of the blog is as fair as anything else. [Former PR staffer] Nate
Tyler and I pitched the idea of a blog to our team and got agreement to turn
it on in May 2004. This was after we&#8217;d announced the IPO, but before we went
public.</p>
<p>[NOTE: Posts to try out the blog went up in
<a href="http://googleblog.blogspot.com/2004_04_01_googleblog_archive.html">
April 2004</a> but were not made public until after the first
<a href="http://googleblog.blogspot.com/2004/05/is-this-thing-on.html">
official post</a> in May. The very first post
<a href="http://googleblog.blogspot.com/2004/05/whaddya-mean-we.html">seeking</a>
job recruits has since been deleted].</p>
<p>Initially the agreement was one post per month. There was no
opposition to the idea, but our lawyers literally combed over every word,
since we were then in our quiet period. I would say do not launch a blog
during a quiet period &#8212; or if you do, expect to work with a lot of lawyers on
every word!</p>
<p>I love editing and making things read better. That&#8217;s what an editor does. A
blog is a publication, after all. The idea that I could guide product managers
and other Googlers to get new topical blogs off the ground, for example, is
very appealing. </p>
</blockquote>
<p><b>Danny: </b>Google has so many blogs now. Do you have to read through each
post before they go live, or do some of the blogs have more freedom than others?</p>
<blockquote>
<p><b>Karen:</b> While it&#8217;s important to have a review, I never want to
overwrite what a Googler is saying about their topic or product. All posts are
reviewed by a few relevant people on the immediate team, plus a PR person for
approval. As a rule, this isn&#8217;t labor-intensive or overbearing. We try to
encourage original perspectives and stories insofar as company blogs can
feature those. We share drafts in Google Docs and do edits there. Again, I try
hard not to overwrite or have the team wordsmith to death. That&#8217;s not going to
get us interesting reads.</p>
<p>We have different categories of blogs, and the type
influences how reviews work. There are product blogs [like the
<a href="http://googlereader.blogspot.com/">Google Reader Blog</a>], developer
blogs [like the <a href="http://googleajaxsearchapi.blogspot.com/">Google AJAX
Search Blog</a>], country-specific blogs [like the
<a href="http://google-au.blogspot.com/">Google Australia Blog</a>] and some
vertical interest topics [like the
<a href="http://googlepublicpolicy.blogspot.com/">Google Public Policy Blog</a>].
All posts go through someone for approval.</p>
<p>The product blogs are the biggest group, and we have lots of people
interested in posting and writing up stuff.  With the developer blogs, there
tends to be much less editing. There&#8217;s no sense wordsmithing there. Developers
are a natural blog-reading constituency. In some cases, posts are so
well-written from the start, and have a PR staffer on the blog team, such as
the Google Public Policy blog, that there is very easy collaboration. </p>
</blockquote>
<p><b>Danny: </b>It feels like Google is running more official blogs than most
other companies. Any idea if this is the case, if Google is somehow unique in
the amount of blogging it is doing?</p>
<blockquote>
<p><b>Karen: </b>We do have more than 70 blogs now. I feel like Google is one
of the few places doing this differently. There are other companies with
many blogs, but those tend to be unofficial blogs written by individual developers,
such as those from Sun, IBM and Microsoft.  </p>
<p>Our network of corporate blogs are definitely all company-based, not
personal communications, but we aim to have lots of Googlers writing and covering
lots of topics in interesting ways.</p>
<p>Of course, in addition to our many official blogs, we&#8217;ve also got individuals like <a href="http://mattcutts.com/blog/">
Matt Cutts</a> [from Google's web search quality team] posting. We&#8217;d love more
like those.</p>
<p>For us, the blog platform is a fast way to publish news and notes about
Google and to directly reach millions of people. It&#8217;s so fast and easy
compared to newsletters or, God-forbid, press releases. It is a PR platform,
but we try hard to make it not traditional PR-like. That&#8217;s why we want our
individuals from teams actually &quot;in the trenches&quot; to do the bulk of the
writing.</p>
</blockquote>
<p><b>Danny: </b>So are the blogs working well enough that you might give up
press releases?</p>
<blockquote>
<p><b>Karen: </b>Press releases are not going away. There are legal requirements
and business requirements for press releases, so they serve a function. Down
the line, maybe the whole industry will adopt blogs for public communications.
This isn&#8217;t up to Google to determine, of course. But we&#8217;ve never been a company that issues a ton of press releases.</p>
<p>Much more often than a press release or a Google Gram [these are special
email alerts sometimes sent to selected reporters], we&#8217;ll issue a blog post. </p>
<p>[Note From Danny: Indeed, over the past two years or so, product coverage
I've done often revolves around when a blog post will go up announcing it,
rather than a press release being issued. Google often will tell me and others, &quot;and the blog post will go out on....&quot;]</p>
</blockquote>
<p><b>Danny: </b>How many readers do the various blogs have?</p>
<blockquote>
<p><b>Karen: </b>The main Google Blog has over 500,000 [this was in mid-August -- now it is
nearly 700,000 readers]. After that, the Google China Blog has many readers,
as does Google LatLong. Anything to do with geo stuff is always hugely
popular!</p>
<p>NOTE: Karen emailed this week that she also tallied up everything at the
end of September using a combination of Google Analytics and FeedBurner.
Across the blog network, Google had 16.6 million readers in the first 9 months
of the year. She also sent this list of the blogs with the most subscribers
according to FeedBurner:</p>
<ol>
<li><a href="http://googleblog.blogspot.com/">Google Blog</a>: 724,522 subscribers<br />
&nbsp;</li>
<li><a href="http://googlereader.blogspot.com/">Google Reader Blog</a>: 79,553 subscribers <br />
&nbsp;</li>
<li><a href="http://blog.orkut.com/">Orkut Blog [Portuguese]</a>: 45,956 subscribers
[Orkut is huge in Brazil, and this is reflected in the fact that the
official Orkut blog at <a href="http://blog.orkut.com/">blog.orkut.com</a>
is in Portuguese, while English posts about Orkut are relegated to
<a href="http://en.blog.orkut.com/">en.blog.orkut.com</a>].<br />
&nbsp;</li>
<li><a href="http://googlejapan.blogspot.com/">Google Japan Blog</a>: 19,781 subscribers <br />
&nbsp;</li>
<li><a href="http://googlewebmastercentral.blogspot.com/">Google Webmaster Central Blog</a>: 19,926 subscribers <br />
&nbsp;</li>
<li><a href="http://google-code-updates.blogspot.com/">Google Code Blog</a>: 19,257 subscribers <br />
&nbsp;</li>
<li><a href="http://adsense.blogspot.com/">Inside AdSense</a>: 16,301 subscribers </li>
</ol>
</blockquote>
<p><b>Danny: </b>Is there any attempt to try and coordinate postings across the
blogs, what people are blogging about, or set any schedule or number of posts to
happen?</p>
<blockquote>
<p><b>Karen:</b> We do coordinate posts when we have news that touches on
different areas. For example, a Google.org initiative that ties in to our
corporate environmental projects gets posted on two blogs, but each one is
different. We don&#8217;t duplicate posts. A post that&#8217;s about YouTube policy may
run on the YouTube blog and another one on Google blog. </p>
<p>We do encourage a focus on more interesting items and a regular frequency
and not just an endless stream of a product updates and how-to info. So we
always aim for useful information and a good read. We don&#8217;t have a quota
system. We do encourage frequency, but writing something just to have a post
doesn&#8217;t work.</p>
</blockquote>
<p><b>Danny:</b> How do you work with those teams to encourage more quality
posting?</p>
<blockquote>
<p><b>Karen: </b>First, a team has to propose the blog and really understand
what&#8217;s required to maintain it. A PR &quot;designated hitter&quot; has to be involved and
work with the product team to get a product blog going. We&#8217;re all in
agreement by the time it&#8217;s live. There have been some country blogs where
there have been one or two interested people running it. In a smaller
market, that would be OK. In a bigger country, we would need more
people to contribute.</p>
</blockquote>
<p><b>Danny: </b>Sometimes a post on one of Google&#8217;s sub-blogs also runs on the
main blog. Is there competition to be &quot;good enough&quot; for that?</p>
<blockquote>
<p><b>Karen: </b>Definitely. Someone will write me and say, &quot;I don&#8217;t know if
this is big enough for the Google Blog.&quot; I have a wide-ranging readership in mind, which
is global for the Google Blog, and so I ask if the post reflects the
multiplicity of Google enough. Things that are small or a feature update on
some product probably won&#8217;t go on Google blog unless we want to elevate its
visibility.</p>
</blockquote>
<p><b>Danny: </b>First the Google Librarian Blog, then Google Webmaster Central
Blog gained comments <a href="http://searchengineland.com/070214-061805.php">
earlier this year</a>, something some
<a href="http://searchengineland.com/070102-090541.php">argue</a> is required
for a blog to call itself a blog. How has that been going? Will more blogs gain
them?</p>
<blockquote>
<p><b>Karen: </b>If the team is going to be vigilant about watching them,
sure. We want a good read and a conversation, which means watching for spam
comments and way off-topic stuff. Developers generally know how to handle
comments, so those are easy blogs to enable comments for.</p>
</blockquote>
<p><b>Danny: </b>How about comments on the main Google Blog?</p>
<blockquote>
<p><b>Karen: </b>We&#8217;ve been talking about it for a long time internally. Some
of us would like to, even though there&#8217;s plenty of concern about the time
required to monitor comments. We aren&#8217;t out to censor criticism, mind you, but
to keep the reading and the exchanges useful and informative takes constant
review and ability to reply quickly. </p>
<p>Lots of people even outside of Google tell me this is just too hard to
manage. After all, big sites like The Washington Post have
<a href="http://www.poynter.org/column.asp?id=31&#038;aid=95618">had
problems</a> with comment abuse. So we continue to watch, and evaluate, for
the Google Blog itself. Some of our more targeted blogs with smaller
readership following very defined topics have an easier time monitoring to
maintain an interesting exchange.</p>
</blockquote>
<p><b>Danny:</b> <a href="http://searchengineland.com/070622-092755.php">Google
News &amp; Finance</a>, <a href="http://searchengineland.com/070510-080343.php">
Google Maps &amp; Google Earth</a> and the
<a href="http://searchengineland.com/070618-060937.php">Google Public Policy
Blog</a> are all new blogs that have gone up in the past few months. What&#8217;s
next?</p>
<blockquote>
<p><b>Karen:</b> We may have a few more coming from Google Japan [<a href="http://analytics-ja.blogspot.com/">Google
Analytics Blog Japan</a> went up since we talked; over in China,
<a href="http://www.googlechinawebmaster.com/">one</a> for webmasters has also
been launched]. Perhaps one on Picasa, to cover tips on working with photos [<a href="http://googlephotos.blogspot.com/">Google
Photo Blog</a> went up last month].</p>
</blockquote>
<p><b>Danny: </b>I&#8217;ve joked in the past that I can never remember the address of
<a href="http://googleblog.blogspot.com">googleblog.blogspot.com</a> for the
main official blog, much less <a href="http://google-latlong.blogspot.com">
google-latlong.blogspot.com</a> for the Google Maps &amp; Earth Blog, Google
Lat-Long. Will we get a better address?</p>
<blockquote>
<p><b>Karen: </b>We do want to move to blogs.google.com eventually. We&#8217;re
still working out how to do it with <a href="http://www.blogger.com/">Blogger</a>
(Google&#8217;s own blog authoring tool, that it uses for its own blogs).</p>
</blockquote>
<p><b>Danny: </b>Are all the blogs on Blogger? Isn&#8217;t
<a href="http://googlechinablog.com/">Google China Blog</a> not using Blogger? </p>
<blockquote>
<p><b>Karen: </b>We use Blogger wherever it is offered. In China, we don&#8217;t offer Blogger as
a consumer product,
so we publish our various Chinese Google blogs within Blogger and use a third
party host.</p>
</blockquote>
<p><b>Danny: </b>Has anyone said they want to use a different blogging platform?</p>
<blockquote>
<p><b>Karen: </b>I don&#8217;t think that&#8217;s ever come up. Googlers with their own
individual blogs can use whatever they want [Matt Cutts uses WordPress, for
example]. But we want to showcase Blogger as much as we can.</p>
</blockquote>
<p><b>Danny: </b>How about the Google founders Larry Page and Sergey Brin
blogging, or Google CEO Eric Schmidt?</p>
<blockquote>
<p><b>Karen: </b>The big three? I don&#8217;t know. They&#8217;re so busy. We&#8217;d certainly
accommodate if they wanted to. They do, however, appreciate our blogs. I think they appreciate the
&quot;direct to you&quot; approach. I&#8217;ve worked with
them on a few of the posts that are essentially  statements. These are a nice
way to state our position so that reporters can work from it directly. It also becomes
our standing statement on something. That&#8217;s been nice for us in a few
instances where we&#8217;d never put out a press release.</p>
<p>The <a href="http://searchengineland.com/070125-230048.php">Google bombing
phenomena</a> is an example of this. I think it was almost two years ago, we
put out an
<a href="http://googleblog.blogspot.com/2005/09/googlebombing-failure.html">
explanation</a> of what this thing was. That has had huge legs in the
blogosphere. People have pointed to it, linked to it and said &quot;Here&#8217;s the
definitive statement on this thing.&quot;</p>
</blockquote>
<p><b>Danny: </b>What&#8217;s are the most popular posts you&#8217;ve done?</p>
<blockquote>
<p><b>Karen: </b>Anything to do with Google Earth and Google Book Search has a
lot of readers. And our funny bits on the main Google Blog, like the
<a href="http://googleblog.blogspot.com/2007/02/strawberries-are-red-stems-are-green.html">
explanation</a> about the Valentine&#8217;s Day doodle or
<a href="http://googleblog.blogspot.com/2005/12/im-feeling-silly.html">100
pounds of Silly Putty</a>. </p>
<p>NOTE: Karen has since sent me this comprehensive list:</p>
<ol>
<li>
<a href="http://googleblog.blogspot.com/2007/02/strawberries-are-red-stems-are-green.html">
Strawberries are red, stems are green</a>, Feb. 14, 2007: 226,131 pageviews
[Official Google Blog]<br />
&nbsp;</li>
<li>
<a href="http://blog.orkut.com/2007/08/pr-visualizao-da-reestilizao.html">
Pré-visualização da Reestilização: Simplicidade Fiel</a>, Aug. 24, 2007:
117,507 pageviews [Orkut Blog]<br />
&nbsp;</li>
<li>
<a href="http://googleblog.blogspot.com/2007/08/update-on-google-video-feedback.html">
An Update on Google Video Feedback</a>, Aug. 20, 2007: 111,967 pageviews
[Official Google Blog]<br />
&nbsp;</li>
<li>
<a href="http://googleblog.blogspot.com/2005/12/build-your-own-google-homepage.html">
Build your own Google homepage</a>, Feb. 13, 2005: 103,701 pageviews
[Official Google Blog]<br />
&nbsp;</li>
<li><a href="http://googlereader.blogspot.com/2007/06/doing-shuffle.html">
Doing the Shuffle</a>, June 22, 2007: 80,811 pageviews [Google Reader Blog]<br />
&nbsp;</li>
<li>&quot;<a href="http://googlereader.blogspot.com/2007/04/go-go-reader-gadget.html">Go
Go (Reader) Gadget</a>, April 6, 2007: 72,252 pageviews [Google Reader Blog]<br />
&nbsp;</li>
<li>
<a href="http://googlewebmastercentral.blogspot.com/2007/02/discover-your-links.html">
Discover your links</a>, Feb. 5, 2007: 62,985 pageviews [Google Webmaster
Central Blog]<br />
&nbsp;</li>
<li>
<a href="http://googlewebmastercentral.blogspot.com/2007/01/quick-word-about-googlebombs.html">
A quick word about Googlebombs</a>, Jan. 25, 2007: 62,640 pageviews [Google
Webmaster Central Blog]<br />
&nbsp;</li>
<li><a href="http://googlejapan.blogspot.com/2006/09/google-earth.html">&#26085;&#26412;&#35486;&#12398;
Google Earth</a>, Sept. 14, 2006: 61,953 pageviews [Google Japan Blog]<br />
&nbsp;</li>
<li>
<a href="http://googlereader.blogspot.com/2007/09/breaking-up-isnt-hard-to-do.html">
Breaking up isn&#8217;t hard to do</a>, Dec. 18, 2006: 60,229 pageviews [Google
Reader Blog]<br />
&nbsp;</li>
<li>
<a href="http://google-code-updates.blogspot.com/2007/02/speaking-of-summer.html">
Speaking of Summer</a>, Feb. 15, 2007: 35,652 pageviews [Google Code Blog]<br />
&nbsp;</li>
<li>
<a href="http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-ocr.html">
Announcing Tesseract OCR</a>: Aug. 30, 2006: 30,536 pageviews&nbsp; [Google
Code Blog]<br />
&nbsp;</li>
<li>
<a href="http://google-code-updates.blogspot.com/2007/04/google-releases-patches-that-enhance.html">
Google releases patches that enhance the manageability and reliability of
MySQL</a>, April 23, 2007: 29,339 pageviews&nbsp; [Google Code Blog]<br />
&nbsp;</li>
<li>
<a href="http://adsense.blogspot.com/2007/10/introducing-video-units.html">
Introducing video units</a>, Oct. 8, 2007: 17,197 pageviews [Inside AdSense]<br />
&nbsp;</li>
<li><a href="http://adsense.blogspot.com/2007/08/get-inline.html">Get inline</a>,
Aug. 21, 2007: 16,272 pageviews [Inside AdSense]</li>
</ol>
</blockquote>
<p>Looking for more about Google and blogging? Last year, Karen did an hour-long podcast on working on the
blog you&#8217;ll find
<a href="http://www.thecorporatebloggingshow.com/2006/11/28/google-senior-editor-karen-wickre-on-how-she-manages-googles-official-corporate-blog/">
here</a>. Also see <a href="http://searchengineland.com/070122-160804.php">10
Google Feeds You Should Subscribe To</a> on how to keep up on everything from
all Google blogs, if you want.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/karen-wickre-mother-of-the-google-blog-on-googles-official-blogging-12462/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Q&amp;A With Seth Godin, Founder &amp; CEO Of Social Search Service Squidoo</title>
		<link>http://searchengineland.com/qa-with-seth-godin-founder-ceo-of-social-search-service-squidoo-11284</link>
		<comments>http://searchengineland.com/qa-with-seth-godin-founder-ceo-of-social-search-service-squidoo-11284#comments</comments>
		<pubDate>Wed, 23 May 2007 13:34:11 +0000</pubDate>
		<dc:creator>Chris Sherman</dc:creator>
				<category><![CDATA[Q&A Land]]></category>
		<category><![CDATA[SEM Industry: General]]></category>
		<category><![CDATA[Search Engines: Social Search Engines]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/qa-with-seth-godin-founder-ceo-of-social-search-service-squidoo-11284.php</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-seth-godin-founder-ceo-of-social-search-service-squidoo-11284"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-seth-godin-founder-ceo-of-social-search-service-squidoo-11284" height="61" width="51" /></a></div><p>
<a href="http://searchengineland.com/guides/qa_land.php">
</a> Seth Godin is widely known as one of the foremost advocates of &#8220;permission marketing,&#8221; promotional campaigns that don&#8217;t rely on interrupting the attention of your customers, but rather engage them and even turn them into enthusiastic advocates that volunteer time and effort to help you promote your products or services.</p>
<p>Godin doesn&#8217;t shy away from controversy. He&#8217;s tweaked the noses of other well-known pundits, and in recent years has also blasted search marketers, accusing them of a variety of sins.  But Godin is no stranger to search: Yoyodyne, an interactive direct marketing company  he company he founded and ran was acquired by Yahoo in 1998. One of his current efforts is a human-powered search service (though he wouldn&#8217;t call it that): <a href="http://www.squidoo.com">Squidoo</a>.</p>
<p>I asked Godin to elaborate on his issues with search marketing and to talk a bit about his goals and aspirations with Squidoo.  Read on for his fascinating views and opinions on what search marketers are currently doing wrong and what they can do to improve their campaigns and create a more effective dialogue with searchers.</p>
<p><span id="more-11284"></span>
<b>Q. A few years ago, you wrote that you thought most search engine optimization wasn&#8217;t worth the money.  Since then, your stance seems to have softened somewhat, with your acknowledging the work of some pros like Andrew Goodman and Aaron Wall.  What&#8217;s your position on SEO today?</b></p>
<p>I think it&#8217;s important for everyone to act as if it doesn&#8217;t work. When it becomes a crutch, when you use SEO as a replacement for something else, you fail. I do agree that common sense white hat SEO is a smart tool&mdash;if you&#8217;ve done everything else&mdash;and your organization has appropriate scale. Short version: No magic bullets.</p>
<p><b>Q: What about companies that are operating in ruthlessly competitive areas, like travel, gambling, jewelry, etc? One of Yahoo&#8217;s chief spam cops, Tim Mayer, once said &#8220;If you&#8217;re being entirely organic and going after &#8216;Viagra,&#8217; it&#8217;s like taking a sword to a gunfight. You just aren&#8217;t going to rank.&#8221; After all, there are black hats out there&mdash;shouldn&#8217;t you be realistic and compete with the same techniques they&#8217;re using?</b></p>
<p>Sure, travel is a brutally competitive area. And if you treat it like a commodity, you&#8217;ll get treated like a commodity in return. My point is that a tough business is not an excuse to cheat or to rely on tactics that don&#8217;t last.</p>
<p>If I ran a travel site, I&#8217;d engage my best customers to build blogs and Squidoo lenses and to use Digg to point to reviews and insights and things that would make people WANT to seek me out.</p>
<p>Spending money on ads or commodity-focused SEO is the last gasp of someone who is short on innovation, imagination and great stuff!</p>
<p><b>Q. At SES New York you said that people were attending the conference because &#8220;search is broken.&#8221; Can you elaborate on that?</b></p>
<p>Well, if search worked, then you wouldn&#8217;t need a strategy! People would find you when you needed to be found, and find someone else the rest of the time. Of course, search is always going to be a bit broken (though it keeps getting better) and the more human person to person recommending that gets included (including <a href="http://squidoo.com">squidoo.com</a>), the better it&#8217;s going to work.</p>
<p>Q: You describe Squidoo as &#8220;1) thousands of people creating a handbuilt catalog of the best stuff online 2) a free and fun way to make your own page and get traffic 3) a place to find what you&#8217;re looking for, fast. How does Squidoo differ from other community-built sites, like About.com, Wikipedia or others?</p>
<p>Wikipedia has one voice. Every contributor is anonymous. About.com has a few hundred voices, and only page on a topic. We&#8217;re a cross between the two, but we add in 65,000 people building the lenses&#8230; so you can find the people who you trust, the judgment that you rely on, and getting something more focused and up to date than about.com.</p>
<p><b>Q: I&#8217;ve found Squidoo to be helpful, but have also searched for some things where nobody has yet built a &#8220;lens&#8221; (focused topic page) on the subject. That seems to be a problem with all people-driven search projects&mdash;they don&#8217;t scale. How will you deal with this?</b></p>
<p>Well, they do scale (Wikipedia is xxxx times as big as the Encyclopedia Britannica) but not as fast as you (or I) might like. The win happens when each lensmaster brings in ten more. Do that three times and you&#8217;re 1000 times bigger than you were.</p>
<p><b>Q: I&#8217;ve also been critical of people-driven search because it&#8217;s often hard to tell whether someone is trustworthy or credible in their recommendations. How do you address these issues of trust and reliability?</b></p>
<p>You&#8217;re exactly right. You can&#8217;t tell. You can&#8217;t tell if a Squidoo lens is legit, or if Wikipedia got hacked. But as long as a) it&#8217;s better than the alternatives and b) improvable and c) another point of view then I argue that it&#8217;s worthwhile.</p>
<p>Blogs aren&#8217;t reliable either, but I&#8217;m glad they exist!</p>
<p><b>Q. If someone were just starting out (or starting over) with a web site, what key things would you recommend they spend time doing to attract visitors and engage with customers online?</b></p>
<p>Far and away job one: Make it easy for your best customers to tell their friends. Make products worth blogging about, sites that host consumer content that they promote, affiliate programs they want to embrace. Build the entire site about permission and not a pretty edifice.</p>
<p><b>Q: The Wall Street Journal recently ran <a href="http://online.wsj.com/public/article/SB117919015237802706-UCx1mxt5G_B3Ywpu15sxmhia6HQ_20080514.html">an article</a> about companies that have sprung up  solely to help promote web sites by encouraging people to email articles to friends, prodding them to vote for content on sites like Digg and so on. This is a close cousin of what you&#8217;re recommending, but it&#8217;s also somewhat artificial. Do you think this is a legitimate approach?  </b></p>
<p>I hate this.</p>
<p>It&#8217;s another reminder of the selfishness and lack of passion that marketers often show.</p>
<p><b>Q. You&#8217;ve said that many search advertisers miss the boat by sending searchers to landing pages that try to get them to buy or convert immediately.  Instead, savvy advertisers develop an &#8220;offer culture&#8221; where they experiment with landing pages and create a step-by-step process that encourages participation rather than a single event. While this sounds great, what about searchers who want to make an immediate purchase?</b></p>
<p>I think what I meant is that many advertisers make one of two common mistakes: a) they get lazy and send everyone to a generic page or worse, to a home page, instead of customizing an offer to match the ad itself, or b) they get impatient and try to turn that first click into a sale right now. While that works sometimes, we all know that real sales take time, and the beauty of permission is that you can educate and follow up later.</p>
<p><b>Q: Many search marketers are small business people&mdash;mom &#038; pops or individuals working the web part-time. That&#8217;s what makes search appealing to them&mdash;it&#8217;s a relatively efficient way to reach many customers without spending a lot of time or money. Permission marketing seems more like a process that takes time and perhaps additional resources to follow up after the first contact.  What suggestions can you offer for search marketers who have limited time or resources to adopt this approach? </b></p>
<p>I think if you&#8217;re not planning on being around in three years, you&#8217;re right, don&#8217;t build an asset that grows over time. Slowing down is sometimes the very best way to speed up.</p>
<p><b>Q. You wrote of the rumored Yahoo/Microsoft merger, &#8220;The best things to ever come out of Yahoo, as far as I&#8217;m concerned, have been the work of individuals. Not of some hyperbolic purple and yellow machine, but from people, strong-willed individuals willing to buck the bureaucracy.&#8221; Can you give some examples? </b></p>
<p>Yahoo Mail, Zeitgeist, Yahoo Finance, Flickr&#8230; these were all fairly renegade efforts, some internal, some not. Compare them to Yahoo Auctions, which was a strategic, top down, carefully focused effort to beat eBay. <i>(Editor&#8217;s note: Yahoo recently announced that it was terminating auctions, working with partner eBay instead.)</i></p>
<p>It&#8217;s not just Yahoo, of course. Most organizations work this way.</p>
<p><b>Q: Yahoo just changed its mission statement&mdash;it&#8217;s now &#8220;To connect people to their passions, communities, and the world’s knowledge.&#8221; Microsoft&#8217;s mission is much vaguer: &#8220;At Microsoft, we work to help people and businesses throughout the world realize their full potential.&#8221; Do you think the two companies would make a good fit?</b></p>
<p>I don&#8217;t think the missions have nearly as much to do with a match as the people. And I worry about the people fit.</p>
<p><b>Q: Crystal ball time: Describe the world of the web 5 years from now from a marketer&#8217;s standpoint. Will people still be using search engines as heavily as now, or will there be new ways we seek out information and products online? Will there be a shift away from traditional types of marketing to new ways of engaging with customers online? And who will the major players be?</b></p>
<p>I think there is going to be a major shift on a number of fronts.</p>
<p>First, permission marketing is really going to hit home when advertisers realize that the only ads they can deliver are ads that people sought out (at least among the most desirable consumers).</p>
<p>Second, anonymous media is going to take a real hit as we discover that lots of anonymous people posting movies and comments and such is a real pain in the neck.</p>
<p>Third, RSS and similar private channels will become much bigger.</p>
<p>And fourth, flipping the funnel and talking through the converted will elect presidents and sell lots of products.</p>
<p><b>Thanks, Seth.</b></p>
<p><i>The <a href="http://searchengineland.com/guides/qa_land.php">Q&#038;A Land</a> column appears Wednesdays at <a href="http://searchengineland.com">Search Engine Land</a>.</i></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/qa-with-seth-godin-founder-ceo-of-social-search-service-squidoo-11284/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Q&amp;A With Garrett Camp, Founder &amp; Chief Architect, StumbleUpon</title>
		<link>http://searchengineland.com/qa-with-garrett-camp-founder-chief-architect-stumbleupon-10901</link>
		<comments>http://searchengineland.com/qa-with-garrett-camp-founder-chief-architect-stumbleupon-10901#comments</comments>
		<pubDate>Wed, 04 Apr 2007 16:34:51 +0000</pubDate>
		<dc:creator>Chris Sherman</dc:creator>
				<category><![CDATA[Q&A Land]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/qa-with-garrett-camp-founder-chief-architect-stumbleupon-10901.php</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-garrett-camp-founder-chief-architect-stumbleupon-10901"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-garrett-camp-founder-chief-architect-stumbleupon-10901" height="61" width="51" /></a></div><p>
<a href="http://searchengineland.com/guides/qa_land.php">
</a> Search engines are great tools to help you find things you suspect exist on the web, but they leave a lot to be desired when it comes to discovering the unexpected or unknown.  Sure, you may find something really cool after doing the result-page/back-button tango for a while, but that&#8217;s an inefficient approach.</p>
<p>Enter <a href="http://www.stumbleupon.com">StumbleUpon</a>, a service that lets you &#8220;channel surf&#8221; the internet, displaying sites that others have &#8220;stumbled upon&#8221; and recommended. StumbleUpon is a hybrid: Part directory of human selected sites, part search engine (the millions of pages recommended by &#8220;stumblers&#8221; have been fully indexed and categorized behind the scenes), part social bookmarking service. It&#8217;s all of these things, and yet offers a unique web discovery experience unlike any other I&#8217;ve seen.</p>
<p>In his first &#8220;Let&#8217;s Get Social&#8221; column, <a href="http://searchengineland.com/070123-050031.php">Forget ABCs &#8211; The Social Media Alphabet Is DNRS</a>, Neil Patel introduced StumbleUpon.  In today&#8217;s Q&#038;A, I asked Garrett Camp, StumbleUpon&#8217;s Founder and Chief Architect, to take us behind the scenes of this intriguing discovery engine.</p>
<p><span id="more-10901"></span>
<b>Q: You started StumbleUpon in November 2001 &#8211; a pretty dark time when lots of web sites &#038; online businesses were crashing and burning.  What inspired you to start at that time? And why did you see a need for this type of service?</b></p>
<p>We realized that search engines are useful when you know exactly what you are looking for, but for discovering new and personally interesting online media keyword search wasn&#8217;t always the best option. We created StumbleUpon so people can &#8220;stumble upon&#8221; sites that have been submitted and rated by like-minded people, rather than presented with the most popular sites for a given keyword. We designed StumbleUpon to blend collaborative human input with machine learning techniques, so users could discover great content they wouldn&#8217;t have thought to search for. We&#8217;ve been focused on social media and collaborative discovery long before the phrase &#8220;Web 2.0&#8243; became popular in 2004, and now have over 2M members discovering and sharing interesting web content.</p>
<p><b>You&#8217;ve said that &#8220;stumbling&#8221; is more a process of discovery than search&mdash;you&#8217;ve even called StumbleUpon a &#8220;remote control&#8221; for the web. Can you describe how StumbleUpon resembles &#8220;channel surfing&#8221; and how you customize the experience for each user?</b></p>
<p>Search is great for textual content or if you know exactly what you want, but if you&#8217;re simply looking to browse through relevant and high-quality web content we think the &#8220;channel-surfing&#8221; metaphor makes more sense. Instead of having to search, scan, click, go back, and repeat, you simply surf directly between relevant content which has been given the &#8220;thumbs-up&#8221; by friends and other web surfers with similar interests. We have nearly 500 topics that users may choose to indicate preferences, so your recommendations become precisely tailored to your interests.
Essentially we combine community reviews and social feedback with clustering and evolutionary methods to create a personalized recommendation engine. This helps people discover interesting websites, photos and videos recommended by their friends and like-minded people.</p>
<p><b>You&#8217;ve got about 9 million sites indexed&mdash;how dynamic is that index as new content is added to the web or old content disappears?</b></p>
<p>We like to think we have the best of the web in our database. Our 2 million registered users stumble around 5 million times a day, so we have a pretty active user base. If they find something new, it&#8217;s incredibly easy for them to submit it to us. All they need to do is click the thumbs-up button on the toolbar and it&#8217;s submitted to our database. We get over 16,000 new URL submissions a day &#8211; all new and unique content endorsed by our members.</p>
<p><b>I&#8217;ve been skeptical of many aspects of social search. StumbleUpon incorporates a number of social search techniques (thumbs up-or-down voting for web pages, tagging, collaborative filtering and so on), and yet also does a lot of algorithmic heavy-lifting in the background.  What kind of indexing and classification are you doing in addition to observing user behavior?</b></p>
<p>We have a classification engine which automatically places content into one of 500 predefined categories based upon on-the-page factors. This means most content submitted can be distributed to interested members even before tags have been applied. We also have the entire database indexed so you can stumble by keyword if you want to get fairly specific&#8230; for example stumbling through just pictures of &#8220;sunsets&#8221; or discovering new &#8220;fonts&#8221;.  We use classification and indexing in combination with social networks and user ratings to improve the quality of recommendations and coordinate the distribution of relevant content.</p>
<p><b>StumbleUpon offers a lot of community features. You can browse the StumbleUpon community to find users with similar interests and even become &#8220;friends&#8221; with them. You also encourage people to share a lot of information about themselves. Does the user experience change depending upon how much self-info you&#8217;ve provided, or the network of connections you&#8217;ve made? How?</b></p>
<p>Your stumbling experience improves as you get more involved in the community. Once you add a picture and start reviewing pages, you&#8217;ll start meeting other community members who like similar types of sites. We&#8217;ve found that most active members will upload a picture or fill out their favorite music, movies etc&#8230; so they can start meeting like-minded people. Over time the social aspect of StumbleUpon makes it more compelling, since you&#8217;ll discover pages your friends have recommended and learn about their tastes as you stumble.</p>
<p><b>What about people who aren&#8217;t comfortable sharing personal information? What kind of privacy measures do you have in place?</b></p>
<p>We have a few privacy features, and how much information you share is completely up to you. If you want to remain an anonymous member without a public presence simply don&#8217;t upload a photo or fill in your profile. You can also choose to not share your ratings if you like, but we&#8217;ve found the vast majority of people don&#8217;t mind other people visiting their profile or reading the reviews they write.</p>
<p><b>One of my favorite features of StumbleUpon is the automatic addition of StumbleUpon ratings and a link to reviews in search results from Google, Yahoo, Ask, AOL and Microsoft. This adds a &#8220;second layer&#8221; of relevance to the results, showing you at a glance which pages have been given a thumbs-up or good reviews by stumblers. The StumbleUpon toolbar also lets you search by keyword, but sends you directly to a web page rather than showing search results. Why take that approach rather than showing a list of the top pages StumbleUpon has indexed for a particular keyword?</b></p>
<p>When developing the Search Reviews feature, we figured that most people didn&#8217;t want to switch to a different search engine, and would rather just have social aspects added to their current search engine of choice. So by adding links to Google, Yahoo et al, you can still query a multi-billion-page index, and get additional social metadata when simple queries are submitted. This means you don&#8217;t need to change your search behavior at all to enjoy the benefits of community endorsements when they exist. As for Stumbling by keyword, it&#8217;s really to just keep the navigation experience consistent, and maintain the sense of serendipity.</p>
<p><b>We&#8217;ve seen small but growing traffic at Search Engine Land from StumbleUpon users.  Any recommendations for search marketers wanting to tap into the StumbleUpon community?</b></p>
<p>StumbleUpon has a unique business model that works well for marketers where we can deliver traffic directly to your site. You can target by category, age, gender and location. So for product launches, distributing audio/visual content or just getting feedback on your blog, StumbleUpon often works better than PPC approaches since targeting is precise and no click through is required. More information on opportunities is available on our <a href="http:///www.stumbleupon.com/ads/">advertisers page</a>.</p>
<p><b>Talk about your future plans.</b></p>
<p>We are always looking to improve the user experience and the technology behind StumbleUpon.  We are currently focused broadly on both social search and video discovery, as well as algorithmic improvements to our recommendation engine. We are also re-designing our entire site and investigating bringing StumbleUpon to other platforms beyond the browser. Stay tuned&#8230;</p>
<p><b>Thanks much, Garrett.</b></p>
<p><em>The <a href="http://searchengineland.com/guides/qa_land.php">Q&#038;A Land</a> column appears Wednesdays at <a href="http://searchengineland.com">Search Engine Land</a>.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/qa-with-garrett-camp-founder-chief-architect-stumbleupon-10901/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Q&amp;A With François Bourdoncle, CEO Of Exalead</title>
		<link>http://searchengineland.com/qa-with-francois-bourdoncle-ceo-of-exalead-10775</link>
		<comments>http://searchengineland.com/qa-with-francois-bourdoncle-ceo-of-exalead-10775#comments</comments>
		<pubDate>Wed, 21 Mar 2007 04:58:39 +0000</pubDate>
		<dc:creator>Chris Sherman</dc:creator>
				<category><![CDATA[Q&A Land]]></category>
		<category><![CDATA[Search Engines: Outside USA]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/qa-with-francois-bourdoncle-ceo-of-exalead-10775.php</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-francois-bourdoncle-ceo-of-exalead-10775"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-francois-bourdoncle-ceo-of-exalead-10775" height="61" width="51" /></a></div><p>
<a href="http://searchengineland.com/guides/qa_land.php">
</a> <a href="http://www.exalead.com">Exalead</a> is one of those great search engines that most people haven&#8217;t heard of.  And that&#8217;s a shame, because it offers some really powerful and unique features that most information professionals love.</p>
<p>Exalead is also the name of an enterprise search company, co-founded by today&#8217;s Q&#038;A participant, François Bourdoncle.  Working on web technologies from the very early days, he co-founded Exalead in 2000 with the goal of revolutionizing the search engine software market by providing users with a unified technology platform to access information in the enterprise. Exalead.com, the web search engine, showcases the company&#8217;s capabilities in search.</p>
<p>Francois and Exalead also play a key role in the development of the government-funded pan-European search engine Quaero, which has had its share of controversy.  Read on for our Q&#038;A&#8230;</p>
<p><span id="more-10775"></span>
<b>You have a long history in web search, having worked on the LiveTopics search refinement tool for AltaVista back in 1996-97, among other things.  What are the most notable changes you&#8217;ve observed in the web search world over the past decade?</b></p>
<p><img src="http://corporate.exalead.com/enterprise/files/img/management/bourdoncle.gif" align="left" hspace="5" vspace="3" border="1" width="101" height="150"> I am intrigued by the sheer number of searches conducted daily. The volume is a reflection that search has moved to the center of all we do online and, for that matter, offline. Using a PC has changed from simply creating a document or some other file form to a process that revolves around search. We&#8217;ve seen the development of search-centric computing and search-based online activities.</p>
<p>Further, we&#8217;re seeing search evolve from a consumable process, that is entering a keyword or two, running the query, finding it and moving on from there. Search activities are increasingly involving the creation of a temporary &#8211; or long-term &#8211; storage or archiving of the search input and findings.</p>
<p>Lastly, there is a negative development we&#8217;re seeing, one that everyone is aware of, if not experiencing: frustration. Finding information continues to be harder than it should be. People quickly lose patience if they can&#8217;t find what they&#8217;re searching for right away, yet most search engines essentially ask people to scroll through lists and lists on pages and pages of results, without any automated organizational assistance.</p>
<p><b>Exalead offers a lot more information on search result pages than other search engines, including thumbnail images, and an option to display other content from a site including directory listings, audio, video or RSS feeds.  Are your users interacting with these additional details?  </b></p>
<p>The short answer is: without a doubt, yes. These features are what set Exalead apart from other Web search engines, and help to increase our customer loyalty and usage. Over the past 18 months, most recently in New York, we&#8217;ve been conducting intensive research studies to gain insight into users&#8217; search habits and preferences.</p>
<p>What we&#8217;ve found is that the thumbnail images Exalead offers for all search results is the most popular feature. We&#8217;ve designed the user interface so that users can turn this feature on or off, but the feedback we&#8217;ve received is that the ability to search visually is a huge advantage. The Internet is second-nature for most people today, especially the younger generations, as young as high school students. They don&#8217;t like to be slowed down by having to click through to a new browser window to see results.</p>
<p>The way we also present &#8220;in-context&#8221; audio, video, and RSS content makes it very easy for novice users to find multimedia content on the Web.</p>
<p><b>Exalead uses Open Directory Project data for directory listings, and it also looks like you give a relevance boost to sites included in the ODP.  A lot of people have said that the ODP is walking dead now, with little if any development or maintenance of the directory. Have you found that to be the case?</b></p>
<p>We use ODP as a way to categorize search results in a richer way that what our automatic classification technology (Related Terms) provides. This has proven very useful for our &#8220;Search By Serendipity&#8221;™ search experience. It is true that ODP has been less frequently updated than it used to be, and that social tagging sites are now being used as a way to organize the Web. We are working on this new trend actively and will introduce new products later this year.</p>
<p><b>I&#8217;ve been critical of tagging, because most people apply tags carelessly. Tags on the web lack a controlled vocabulary and aren&#8217;t based on disciplined ontologies, so they can be ambiguous or even misleading. How do you remove the &#8220;noise&#8221; from tags to make them useful?</b></p>
<p>I am not sure that tags and disciplined ontologies serve the same purpose. Disciplined ontologies are best used for organizing &#8220;long-lived information,&#8221; whereas tags are best used for quick &#038; efficient organization of &#8220;short-lived information,&#8221; such as project-related documents, bookmarks, etc. Therefore, tags offer an interesting trade-off between ease-of-use vs. efficiency for finding relevant information. An interesting example of a mixed approach is Wikipedia, which uses semi-controlled tags that are very similar to ontologies, with the exception that they are not hierarchical.</p>
<p><b>Your advanced search offers some great features no other web search engine offers &#8211; things like phonetic search, approximate spelling, proximity and so on. Other search engines say they don&#8217;t offer these because they are costly and searchers don&#8217;t use them. Why does Exalead have them?</b></p>
<p>Our raison d&#8217;être is to make search easier, less frustrating and more efficient for everyone.  Our advanced features help make searching on exalead.com a more satisfying experience. Both novice and so-called power-searchers use these features to serve their own purposes, to their own advantages. The notion that those features and functions drive up the cost is bogus; expense management occurs at the core infrastructure level, not at the functional level, which would include advance search features. Also, using an enterprise search technology to power a Web search engine is the easiest way to provide these features efficiently.</p>
<p><b>Your &#8220;about&#8221; page states: &#8220;It also adapts to user habits for a uniquely satisfying and efficient search experience.&#8221; Does Exalead do personalization or tailor search results for individual users? If so, how does this work?</b></p>
<p>Yes and no. Right now users can personalize their home page by creating shortcuts, selecting a language preference or by deciding how they want to view results. In the near future, Exalead will be unveiling new search 2.0 features that take personalization and search organization to an entirely new level. We&#8217;ll have more to say on that for another Q&#038;A in the not-too-distant future.</p>
<p><b>There are two ends to the personalization spectrum: Google&#8217;s approach, which essentially observes your behavior and adjusts accordingly, without much other input from you, and Yahoo&#8217;s approach, which requires you to explicitly shape your personal web. I like the automated approach but I find it frustrating that Google won&#8217;t let me &#8220;override&#8221; results I personally consider poor.  What is Exalead&#8217;s approach?   And how do you balance personalization with privacy concerns?</b></p>
<p>Personalization is indeed a very important aspect of search, and one that Exalead is heavily investing in. As usual, we are trying to find the right trade-off between automation and flexibility, and will come up with new ideas for having the &#8220;best of both worlds&#8221; in our forthcoming products.</p>
<p><b>What is your or Exalead&#8217;s involvement with Quaero, the European search engine funded by governments?</b></p>
<p>I must note Quaero is one of the most talked-about, least-understood research projects in recent memory. Quaero is a research initiative funded, in part by the French government and, until recently, the German government. Quaero, which in Latin means &#8220;to search,&#8221; is focused on multimedia search (audio, video, music, images, etc.).</p>
<p>Exalead is one of the four lead companies in the project. These leaders define what the R&#038;D objectives in the project ought to be. Over time, the four leaders will integrate some of Quaero&#8217;s technologies into their products, when these technologies become mature enough. Our search technology is also used by other project members.</p>
<p>Despite what some politicians have said, Quaero is not a collaborative effort aimed at creating a &#8220;Google killer.&#8221; It is a research project on multimedia search technologies, period. In particular, image tagging, speech-to-text and machine translations, etc. It may very well produce technology or technology advancements that improve search, but Exalead is pursuing its own course of product development and go-to-market activity. What comes of Quaero will not determine what Exalead becomes or offers.</p>
<p><b>Thanks, Francois.</b></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/qa-with-francois-bourdoncle-ceo-of-exalead-10775/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Q&amp;A With Stephen Baker, CEO Of Reed Business Search</title>
		<link>http://searchengineland.com/qa-with-stephen-baker-ceo-of-reed-business-search-10404</link>
		<comments>http://searchengineland.com/qa-with-stephen-baker-ceo-of-reed-business-search-10404#comments</comments>
		<pubDate>Wed, 31 Jan 2007 22:00:00 +0000</pubDate>
		<dc:creator>Chris Sherman</dc:creator>
				<category><![CDATA[Q&A Land]]></category>
		<category><![CDATA[Search Engines: B2B Engines]]></category>
		<category><![CDATA[Site & Enterprise Search]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/qa-with-stephen-baker-ceo-of-reed-business-search-10404.php</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-stephen-baker-ceo-of-reed-business-search-10404"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-stephen-baker-ceo-of-reed-business-search-10404" height="61" width="51" /></a></div><p><a href="http://searchengineland.com/guides/qa_land.php">
</a> Reed Elsevier is one of the largest, most influential publishers in the world, a powerhouse in the science and medical, legal, education and business markets.  But the company also has a major online presence, with more than 1,000 web sites in just its business group.</p>
<p>Last year, Reed Business decided to leverage this significant online presence by creating <a href="http://www.zibb.com">Zibb</a>, a business-to-business search engine that allows users to search across all of Reed Business&#8217; web sites.  But Zibb goes beyond that&mdash;Reed also indexes billions of other business-related URLs from all over the web, including competitor web sites.</p>
<p>The result is a very powerful B2B search engine that often returns much better results for business-related queries than mainstream search engines.</p>
<p>Reed Business also decided to integrate this technology into the site search tools for its web sites, which range from the Asia Food Journal to entertainment bellwether Variety. Using search technology from FAST, Zibb on Demand allows each of these sites to offer their own content and the best of the web that&#8217;s related to the site&#8217;s specialty.</p>
<p>To oversee these efforts, Reed Business hired Stephen Baker to be CEO of Reed Business Search.  Stephen is a long-time industry veteran, having put in stints with FAST during its AlltheWeb days, on to Overture and Yahoo, and then back to FAST as general manager of enterprise search. So he&#8217;s the ideal person to drive a search group that covers a vast vertical slice of the web, as well as content from a huge global enterprise.</p>
<p>I caught up with Stephen this week to find out what&#8217;s new with Zibb and talk about the opportunities he sees going forward in B2B search.</p>
<p><span id="more-10404"></span>
<b>Q: The companies that measure traffic say that Google continues to take market share.  Why decide now to start a specialized B2B search service? And why the name Zibb?</b></p>
<p>Why Zibb?  It&#8217;s the name of one of Reed Business&#8217; leading publications in the Netherlands.  We were looking for a name that is memorable and avoids language-boundaries, which Zibb does.  Also, brand names that begin with &#8220;z&#8221; are more memorable and it has the &#8220;BB&#8221; in it which could lead to some fun branding and logo design.</p>
<p>While Google is taking market share, we believe that there is a segment of the B2B market that is not 100% satisfied with the results they get from a &#8220;general&#8221; SE.  Specifically, Outsell reports that approximately 1/3 of B2B professionals do not find the information they are looking for when using Google.  Our research confirms this.</p>
<p>Furthermore, when you look at the amount of time that professionals spend searching, gathering, and analyzing information, you see that a lot of time is wasted &#8220;searching&#8221; for information.  Between 2001 and 2005, the number of hours spent on information-related tasks increased from 8/week to 13/week and the percentage of time spent &#8220;searching&#8221; for information increases from 45% to 55%.  All of these factors combined indicate that a) search is very important in the daily life of a B2B professional and b) they are not always finding what they are looking for at Google.  We feel that this need can be best addressed through a B2B vertical search service.</p>
<p><b>Q: Do you think all this wasted time and effort is a reflection on the skills of searchers?</b></p>
<p>Yes and No.  Certainly, based on our research, utilization of advanced search features on the major engines is low so when searchers are looking for B2B content, they are often confronted with noise in the results that could be eliminated, to a degree, through advanced search functions. However, this behavior points to the underlying expectation that advanced search features shouldn&#8217;t be required to find relevant results.  So, with Zibb (and other vertical SE&#8217;s), we are basically trying to give the searcher what they want:  the &#8220;right&#8221; answer within a narrower context.</p>
<p><b>Q: Do you plan to add advanced search or query refinement tools to maybe help people zero-in even more on their target?</b></p>
<p>Since we&#8217;re really focused on improving search productivity and it is our top priority is to provide relevant results with the search and browse paradigm that characterizes Zibb, currently.  However, we have a tremendous amount of enriched data describing the content that we aggregate that naturally lends itself to advanced features for the power searcher.  So &#8220;yes&#8221;, over time, we will bring some of that meta data to the forefront of the user experience.</p>
<p><b>Q: You&#8217;re using FAST technology to crawl the web, but also massaging the data you get. You&#8217;re crawling 4-5 billion documents every couple of months -What kinds of things are you doing to winnow out non B2B content from search results? And if you&#8217;ve got one big index, how and why do search results differ on Variety vs. a site like Flight International?</b></p>
<p>We are using a series of heuristics to identify relevant B2B content. These range from editorial processes &#8211; we have some of the best subject matter experts in house and they continually help us find sites and content that are relevant to their sector &#8211; to offline analysis of link structures and anchor texts to identify relevant sites during index build.  Of course, we are using an adaptive framework to train the crawler to get smatter with each crawl iteration.</p>
<p>Once we have discarded the non-B2B content, we categorize the content into one of 26 industry taxonomies and describe the content as being a news article, supplier listing, etc. The API&#8217;s for the Reed Business Search platform allow for individual websites to specify the query recall set at the industry, sub-industry, or content type level.  Hence, Variety can consume only media-related search results while Flight International consumes aerospace results.</p>
<p><b>Q: Why is Reed Business including content from competitors in Zibb search results?</b></p>
<p>The goal of Zibb is to provide a search utility for the B2B market.  In order to truly provide a valuable service, we believe that Zibb must be a) contextually relevant to the B2B sector and b) comprehensive across all relevant content sources.</p>
<p><b>Q: I would think some of your competitors wouldn&#8217;t want to be part of this&mdash;do you have any opting out, or putting other requirements on you to allow you to use their content?</b></p>
<p>Yes, we occasionally get calls from competitors asking why we are crawling their site so regularly and thoroughly.  Some have even blocked our crawlers, however, once they learn that Zibb is focused on becoming a B2B search destination focused on all relevant content, not just Reed Business content, and that our ranking algorithm is unbiased towards Reed Business content, they are usually happy to participate.</p>
<p><b>Q: How about some stats? How many companies indexed? How many topics, geographic entities and so on?</b></p>
<p>Currently, we track 17K public companies serving B2B markets, and about 23% of those have strong coverage in our indices.  &#8220;Coverage,&#8221; for our purposes, is a measure of web presence of a particular entity.</p>
<p>Down deep in the tail, we track 1.8M companies also serving B2B professionals, though often in a local or brick and mortar fashion and we&#8217;ve found that their online coverage is much weaker with only about 3% passing our &#8220;Strong coverage&#8221; threshold. Bear in mind that we still track and recognize all of these companies against our full index, we just recognize that many are not well represented online.</p>
<p>Our product categories have 46% strong coverage for the 181K we track, and our topics have 70% strong coverage for the over 2K that we have publicly released to date.</p>
<p>What we&#8217;ll see happening over time is that we&#8217;ll grow our universe of public and private companies tracked and simultaneously we&#8217;ll continue to seek additional web sources of information to strengthen the coverage.  This same strategy will be methodically applied to product categories and topics. With our human and machine power we feel that we have a sustainable, scalable, ever-improving approach that will expand even faster as we build within the B2B community who in turn will start helping to shape our growth.</p>
<p><b>Q: Do B2B searchers differ in how they search from people doing more general searches on Google &#038; Yahoo?</b></p>
<p>We&#8217;re finding that search behavior is relatively consistent.  People have the same information needs in B2B as the do in B2C: Read the news and get updates on products, services and companies; perform research functions; look for places to buy things.  Of course, search behavior within each of these activities varies (for example, people prefer to browse for news &#8220;tell me what&#8217;s new&#8221; and employ geographic/parametric search for vendors &#8220;I&#8217;m looking for a company that makes ball bearings in NJ.&#8221;)  But, in terms of the number of terms per query or the specificity that is employed when looking for eg, product information, there isn&#8217;t too much variation.</p>
<p><b>Q: I would think B2B searchers might tend to have more long tail queries&mdash;more specific in expressing what they&#8217;re looking for. Have you seen that?</b></p>
<p>Yes, this is absolutely the case. By nature, B2B trade media is really about having deep subject matter expertise.  This is reflected in our content and the needs of our search customers.  A portion of the queries that we see are for specific products, parts, and company names which tend to be long-tail queries.</p>
<p><b>Q: How does Zibb differ from Zibb On Demand?</b></p>
<p>The Reed Business Search team has built two products.  The first is a branded B2B search destination, called Zibb.  It is located at <a href="http://www.zibb.com">www.zibb.com</a> and its Dutch counterpart is located at <a href="http://www.zibbsearch.nl">www.zibbsearch.nl</a>. Zibb on Demand is the hosted vertical search service that we provide to our sites, partners, and affiliates.  Zibb on Demand has all of the same features as Zibb and allows the customer to specify the industry and type of content they want included in the results.  You can see a couple of examples:  <a href="http://www.variety.com">Variety.com</a> uses Zibb on Demand to power search across the site, media news search, and media web search.  <a href="http://www.expertbusinesssource.com">Expert Business Source</a> does the same for small business information.</p>
<p><b>Q: Customized verticals have been gaining a lot of popularity recently&mdash;Google custom search, Eurekster&#8217;s Swikis, Rollyo and others. Any plans to create APIs and let users create their own customized B2B vertical search using the Zibb index?</b></p>
<p>Absolutely.  Right now we are focused on building a search platform that meets the immediate needs of our brands, businesses, and partners, however, personalization and custom verticals are features that the Reed Business Search platform support and their deployment will be prioritized as the business needs dictate.</p>
<p><b>Q: What opportunities do you offer for advertisers who want to reach B2B searchers?</b></p>
<p>Right now we are using Google AdSense for Search.  Later this year, we will offer additional advertising products.</p>
<p><b>Q: Any tips on optimizing content for sites that want to rank well in Zibb?</b></p>
<p>We&#8217;re looking for unique, targeted, well described content for the B2B sector.  You can see the vertical industries that we are targeting on the Zibb.com homepage.  If you are writing content targeted to the B2B audience, keep the industry sector in mind and choose relevant phrases.  We&#8217;re also very good about looking for new content via site maps, RSS, and other natural web protocols.  Other than that, if you&#8217;re site is well optimized for Google, Yahoo!, MSN, etc., then you&#8217;ll do fine in Zibb.  And, of course, we have editors on the back-end making sure that our spam detection and categorization algorithms are performing well.</p>
<p><b>Q: What kind of traffic are you getting?  What kind of growth? What growth are you projecting/expecting over the next 1-3-5 years?</b></p>
<p>So far, both growth of Zibb.com and adoption of Zibb on Demand have been strong.  Traffic is growing in excess of 50% month-over-month and Zibb on Demand is currently live on about 30 sites with many, many more scheduled to deploy this year.  Looking out beyond 2007 is anyone&#8217;s guess.  We certainly see a market opportunity for providing a unique B2B search experience and hope that we have a meaningful position in the market beyond 2007.</p>
<p><b>Thanks, Stephen.</b></p>
<p>The <a href="http://searchengineland.com/guides/qa_land.php">Q&#038;A Land</a> column appears Wednesdays at <a href="http://searchengineland.com">Search Engine Land</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/qa-with-stephen-baker-ceo-of-reed-business-search-10404/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Q&amp;A With Gabe Rivera, Creator Of Techmeme</title>
		<link>http://searchengineland.com/qa-with-gabe-rivera-creator-of-techmeme-10278</link>
		<comments>http://searchengineland.com/qa-with-gabe-rivera-creator-of-techmeme-10278#comments</comments>
		<pubDate>Wed, 17 Jan 2007 12:32:59 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Q&A Land]]></category>
		<category><![CDATA[Search Engines: Blog Search Engines]]></category>
		<category><![CDATA[Search Engines: News Search Engines]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/qa-with-gabe-rivera-creator-of-techmeme-10278.php</guid>
		<description><![CDATA[ Over the past decade, I&#8217;ve seen a lot of search tools that were supposed to  transform my life. Few of them have. But Techmeme was one of those. When it kicked off back in September 2005, I wrote a review, gave  it a preliminary thumbs-up and soon found myself addicted. It has [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-gabe-rivera-creator-of-techmeme-10278"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-gabe-rivera-creator-of-techmeme-10278" height="61" width="51" /></a></div><p> Over the past decade, I&#8217;ve seen a lot of search tools that were supposed to  transform my life. Few of them have. But <a href="http://techmeme.com/">Techmeme</a> was one of those. When it kicked off back in September 2005, I wrote a <a href="http://blog.searchenginewatch.com/blog/050913-084749">review</a>, gave  it a preliminary thumbs-up and soon found myself addicted. It has become my  newspaper, my front page guide to what&#8217;s going in the blogosphere relating to  tech.</p>
<p>I met Techmeme creator Gabe Rivera in person for the first time last August,  during the Search Engine Strategies conference in San Jose. I roped him into  being on the Meet The Blog &amp; Feed Search Engines panel at the last minute, where  he joined <a href="http://technorati.com/">Technorati</a>,  <a href="http://www.topix.net/">Topix</a>, <a href="http://www.bloglines.com/">Bloglines</a> and  <a href="http://www.sphere.com/">Sphere</a>. Gabe asked the audience if  anyone had heard of Techmeme. Apparently, no hands went up &#8212; at least in the  audience. All the panelists certainly had.</p>
<p>I love that anecdote, because I feel it largely reflects how Techmeme  operates. The masses might have Digg, but perhaps the influencers have Techmeme.  Certainly plenty of large, influential bloggers I know keep an eye on what it is  covering. But I recommend it for anyone, not just influencers, for the easy way  it organizes what&#8217;s happening with technology stories.</p>
<p>I caught up with Gabe last week, to talk about the service, how it has grown,  operates and future plans.</p>
<p><strong>Q. Do you feel Techmeme has a more early adopter or influencer audience?</strong></p>
<p>Yes, and in particular a lot of influencers in the tech media are Techmeme  readers.  A good number of high profile bloggers and reporters reload Techmeme  all day long in fact.  One sign of which is citations of course, for example the  &#8220;via&#8221; link included the other day on Engadget [<a style="color: blue; text-decoration: underline;" title="blocked::http://www.engadget.com/2007/01/10/cisco-svp-mark-chandler-weighs-in-on-iphone-debacle/" href="http://www.engadget.com/2007/01/10/cisco-svp-mark-chandler-weighs-in-on-iphone-debacle/">link  here</a>] and so many other sites. I&#8217;ve also run into a number of VCs and  executives who use it too, so I&#8217;m pleased with that aspect of the demographics.</p>
<p>The traffic is not huge. I still get fewer than 30,000 daily uniques [unique  visitors per day]. But then these are actual loyal users, since I see very  little search traffic, perhaps due to poor SEO as my permalinks don&#8217;t lead to  individual articles.</p>
<blockquote><p>[Note: Techmeme permalinks are little paper icons that show next to stories,  marked as "Click This" in the illustration below:</p>
<p><a title="Photo Sharing" href="http://www.flickr.com/photos/dannysullivan/183495684/"> <img src="http://farm1.static.flickr.com/69/183495684_1aa1e85d89_o.jpg" border="0" alt="Techmeme Screenshot" width="268" height="98" /></a></p>
<p>To understand why the SEO is "poor," compare how Techmeme <a href="http://www.techmeme.com/070116/p16#a070116p16">lists</a> the TechCrunch  story about <a href="../../070116-074812.php">the Wikiseek  launch</a> to <a href="http://digg.com/tech_news/Wikipedia_Search_Engine_WikiSeek_Launches"> how</a> Digg does it. Techmeme jumps you to where the story is mixed with others on different topics, diluting the overall content or theme of the page. Digg give the story its own dedicated page, which often fills up with user generated content from comments].</p></blockquote>
<p>But I doubt even good SEO will help a lot. People who arrive from Google tend  to be looking for primary documents, not an aggregated page that quotes primary  documents, and therefore rarely return anyway.</p>
<p><strong>Q. Have you considered the idea of creating a Techmeme widget or button of  some type, similar to those offered by <a href="http://del.icio.us/help/tagometer">Del.icio.us</a> or <a href="http://digg.com/tools/count">Digg</a>, so people could easily point to  how they are listed at Techmeme or tell people to see discussions on a topic,  the way <a href="http://technorati.com/developers/help/cosmos.html">Technorati</a> or <a href="http://www.sphere.com/tools">Sphere</a> do?</strong></p>
<p>Given the number of sites I monitor, that would probably only help a small  number of bloggers a small portion of the time. So it&#8217;s probably not the highest  priority feature for me to work on.</p>
<p><strong>Q. How about a way to enter a URL and see if it&#8217;s been on Techmeme, to see  the related posts that are clustered with a main item? Or a way to keyword  search for articles or subjects.</strong></p>
<p>That would certainly be a welcome feature, useful for some.  But I&#8217;m really  inclined to offer something more powerful if I do add search. In the meantime <a style="color: blue; text-decoration: underline;" title="blocked::http://techmeme.com/river" href="http://techmeme.com/river"> Techmeme River</a> [launched in December, more background <a style="color: blue; text-decoration: underline;" title="blocked::http://blog.memeorandum.com/061211/river" href="http://blog.memeorandum.com/061211/river"> here</a>] actually enables search, if what you&#8217;re searching for appeared in the  last 5 days.</p>
<p><strong>Q. Personally, I&#8217;ve also wanted that ability to go back in time easily and  see the &#8220;front pages.&#8221; I know how to do it, as I&#8217;ve <a href="http://daggle.com/060706-201321.html">written about</a>, but I want it  even easier.</strong></p>
<p>I suppose you mean a simple, clickable interface in additional to the &#8220;Page  version&#8221; text input I offer already. I hear a lot of requests for obviously  useful features like that. But considering the cost/benefit, I believe there are  better things for me to tackle for now. I&#8217;ll be sure to write that one down  though!</p>
<p><strong>Q. Is Techmeme an echo chamber, just showing blogs commenting about blogs  commenting about blogs? Does Techmeme feed into that echo chamber? Or how do you  break apart the conversations on a particular topic into sub-conversations or  topics?</strong></p>
<p>Clearly Techmeme creates superficial incentives for &#8220;echo chamber&#8221;  participation, yet I don&#8217;t see clear evidence that this makes things noticeably  worse.  I still like to trot out the example of the day my site launched. eBay&#8217;s  acquisition of Skype <a style="color: blue; text-decoration: underline;" title="blocked::http://www.techmeme.com/050912/h2100" href="http://www.techmeme.com/050912/h2100"> became</a> one of those huge story clusters, and this was hours before Techmeme  [then tech.memeorandum] was publicly launched, i.e. before anyone believed they  could get on the site by linking to stories.</p>
<p>I&#8217;d also point out the idea of many headlines on a single major story is not  a problem in and of itself. Consider that the iPhone unveiling will probably be  one of the major stories for all of 2007. So on one day for it to account for 40  percent of the headlines on Techmeme is not all that out of whack.</p>
<p>A more serious problem is when multiple stories essentially say the same  thing. Ideally no two posts should just be rewrites of the same facts.   Unfortunately it&#8217;s hard to accomplish that with software. Now I do expect to  introduce something that tends to reduce those story clusters in a smarter way,  but it will take some time to do that right.</p>
<p><strong>Q. <a href="http://www.techmeme.com/070109/h1200">This page</a> shows how  the Yahoo purchase of MyBlogLog broke the story up into various clusters. You&#8217;ve  got news of the sale, but also a report on the sale price, Yahoo&#8217;s Jeremy  Zawodny doing a personal welcome. How do you determine which of the many stories  to break out into these sub-clusters?</strong></p>
<p>It&#8217;s driven by what  my software things is the most important story inside of a cluster, then ranked by  what it  believes are the most important stories that discuss that. On that page, it just thinks that  Om Malik&#8217;s post was the second most important [after the headline story from the  CESblog].</p>
<p><strong>Q. Over time, those clusters change. Some stories rise; all of them  eventually fall. How does that work?</strong></p>
<p>All of this derives from this &#8220;importance&#8221; measure I just alluded to.  Importance is determined by a number of factors. Citations can increase  importance, so a post that accumulates inbound links can rise. Time is a factor  as well. A headline that&#8217;s appeared on the page for most of the day loses  importance. Headlines usually fall off the page when the time component swamps  all other factors. That&#8217;s how old news gives way to newer news.</p>
<p><strong>Q. Old meaning how long from when Techmeme first spotted a story, not how  long since the story was actually written, right?</strong></p>
<p>&#8220;Old&#8221; in terms of how long it&#8217;s appeared on Techmeme. Though how old an  article is according to its publication date is also a factor.</p>
<p><strong>Q. Last month, Robert Scoble was <a href="http://scobleizer.com/2006/12/26/techmeme-doesnt-care-if-you-link-to-it/"> explaining</a> how Techmeme doesn&#8217;t pick up a source just because it links to an item  on Techmeme. It picks up a source because other sources on Techmeme link to that  source. That&#8217;s correct, right? To get into Techmeme, you need someone already in  it to link to you &#8212; just linking to them doesn&#8217;t help.</strong></p>
<p>That&#8217;s right. I think Robert recognized that that guy had a Technorati-type  service in mind and believed merely linking out would trigger inclusion. That  is, Technorati includes you in a URL search provided you link to that URL.  Techmeme doesn&#8217;t work that way.</p>
<p><strong>Q. But you did have a <a href="http://blog.memeorandum.com/061115/how-to-show-up">post with tips</a> on getting on Techmeme in November where one of the ways was indeed said to be  linking to you.</strong></p>
<p>Yes, so if a post on your site includes a Techmeme permalink, as opposed to  linking to an article that Techmeme links to, and my system notices a moderate  amount of traffic through that link, and in addition determines your site is not  spammy and the referral is real, your site stands a much better chance of  appearing under the Techmeme &#8220;Discussion&#8221; for the article.</p>
<p>This is a good way for a news organization without RSS feeds to enable an  article to appear on Techmeme. Without inbound links and without an RSS feed,  many article URLs are undiscoverable by my system, except through this  mechanism.</p>
<p><strong>Q. When you launched way back, you had to have seed sites, correct? A  group of sites you started out with. Do you still use that list?</strong></p>
<p>Yes, though the list has been modified since then. Since September 2005, all  of my sites have utilized a source discovery process in which the majority of  monitored sources are found through yet not actually contained in the seeding  set.</p>
<p><strong>Q. How many sources do you monitor?</strong></p>
<p>It&#8217;s in the low thousands. But on any given day, new sources are added and  dropped so the total monitored over time is much larger.</p>
<p>I believe a good automated news site doesn&#8217;t really need to monitor more than  a few thousand sources at a time.  Maybe even a few hundred will suffice.  Of  course the key is the intelligence of the thing that sifts through those  sources.</p>
<p>A good analogy for Techmeme is &#8220;automated blogger&#8221;. And there are indeed  bloggers who largely work like filters or routers who churn through lots of  stuff and only post the most interesting bits. <a style="color: blue; text-decoration: underline;" title="blocked::http://www.instapundit.com/" href="http://www.instapundit.com/"> Instapundit</a> is a classic example. These people can only check a few hundred  blogs daily, even the ones who work extremely efficiently.</p>
<p><strong>Q. I&#8217;m exactly like that. I monitor like 100 to 200 feeds, to build our  search headlines for the <a href="../../searchcap.php"> SearchCap newsletter</a> each day. And if I keep coming across a blog being  mentioned by other blogs with good stuff, I eventually add them to my feeds.</strong></p>
<p>Right. Now Techmeme&#8217;s not as smart as you, but it can run faster, and there&#8217;s  a lot of tedious stuff it can do that you would never want to do. So Techmeme is  kind of like a blogger, but with strengths and weaknesses that go with the  automation.</p>
<p><strong>Q. Why do sources get dropped? Do they fail to post new material? Fail to  keep being cited?</strong></p>
<p>Fail to keep being cited. Every day Techmeme performs a bit of a reset,  usually around 3am Eastern, where it doesn&#8217;t update for about an hour as it  repeats the source discovery. So every day it tries to find the best few  thousand sources. A blog can make the list one day but not the next.</p>
<blockquote><p>[NOTE: I've seen this personally. My personal blog <a href="http://daggle.com/">Daggle</a>, for example. Things that I post there,  I never assume they'll make Techmeme. But when I <a href="http://daggle.com/060829-112950.html">announced</a> I was leaving  Search Engine Watch, suddenly it got on and <a href="http://www.techmeme.com/060829/p28#a060829p28">went to</a> the top of  the page for a bit. Links from elsewhere made it relevant for inclusion.  Occasionally I see it make it <a href="http://www.techmeme.com/061130/p71#a061130p71">for other things</a>.  But with Search Engine Land, we're far more likely to be in on a regular basis,  since I gather we say things that enough other people in Techmeme find  interesting enough to link to regularly.]</p></blockquote>
<p><strong>Q. Is spamming much of an issue for you? Since just linking to a story  doesn&#8217;t get you on Techmeme automatically, seems like that should reduce a lot  of it.</strong></p>
<p>Exactly. I designed the algorithms to discover the most consistently useful  news sources, and as a result spam blogs almost never come up. It certainly  helps that reputable blogs almost never link to spam blogs.</p>
<p><strong>Q. Do you spider everything, the full text of posts. Or do you just read  whatever&#8217;s in a feed.</strong></p>
<p>If my robot sees an indication that there&#8217;s additional text on the article  page, it&#8217;ll spider that page. I need to make it a little smarter so that it  succeeds in this more reliably, but I&#8217;m most of the way there even now.</p>
<p><strong>Q. Do you store the full text over time?</strong></p>
<p>Yes. Though not readily retrievable by all parts of my system, the data is  archived.</p>
<p><strong>Q. Do you depend on feeds to know who is authoring a story?</strong></p>
<p>I utilize both the feed and the main page. I found early on you need to visit  both to accumulate all this metadata.</p>
<p><strong>Q. How&#8217;s the <a href="http://www.techmeme.com/sponsor">sponsored posts  program</a> going since it <a href="http://blog.memeorandum.com/060925/sponsorship_model">launched</a> last  September?</strong></p>
<p>I think it&#8217;s going pretty well. The obvious test being that sponsors are  renewing. I&#8217;m even sold out for the next few months. Now the approach I&#8217;ve taken  tends to limit who can sponsor Techmeme, so I don&#8217;t have a network of thousands  of potential advertisers like Google, but fortunately there are enough sponsors  out there to fill Techmeme&#8217;s inventory.</p>
<p><strong>Q. How the <a href="http://techmeme.com/river">Techmeme River</a> going?  Lots of use?</strong></p>
<p>Not lots, though I wasn&#8217;t expecting lots. The river was intended just to  cover some unusual uses cases. For example, when you need to scan everything  that&#8217;s happened over the past few days, or you want to search through recent  Techmeme posts for certain authors or title keywords.</p>
<p><strong>Q. How about a way to see the top things on Techmeme, the most popular  items rather than the river, that just lists them by date? When I was at <a href="http://popurls.com/">Popurls</a>, I was surprised not to see the top  stories from Techmeme shown right up there with the top picks from Digg,  Delicious, Reddit and others. Perhaps a popular list would help?</strong></p>
<p>What you want is basically the <a style="color: blue; text-decoration: underline;" title="blocked::http://www.techmeme.com/index.xml" href="http://www.techmeme.com/index.xml"> Techmeme RSS feed</a>. It includes the top 20 to 30 items of the day. Perhaps I  could create starting page based only on these posts, but others have  effectively done so already. For example, Original Signal <a style="color: blue; text-decoration: underline;" title="blocked::http://buzz.originalsignal.com/" href="http://buzz.originalsignal.com/"> includes Techmeme</a>.</p>
<p><strong>Q. Is Techmeme too elite with its sources?</strong></p>
<p>Any selection of finite length is necessarily elitist. Now people often  protest about Techmeme&#8217;s elitism, but these people are typically bloggers who  don&#8217;t show up on Techmeme frequently. I don&#8217;t believe I&#8217;ve had anyone ask me  make it less elitist that doesn&#8217;t blog! Yet my readers are largely non-bloggers.</p>
<p>For better or for worse, well-read bloggers tend to have better access to  interesting news, and also tend to exercise the talent that helped establish  them in the first place. I&#8217;m rather unapologetic that there are lots of less  established writers who will never show up on Techmeme.</p>
<p><strong>Q. Do you hand remove or add sources?</strong></p>
<p>If I believe a site is essential enough, I&#8217;ll add it to the seed set, and I  do that occasionally for Techmeme.</p>
<p><strong>Q. Does it make any sense for people to ask you for inclusion or does it  make more sense for them to get the attention of A-listers already in Techmeme?</strong></p>
<p>Getting the attention of bloggers has probably worked for many more people  than has asking me. Trackbacks, emails, and comments can be very effective and  require much less time than the actual reporting and writing. And appealing to  &#8220;A-listers&#8221; isn&#8217;t even necessary. The &#8220;B&#8221; and &#8220;C&#8221; list also wield considerable  power in the Techmeme ecosystem.</p>
<p><strong>Q. How are the other sites doing? [<a href="http://www.wesmirch.com/" target="_self">WeSmirch</a> for gossip, <a href="http://www.memeorandum.com/">M</a><a href="http://www.memeorandum.com/" target="_self">emeorandum</a> for politics, <a href="http://www.ballbug.com/" target="_self">Ballbug</a> for  baseball news -- all use the same Techmeme technology]. Do you have more  planned?</strong></p>
<p>The trending looks good on WeSmirch, the celebrity gossip site, which  apparently has decent word of mouth. Ballbug, the baseball site is clearly the  worst of the bunch trafficwise. The political site, memeorandum, is doing OK but  could do better.</p>
<p>I think I want to further develop the technologies that improve the existing  sites before introducing even more sites. Although 10 minutes of actual labor  can lead to a new vertical, launching something truly compelling will depend on  new technologies. Fortunately, these are the same technologies that will improve  my existing sites.</p>
<p><strong>Q. Ever been on Digg?</strong></p>
<p>Techmeme has never made the front page or any category pages of Digg. I  believe a few links have been submitted, but I suppose they were of little  interest to the Digg community!</p>
<p><strong>Q. Why not sell up or work for a larger company?</strong></p>
<p>I&#8217;m not sure there&#8217;s a larger company that sees eye to eye with me about  where all this is headed or where the value is and isn&#8217;t, to be perfectly vague.  Probably the biggest reason though: I nap after lunch, unconditionally, and most  big companies can&#8217;t accommodate this need nearly as well as my own.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/qa-with-gabe-rivera-creator-of-techmeme-10278/feed</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Q&amp;A Land: New Column From Search Engine Land</title>
		<link>http://searchengineland.com/qa-land-new-column-from-search-engine-land-10275</link>
		<comments>http://searchengineland.com/qa-land-new-column-from-search-engine-land-10275#comments</comments>
		<pubDate>Wed, 17 Jan 2007 12:00:00 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[About Search Engine Land]]></category>
		<category><![CDATA[Columns]]></category>
		<category><![CDATA[Q&A Land]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/qa-land-new-column-from-search-engine-land-10275.php</guid>
		<description><![CDATA[One of my resolutions in starting  Search Engine Land was to spend more time doing Q&#38;As with people. There&#8217;s no  better way to ensure I do that than to start a column. So I&#8217;m happy to announce  that Q&#38;A Land becomes the second  column offered by Search Engine Land.
Unlike Link Week, [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fqa-land-new-column-from-search-engine-land-10275"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fqa-land-new-column-from-search-engine-land-10275" height="61" width="51" /></a></div><p><a href="http://searchengineland.com/guides/qa_land.php"></a>One of my resolutions in starting <a href="../../"> Search Engine Land</a> was to spend more time doing Q&amp;As with people. There&#8217;s no  better way to ensure I do that than to start a column. So I&#8217;m happy to announce  that <a href="../../guides/qa_land.php">Q&amp;A Land</a> becomes the second <a href="../../guides/columns.php"> column offered</a> by Search Engine Land.</p>
<p>Unlike <a href="../../guides/link_week.php">Link Week</a>,  which started this week and runs on Mondays, Q&amp;A Land is a &#8220;floating&#8221; column. It  will happen once per week, but the actual day might change.</p>
<p>I won&#8217;t be doing Q&amp;A Land alone. Chris Sherman will also be writing for it,  and from time-to-time we may have other guest authors. And yes, we&#8217;ll have a  nifty logo coming, as we already have for Link Week.</p>
<p>The first official Q&amp;A Land column is now out, <a href="../../070117-083259.php">Q&amp;A With Gabe Rivera,  Creator Of Techmeme</a>. I say official because I&#8217;ve also now classified my <a href="../../061229-193718.php">Q&amp;A With Jimmy Wales On  Search Wikia</a> article from the end of December as part of Q&amp;A Land as well.  Check them both out!</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/qa-land-new-column-from-search-engine-land-10275/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Q&amp;A With Jimmy Wales On Search Wikia</title>
		<link>http://searchengineland.com/qa-with-jimmy-wales-on-search-wikia-10171</link>
		<comments>http://searchengineland.com/qa-with-jimmy-wales-on-search-wikia-10171#comments</comments>
		<pubDate>Fri, 29 Dec 2006 23:37:18 +0000</pubDate>
		<dc:creator>Danny Sullivan</dc:creator>
				<category><![CDATA[Q&A Land]]></category>
		<category><![CDATA[Search Engines: Search Wikia]]></category>
		<category><![CDATA[Search Features: Query Refinement]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/qa-with-jimmy-wales-on-search-wikia-10171.php</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-jimmy-wales-on-search-wikia-10171"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fqa-with-jimmy-wales-on-search-wikia-10171" height="61" width="51" /></a></div><p><a href="http://searchengineland.com/guides/qa_land.php"></a> News came out earlier this week that Wikipedia founder
<a href="http://en.wikipedia.org/wiki/Jimmy_Wales">Jimmy Wales</a> had a new
project in mind, to build a community-driven &quot;Google-killer&quot; search engine. I&#8217;ve
just finished talking with Jimmy about his plans. Here&#8217;s a rundown on his vision
and what may come as his Search Wikia project grows over the course of the next
year or two. </p>
<p><span id="more-10171"></span></p>
<p>Note that in the Q&amp;A, I&#8217;ve had to recreate my questions as best I remember
asking them. I was focused more on getting down Jimmy&#8217;s responses.</p>
<p><b>Q. Since the news emerged, there&#8217;s been some confusion about Amazon and
Wikipedia in relation to Search Wikia project. What&#8217;s the situation?</b></p>
<p>We recently
<a href="http://www.wikia.com/wiki/Wikia_in_the_news#Amazon_invests_in_Wikia">
completed</a> a funding round with Amazon [for Wikia], but other than that, they
don&#8217;t have anything to do with the search project. [The project] is a
<a href="http://www.wikia.com/wiki/Wikia,_Inc.">Wikia</a> project [the
for-profit company that Wales is chairman of], not a
<a href="http://www.wikipedia.org/">Wikipedia</a> project [the separate
community-driven encyclopedia he co-founded].</p>
<p><b>Q. Was the search project formally announced, or did the
<a href="http://search.wikia.com/wiki/Search_Wikia">Search Wikia site</a> come
online as a result of
<a href="http://business.timesonline.co.uk/article/0,,9075-2517026,00.html">The
Times article</a> discussing it. </b></p>
<p>It was a combination of them both. I&#8217;ve been working on this for a long time.
We didn&#8217;t actually intend to announce per se just yet, but me and my big mouth,
the reporter asked me if I ever thought about search.</p>
<p><b>Q. It&#8217;s been said the search engine would launch in the first quarter of
2007. That&#8217;s fast. Is that really just when you expect active development work
to begin?</b></p>
<p>During Q1, we&#8217;re going to set up a project to get developers involved with
building the site, writing the code and getting the search engine going. We&#8217;re
going to rely initially with <a href="http://lucene.apache.org/nutch/">Nutch</a>
and <a href="http://lucene.apache.org/nutch/">Lucene</a> [related open-source
search software that's been developed over the past few years].</p>
<p>We&#8217;ll start from scratch on how to apply the Wikipedia principles to keep it
as simple as possible and move forward.</p>
<p>It&#8217;s just the development starting. We&#8217;re not producing a Google killing
search engine in three months. I only wish I were that good of a programmer.</p>
<p>We&#8217;ll have some servers open, some development, maybe a pre-pre-alpha demo
site up. We&#8217;d really anticipate it would be a year or two until we&#8217;re able to
launch a viable search engine.</p>
<p><b>Q. How do you see this improving on what&#8217;s out there?</b></p>
<p>There are a lot of things that we&#8217;ve learned in the wiki world on how to get
communities involved and engaged to build trusted networks in communities.</p>
<p>A lot of the people who have tried to do this in the past have stumbled not
on technical issues but on community issues &#8230; dmoz [<a href="http://dmoz.org/">The
Open Directory</a>] was too closed &#8230; that was their response because of the
pressure of spammers &#8230; others have thought in terms of ranking algorithms.
That&#8217;s not the right approach. The right approach allows for open dialog and
debate and discussion.</p>
<p><b>Q. How do you envision the community participating? Will they be selecting
sites? Will this leverage material in Wikipedia? Will they rate sites?</b></p>
<p>This will be completely independent of Wikipedia.</p>
<p>Exactly how people can be involved is not yet certain. If I had to speculate
about it, I would say it&#8217;s several of those things, not just community involved
with rating URLs but also community rating for whole web sites, what to include
or not to include and also the whole algorithm &#8230; That&#8217;s a human type process
that we can empower people to guide the spider</p>
<p><b>Q. Do you see humans reviewing the most popular queries, perhaps picking
the right answers to come up?</b></p>
<p>Part of it might be a human review of queries. For the narrow subset of the
really popular queries, I think it&#8217;s important to apply humans &#8230;. if someone
types Ford Motor Company, there is a correct answer for that. There&#8217;s no reason
to beat our brains out to train our algorithm to do that.</p>
<p><b>Q. Search engines have actually gotten much better over time with these
type of navigational requests. You don&#8217;t need humans so much to make sure the
right answer shows up.</b></p>
<p>Those kinds are not too difficult. The harder one if you type ford, did you
mean President Ford or do you mean the Ford Motor Company? That&#8217;s the type of
thing where human disambiguation pages
<a href="http://en.wikipedia.org/wiki/Ford_(disambiguation)">like we have</a> at
Wikipedia are helpful.</p>
<p><b>Q. Search engines already do a lot of this type of stuff. Ask
<a href="http://www.ask.com/web?q=ford&#038;qsrc=0&#038;o=312&#038;l=dir">has</a> its Zoom
suggestions, others have clusterings or related searches. Do you imagine people
being forced to make a query refinement choice before they actually get search
results?</b></p>
<p>If you type ford, you should get some disambiguation terms that humans have
collected, then some search results&#8230;.this is one of the places where I think
human intelligence is most important</p>
<p>[NOTE: For more on query refinement, see some of my past posts such as
<a href="http://blog.searchenginewatch.com/blog/051003-124141">Robert Scoble
Wants What We Had -- Better Query Refinement. So Do I!</a>,&nbsp;
<a href="http://blog.searchenginewatch.com/blog/061005-095006">Hello Natural
Language Search, My Old Over-Hyped Search Friend</a> and
<a href="http://daggle.com/060919-204304.html">Why Search Sucks &amp; You Won't Fix
It The Way You Think</a>. The first link in particular discusses how Microsoft
used to have disambiguation created by editors very similar to what Wales hopes
to recreate. Sadly, it was killed in the quest to chase Google on the
algorithmic front.]</p>
<p><b>Q. Are you planning to crawl the entire web, billions and billions of
pages? Or will you go after a subset of important ones?</b></p>
<p>The number of pages is yet to be determined. Obviously we won&#8217;t be doing that
initially [gathering everything], but we&#8217;ll invest in the hardware. Not to
belittle the investment required to do a full crawl of the web on a regular
basis, but I think it&#8217;s a fairly commoditized.</p>
<p><b>Q. Crawling is one thing. Serving up millions of queries per day is an
entire other issue. Wikipedia handles a lot of traffic, but not at a Google
scale. How&#8217;s it going with that?</b></p>
<p>The traffic&#8217;s not too bad. Servers are getting more and more powerful.
Bandwidth is getting cheaper. It&#8217;s all pretty much off the shelf. It&#8217;s pretty
efficient.</p>
<p><b>Q. Will you be selling ads, and if so, how will that work?</b></p>
<p>There are no immediate plan to sell ads, so for now we&#8217;re not too focused on
that. If we don&#8217;t build something useful, selling ads on it is sort of a moot
point.</p>
<p><b>Q. Why do this at all? What do you see wrong with search?</b></p>
<p>For certain types of searches, search engines are very good. But I still see
major failures, where they aren&#8217;t delivering useful results. I think at a deeper
almost political level, I think it&#8217;s important that we as a global society have
some transparency in search. What are the algorithms involved? What are the
reasons why one site comes up over another one. [Wales also raised the issue of
how ads might influence regular listings, perhaps search engines trying to keep
commercial sites out of the free listings to make money. From there, he went
on....] Those types of incentives are problematic in search. The only solution I
know to that is to be transparent</p>
<p><b>Q. How are you going to keep the community from being gamed. Wikipedia is
very good at keeping out spam, but it&#8217;s not perfect. And despite its size, it&#8217;s
dealing with far fewer topics than unique searches that will happen on any
particular day. How do you police all those searches?</b></p>
<p>You have to recognize the difference between the way community is often used
on the internet, which is short hand for millions of people clicking on some
stuff as compared to community in the wiki world, which is people who actually
know each other.</p>
<p>It&#8217;s one thing to say if you have millions of spammers out there trying to
game and trick an algorithm &#8230;. but it&#8217;s not the number of queries. it&#8217;s the
web sites themselves. A lot of numbers are thrown about for sites on the web,
but the number of legitimate pages that are not coming from affiliate sites and
spammers is a much more finite number. It&#8217;s much easier for a community to ban
the bad stuff.</p>
<p><b>Q. But what if someone gets into a &quot;good&quot; domain. We&#8217;ve had cases where
bad content gets shoved into &quot;trusted&quot; sites or even places like university
sites. Do you ban those entire domains? How do they get back in?</b></p>
<p>At Wikipedia, we&#8217;d have a big discussion. [Wales then explained that people
might realize a domain had done something accidentally wrong or without thinking
about spam issues and so might be allowed back in.]</p>
<p><b>Q. You probably already search a lot, probably mostly with Google. Is it
not finding what you want already most of the time, without a flood of spam or
crud in your way?</b></p>
<p>Usually I&#8217;m looking for pages on Wikipedia, so they do a good job with that.
It depends on the types of searches you are doing. If you&#8217;re doing a factual
search, then Wikipedia [in the results] would be good. In other areas, I think
there&#8217;s a strong commercial incentive. Why is it bad if I search for tampa
hotels?</p>
<p>[NOTE: I then did <a href="http://www.google.com/search?q=tampa hotels">this
search</a> on Google, which we discussed. I noted I saw plenty of good hotels
listed, and that if I clicked through to the local
<a href="http://www.google.com/maps?hl=en&#038;lr=&#038;q=hotels&#038;near=Tampa,+FL&#038;sa=X&#038;oi=local&#038;ct=title">
search results</a>, I got an even better experience of hotels listed. </p>
<p>Wales replied that he's often after reviews of hotels, not the hotels
themselves. That took me back to the original results, where I pointed out the
top listing was from TripAdvisor, exactly the type of review site he mentioned
liking -- and that I often found them listed on these types of queries. </p>
<p>I also noted that Google even offers refinement categories at the top of the
page similar to the disambiguation he wanted, with
<a href="http://www.google.com/search?hl=en&#038;lr=&#038;q=tampa+more:lodging_guides&#038;cx=destination_guides&#038;sa=N&#038;oi=cooptsr&#038;resnum=0&#038;ct=col1&#038;cd=2">
<font size="-1">l</font></a><font size="-1"><a href="http://www.google.com/search?hl=en&#038;lr=&#038;q=tampa+more:lodging_guides&#038;cx=destination_guides&#038;sa=N&#038;oi=cooptsr&#038;resnum=0&#038;ct=col1&#038;cd=2">odging
guides</a> as one of the categories. Unfortunately for Google, I didn't find
that the results from that refinement did a good job bringing back trusted hotel
guides]</font></p>
<p><b>Q. Back to transparency. People keep saying they want more of this. But
can you name some exact examples of what you want to see? Do you want Google to
say that using a term in bold text adds X percent of a score to the ranking
criteria? And if you do that, don&#8217;t you think spammers will just abuse the
recipe that&#8217;s been published?</b></p>
<p>If your search relies on some secret factors that you hope people won&#8217;t
discover, you haven&#8217;t really come up with a good solution the problem.</p>
<p><b>Q. Microsoft has spent millions of dollars and years now of effort to try
and be a Google killer and haven&#8217;t made it. You&#8217;re coming into this fresh with
fewer resources and no real prior experience. Can you really do it?</b></p>
<p>I have no idea. I only do whatever sounds like it is fun.</p>
<p><b>Q. What type of funding do you have behind this?</b></p>
<p>Wikia&#8217;s initial round was 4 million from a variety of angels, then there was
second round from Amazon, but the amount wasn&#8217;t announced.</p>
<p><b>Closing Comments</b></p>
<p>When I first heard of the plans, I was pretty dubious the project would have
much success. For one thing, the idea of the &quot;open source&quot; search engine to take
on the world and provide more transparency is old news. Consider this from back
when Nutch first came out,
<a href="http://www.newscientist.com/article/mg18024200.200-search-angst.html">
out of</a> New Scientist in 2003:</p>
<blockquote>
<p>The project &quot;is about providing free technology that should not be controlled
by private, commercial, secretive organisations,&quot; says Doug Cuttings, veteran
web search engineer, and a Nutch founder.</p>
</blockquote>
<p>Three years on, nothing really changed despite the reasoning behind such a
project being the same. And this was despite Nutch
<a href="http://searchenginewatch.com/searchday/article.php/3071971">having</a>
some big names behind it.</p>
<p>In 2004, Nutch got another round of attention in an ACM
<a href="http://www.acmqueue.com/modules.php?name=Content&#038;pa=showpage&#038;pid=144">
article</a> looking at how it works. My comment at that time was:</p>
<blockquote>
<p><font face="Verdana, Helvetica, sans-serif">Interesting read especially for
the efforts that are involved to defeat spam. The argument is that though Nutch
is open, revealing secrets won&#8217;t hurt because spammers will batter down any
defenses, no matter how tightly protected. OK, so what will stop spam? Nutch
hopes that an open, public discussion may reveal new methods. Perhaps. But the
real test will only come if Nutch is deployed by a major, highly-trafficked
site. Spammers aren&#8217;t going to bother trying the defenses of other places. It&#8217;s
not worth the time. That&#8217;s also a positive for those considering Nutch. If you
operate a small, vertical site or just want Nutch to be used on your own
content, then spam concerns are much less an issue.</font></p>
</blockquote>
<p>The spam test simply hasn&#8217;t happened with Nutch. And every new search engine
project I&#8217;ve looked at coming in over the years completely underestimates the
spam problem they face. When I looked at the Search Wikia site, comments
<a href="http://search.wikia.com/index.php?title=Forum:Graphical_Mock-Up&#038;t=20061228150946">
like this</a> almost seemed laughable:</p>
<blockquote>
<p>search active for spammer sites </p>
</blockquote>
<ul>
<li>trying to simulate user-typos (ie. &quot;yaoho.com&quot; rather than &quot;yahoo.com&quot;);
<i>see also:</i> Microsoft&#8217;s
<a href="http://research.microsoft.com/URLTracer/" class="external text" title="http://research.microsoft.com/URLTracer/" rel="nofollow">
URL Tracer</a> <br />
&nbsp;</li>
<li>blacklist domains, where spammails are linking to; create actively
honeypods to get spam; use a pattern like <code>
&lt;domain-where-we-have-registered&gt;@myhoneypod.com</code> to identify the spam
networks; shell the common user get the possibility to register such a mail-adress?
</li>
</ul>
<p>Seek out the spam sites? Hey, don&#8217;t worry &#8212; if you&#8217;re popular, they&#8217;ll find
you fast enough. And as you blacklist one, two more throwaway domains will show
up in their place.</p>
<p>I also tend to think Wales is completely underestimating how crawling a big
chunk of the web, keeping those pages fresh, ranking them quickly to provide
answers and doing so for millions each day isn&#8217;t an off-the-shelf commodity.</p>
<p>Still, I find myself oddly hopeful. I don&#8217;t think a Google killer will
emerge, but perhaps some new ways of a community to be involved with search will
come out of it. I wouldn&#8217;t have thought Wikipedia would work. Certainly it&#8217;s
flawed, but it&#8217;s also an incredible resource. Maybe something useful will come
from the Search Wikia project. </p>
<p>At the very least, I&#8217;ve long wanted humans to be back in the role of
reviewing queries and actually looking to see if they make sense, rather than so
much reliance on algorithms. Maybe the mere concept of the Search Wikia project
will encourage the major search engines to do more in this area.</p>
<p><strong>Postscript:</strong> Originally I had Wales listed as cofounder of Wikipedia, but he got in touch saying he was the founder. I&#8217;d noted that Wikipedia itself <a href="http://en.wikipedia.org/wiki/Wikipedia">lists</a> him as founding (well, creating) it with Larry Sanger. Is Wikipedia incorrect on this, I asked? &#8220;Yes, it is wrong,&#8221; he emailed. Sanger posts his own views on the origins <a href="http://en.wikipedia.org/w/index.php?title=User:Larry_Sanger/Origins_of_Wikipedia&#038;oldid=39843351">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/qa-with-jimmy-wales-on-search-wikia-10171/feed</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
	</channel>
</rss>
