<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>searchengineland.com &#187; Search On Search</title>
	<atom:link href="http://searchengineland.com/library/columns/search-on-search/feed" rel="self" type="application/rss+xml" />
	<link>http://searchengineland.com</link>
	<description>Search Engine Land: Must Read News About Search Marketing &#38; Search Engines</description>
	<lastBuildDate>Sun, 22 Nov 2009 16:39:14 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Of Permanent Value: Archiving The Web</title>
		<link>http://searchengineland.com/of-permanent-value-archiving-the-web-11764</link>
		<comments>http://searchengineland.com/of-permanent-value-archiving-the-web-11764#comments</comments>
		<pubDate>Mon, 23 Jul 2007 18:46:39 +0000</pubDate>
		<dc:creator>Gary Price</dc:creator>
				<category><![CDATA[Search Engines: Academic Search Engines]]></category>
		<category><![CDATA[Search Engines: Other Search Engines]]></category>
		<category><![CDATA[Search On Search]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/of-permanent-value-archiving-the-web-11764.php</guid>
		<description><![CDATA[
 I love working for Ask.com as Director of Online Information Resources and also compiling and editing ResourceShelf and DocuTicker.
Yes, it&#8217;s a busy life but I&#8217;m very fortunate to do what I love and even get paid for it. The challenge, as least as I see it, is writing on something of interest for Search [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fof-permanent-value-archiving-the-web-11764"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fof-permanent-value-archiving-the-web-11764" height="61" width="51" /></a></div><p><a href="http://searchengineland.com/lands/search-on-search.php">
<img border="0" src="http://searchengineland.com/images/searchonsearch100.jpg" alt="Search On Search - A Column From Search Engine Land" align="left" hspace="5" vspace="3" width="100" height="100"></a> I love working for Ask.com as Director of Online Information Resources and also compiling and editing <a href="http://ResourceShelf.com">ResourceShelf</a> and <a href="http://DocuTicker.com">DocuTicker</a>.</p>
<p>Yes, it&#8217;s a busy life but I&#8217;m very fortunate to do what I love and even get paid for it. The challenge, as least as I see it, is writing on something of interest for Search Engine Land and not worrying about conflicts of interest with every sentence I write.</p>
<p>Good news: I have found a topic that not only interests me but grows in significance for all of us as each day and each version of a web page passes: The importance of making web content more permanent.  It&#8217;s crucial for historical purposes for web content to become less ephemeral.</p>
<p><span id="more-11764"></span>
It&#8217;s my goal in this series of articles to keep you posted on some of the major web archiving initiatives, databases, research and services, while at the same time offering quick peeks at tools you can use to save web pages and other forms of electronic content on your own. Naturally, awareness of copyright is key.</p>
<p>There is a lot going on all over the world and I will do my best to offer you introductions to many digital preservation initiatives, along with the research from universities and organizations engaged in collecting and storing online content.</p>
<p>So, where do we begin?</p>
<p>Many people know about <a href="http://www.archive.org">The Internet Archive</a>, based at the Presidio in San Francisco and home to The Wayback Machine. But many people aren&#8217;t aware of numerous additional projects (archiving, digitizing, preservation) that the Internet Archive, under the leadership of Brewster Kahle, is involved in.</p>
<p>One is a service the Internet Archive offers for a growing number of institutional clients, named <a href="http://www.archive-it.com">Archive-It</a>.</p>
<p>In a nutshell, this subscription service allows an organization to use an application that includes crawling, recrawling and data hosting services.</p>
<p>From the web site: 
<blockquote>Internet Archive&#8217;s subscription service, Archive-It, allows institutions to build, manage and search their own web archive through a user friendly web application, without requiring any technical expertise or hosting facilities.</p>
<p>Subscribers can capture, catalog, and archive their institution&#8217;s own web site or build collections from the web, and then search and browse the collection when complete.</p></blockquote>
<p>The collections are then made public (unless a user decides to keep them private) via the Archive-It web site. At last count, Archive-It was permanently archiving more than 135 million pages in nearly 300 collections.</p>
<p>For those interested, Archive-It <a href="http://www.archive-it.org/public/contact-us.html">regularly offers webinars</a> explaining their services.</p>
<p><a href="http://www.archive-it.org/public/largest_all.html">This page</a> offers direct links to all of Archive-It collections. In recent weeks, the collection <a href="http://www.archive-it.org/public/all_collections.html"> has seen many new collections</a> added to the service</p>
<p>A few of the most interesting collections include:</p>
<ul>
<li><a href="http://www.archive-it.org/collections/649">Tragedy at Virginia Tech</a> A collection of web pages from the University and elsewhere immediately following the tragedy.</li>
<li><a href="http://www.archive-it.org/collections/657">California High Speed Rail Authority</a></li>
<li><a href="http://www.archive-it.org/collections/660">Orange County California Web Sites</a></li>
<li><a href="http://www.archive-it.org/collections/176">Latin American Government Documents Archive, (University of Texas)</a></li>
<li><a href="http://www.archive-it.org/collections/227">Canadian Political Parties and Political Interest Groups</a></li>
</ul>
<p>It&#8217;s also worth noting that <i>unlike</i> the tens of millions of archived pages accessible via The Wayback Machine which cannot be keyword searched, pages archived using the <a href="http://www.archive-it.org/public/faq.html#506">Archive-It service</a> can be searched using keywords.</p>
<p>In an upcoming article I will take a look at two massive web archives that combine the best of both the National Archives of the United States and The Internet Archive. They are named <a href="http://www.webharvest.gov">Web Harvest Presidential Term 2004 and Web Harvest 109th Congress (2006)</a>. Between them they contain terabytes of archived U.S. Government web data.</p>
<p><i>Gary Price is Director of Online Information Resources for Ask.com and also editor of <a href="http://ResourceShelf.com">ResourceShelf</a> and <a href="http://DocuTicker.com">DocuTicker</a>. The <a href="http://searchengineland.com/lands/search-on-search.php">Search On Search</a> column, written by employees of major search engines, appears periodically at <a href="http://searchengineland.com">Search Engine Land</a>.</i></p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/of-permanent-value-archiving-the-web-11764/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Search On Search: New Column From Search Engine Land</title>
		<link>http://searchengineland.com/search-on-search-new-column-from-search-engine-land-11763</link>
		<comments>http://searchengineland.com/search-on-search-new-column-from-search-engine-land-11763#comments</comments>
		<pubDate>Mon, 23 Jul 2007 17:49:24 +0000</pubDate>
		<dc:creator>Chris Sherman</dc:creator>
				<category><![CDATA[About Search Engine Land]]></category>
		<category><![CDATA[Search On Search]]></category>

		<guid isPermaLink="false">http://searchengineland.com/beta/search-on-search-new-column-from-search-engine-land-11763.php</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fsearchengineland.com%2Fsearch-on-search-new-column-from-search-engine-land-11763"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fsearchengineland.com%2Fsearch-on-search-new-column-from-search-engine-land-11763" height="61" width="51" /></a></div><p><a href="http://searchengineland.com/lands/search-on-search.php">
<img border="0" src="http://searchengineland.com/images/searchonsearch100.jpg" alt="Search On Search - A Column From Search Engine Land" align="left" hspace="5" vspace="3" width="100" height="100"></a> Our newest <a href="http://searchengineland.com">Search Engine Land</a> column, Search On Search, launches today. <a href="http://searchengineland.com/lands/search-on-search.php">Search On Search</a> is a column written by employees of search engines. Columnists are free to discuss technical details of how their search engine works or write opinion pieces that may or may not reflect the official position of their employer.  The <a href="http://searchengineland.com/lands/search-on-search.php">Search On Search</a> column will appear periodically at <a href="http://searchengineland.com">Search Engine Land</a>.</p>
<p>In today&#8217;s debut article, Ask.com&#8217;s Director of Online Information Resources, Gary Price, talks about the increasing importance of archiving digital content. Price argues that not enough is currently being done to preserve the ephemeral content posted to the web and elsewhere. He also discusses a service that makes it easy for organizations to create and maintain searchable archives of content at relatively low cost. Read on for Gary&#8217;s insights in <a href="http://searchengineland.com/070723-144639.php">Of Permanent Value: Archiving The Web</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://searchengineland.com/search-on-search-new-column-from-search-engine-land-11763/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
