How To Save URLs To The Wayback Machine On Demand

Yesterday, Search Engine Land featured a post about the Internet Archive's Wayback Machine now providing access to more than 440 billion archived web pages back to 1996. As I've written about before on Search Engine Land and my infoDOCKET blog, the Wayback Machine is an absolutely essential resource for just about every web user. If you've been using Wayback since it debuted, you probably remember that the lag time for material (new pages and updates) to become accessible used to be six months or longer. However, in the past couple of years this has also dramatically improved. In F [...]

Wayback Machine Adds 160 Billion Indexed Pages In A Year, Surpasses 400 Billion Indexed Pages

The Internet Archive announced that the Wayback Machine, a huge internet archive of web pages dating back to 1996, has surpassed 400 billion pages indexed. In January 2013, a little over a year ago, the Wayback Machine said they had 240 billion URLs indexed and since then, they have added another 160 billion URLs! That brings the Wayback Machine's indexed page count to over 400 billion URLs. On Friday, the Internet Archive announced this on their blog and said the indexed pages range from late 1996 up until a few hours ago. Then they shared some of their history: 2001 – [...]

Bing For Schools Offers Ad-Free Search Experience, SafeSearch Filtered Content & Lesson Plans

Bing announced Bing For Schools today, a new program designed for K-12 US schools, scheduled to launch later this year. In an effort to help schools teach digital literacy skills, Bing For Schools will offer an ad-free search experience with SafeSearch settings to filter adult content and lesson plans attached to homepage images. The Bing For Schools program will be free to any school that wishes to participate. Once a school opts in, Bing will enable the specialized search features across all searches from within the school's network. Bing says schools that use the program will [...]

Search Engine Designed By Denmark Researchers Helps Medics Diagnose Rare Diseases

Technical University of Denmark researchers have designed a search engine that indexes specific sets of databases to help identify rare diseases. Users can query patient symptoms on the search engine, which crawls more than 31,000 medical articles focused on rare and genetic diseases to deliver medically relevant search results. While the search engine's homepage includes a notification stating that the site is a research project and should only be used by medical professionals, the developers boast that it delivers significantly better results than Google when searching for rare disease content. [...]

Wayback Machine Now Has 240 Billion URLs

The Wayback Machine from the Internet Archive, one of the most useful and important Internet research tools, recently reached a major milestone. In a blog post, archive founder Brewster Kahle announced that The Wayback Machine now provides access to an index containing more than 240 billion URLs (about five petabytes of data), with archived pages dating back to 1996. The amount of newly accessible archived material is huge. Prior to this update, The Wayback Machine provided access to about 150 billion URLs. Researchers should note that a small amount of the index available in the prior r [...]

DuckDuckGo Adds Zero-Click Info From Zanran

DuckDuckGo has added new "zero click" information from Zanran to its search results, giving users quicker access to some of the deeper web content that Zanran offers. You may remember Zanran from my profile last year, Zanran: New Search Engine That Unearths Data In Charts, Graphs & Tables -- it's essentially a search engine for data that often gets lost in PDFs, Excel documents and even images. The new integration into DuckDuckGo is about a week old, and can be seen on queries like [aircraft production brazil], [adult obesity prevalence in indiana] and [child poverty uk 2012]. [...]

Academic, But Valuable: A Free Alert Service

JournalTOCS, sponsored by Heriot-Watt University in Edinburgh, Scotland, is a free service that has been online for many years and provides alerts for more than 17,000 journals published by more than 900 publishers and organizations. Almost 3,000 of these publications are open access, so the full text is quickly and directly accessible on the web for free. For many years, professionals, academics, students and others have used services that alerted them when new issues of scholarly/academic (peer-reviewed) publications were published or about to be published. If you’re not a research [...]

Zanran: New Search Engine That Unearths Data In Charts, Graphs & Tables

A few days ago, I was looking for a very specific piece of data: the average age of professors in the US. I did a Google search for "average age of us professors" and, after the obligatory and unhelpful Wikipedia result at the top, found some data that was good enough further down the search results. Google (and other search engines) does pretty well with some data-based searches, especially since many of us have trained ourselves how to phrase a query to get the info we want. But what about when the data we want isn't found in text, but is likely to be found in graphs, charts and tables [...]

Searching For Old Versions of Web Sites? The Wayback Machine Is New and Improved

The Internet Archive was founded in 1996 to "build an internet library" and offers a searchable set of digital collections. Internet Archive's Wayback Machine (launched in 2001) has archived web pages going back to 1996, enabling historical snapshots of the web. As ReadWriteWeb notes, the Wayback Machine has just been redesigned and relaunched with a nicer user interface, and more importantly, has been migrated to a new platform as a foundation for further improvements. (The new beta has been launched under a new domain:; the original version is still available at arch [...]

Google Launches Public Data Search & Charts During Wolfram Alpha Demo

During the Wolfram Alpha demonstration today, the Google Blog announced the launch of their own structured data search feature that allows you to find and compare public data. Searches such as [unemployment rate new york] will now bring up a quick answer that looks like this: Clicking on the result will take you to a more detailed graph with ways to plot other data on this chart. Below is me plotting Rockland County, New York, compared to New York State and also adding California: Google promises to add public data for answers on prices of cookies, CO2 emissions, asthma frequenc [...]

Microsoft Kills Encyclopedia Encarta

Ars Technica reports that Microsoft has decided to discontinue their encyclopedia software, Encarta. Both the MSN Encarta reference Web sites as well as its Encarta software will be shut down. Microsoft said: On October 31, 2009, MSN® Encarta® Web sites worldwide will be discontinued, with the exception of Encarta Japan, which will be discontinued on December 31, 2009. Additionally, Microsoft will cease to sell Microsoft Student and Encarta Premium software products worldwide by June 2009. We understand that Encarta users may have questions regarding this announcement so we have prepare [...]

BioNumbers – Specialty Biology Answer Search Engine

Today, I met with Dr. Ron Milo who works in the field of plant sciences at Weizmann Institute in Israel, who showed me a specialty search engine project he built from the ground up named BioNumbers. In short, BioNumbers is a collection of common biological numbers that is useful to one's research. For example, the average diameter of a protein is 5 nanometers. This week, I am on a special trip known as the Blogger Delegation to Israel. We are being taken around by the Israeli Consulate to learn about Israel and the people who live here. Clearly, my focus is on search - so when I went to the [...]

Of Permanent Value: Archiving The Web

I love working for as Director of Online Information Resources and also compiling and editing ResourceShelf and DocuTicker. Yes, it's a busy life but I'm very fortunate to do what I love and even get paid for it. The challenge, at least as I see it, is writing on something of interest for Search Engine Land and not worrying about conflicts of interest with every sentence I write. Good news: I have found a topic that not only interests me but grows in significance for all of us as each day and each version of a web page passes: The importance of making web content more permanen [...]

Scholarly Science Search Tools Explored

Gary Price has compiled a comprehensive list of science search tools that are available via the internet. He explored several science search tools including Scirus, Live Search Academic, Google Scholar, Scitopia, Global Science Gateway and others. [...]

TimeSearch: Searching Through History

TimeSearch is an interestingly different search engine, since the focus is just on time, as one may expect from the name. It's the brainchild of Bamber Gascoigne (a well-known British television presenter of historical programmes and academic quiz shows) and HistoryWorld. The search options are fairly limited: geographic areas of the world and themes (such as art, politics, science and so on), with the option of further limiting to specific sites such as the British Museum, Museum of Fine Arts Boston and so on. The searcher then pops into the search box the year they are interested in and T [...]

UFO Crawler: The Truth Is Out There & Searchable

IBM and Yahoo teamed up to bring you UFOCrawler, a search engine that is about finding sources on "UFO Sightings, time travel, conspiracy theories and anomalies." For example, a search on area 51 returns 7,904 sources; unfortunately, some of the results do not look all that great. It is important to note that all the other tabbed searches (i.e., Web, Images, Video, Audio, Directory, Local, News) take you to Yahoo, while the Enterprise tab takes you to UFOCrawler results. This is powered by IBM OmniFind Yahoo! Edition enterprise and brought to you by the Anomalies Network. [...]

Microsoft Releases Live Search Books Beta

The Live Search Blog announced that Microsoft released a beta named Live Search Books this morning. Plus they enhanced Live Search Academic by adding millions of new articles, plus indexing theses, dissertations, and books within these disciplines. The details on Live Search Books Beta: The U.S. beta launch of Live Search Books is a big step forward in advancing the way people discover information through the integration of content that has been "off-limits [...]
