Oh Good Grep! Web Grepper: A New Web Intelligence Feature From Blekko

Blekko, the search engine that launched last year and is known for its spam reduction efforts, is introducing a new feature today that will help make previously difficult, if not impossible to find (for free), web data accessible to more searchers, marketers, media, and others. Say hello to Web Grepper. More about the name in […]

Chat with SearchBot

Blekko LogoBlekko, the search engine that launched last year and is known for its spam reduction efforts, is introducing a new feature today that will help make previously difficult, if not impossible to find (for free), web data accessible to more searchers, marketers, media, and others.

Say hello to Web Grepper. More about the name in a moment.

What’s a Web Grepper?

Web Grepper allows users to run searches on the ENTIRE Blekko index (more than 20 billion URLs according to a recent presentation) to reveal data that you could not search and find using keywords. In other words, you can also search material in the actual HTML coding of a web page.

It’s another tool for your web intelligence toolbox.

Here are a few examples of Web Greps that Blekko has run during the testing:

1. The number of sites utilizing Facebook Comments vs. sites using Disqus

2. The number of urls and domains using nofollow tags

3. Sites with rickroll links

4. Determining which type of feed is more popular, ATOM or RSS? (see image below; click for larger version)

blekko-grep

Given the increasing amount of data included in the coding of a web page these days the possibilities are many. Plus, having this data and being able to see trends over time will also be both interesting and helpful.

How Can You Use Web Grepper

Simply head to the Blekko homepage and enter the slashtag /webgrep into the search box.  Next, click the request a report button and enter what you would like to have grepped. Next
, Blekko’s community of  users vote on what greps to run. Blekko will run reports twice a day, Monday-Friday.

Blekko also said they’ll be reviewing reports to make sure that Web Grepper is not be used as a hacking tool to obtain personal and private info.

The Name Web Grepper

Web Grepper is based on a command-line text matching search tool originally developed for UNIX named grep.  For more background here’s the Wikipedia entry.

Disclosure: I have a small amount of Blekko stock options that I received after doing some consulting for Blekko in 2009. I have not consulted for the company since that time.


Opinions expressed in this article are those of the guest author and not necessarily Search Engine Land. Staff authors are listed here.


About the author

Gary Price
Contributor
Gary Price is a librarian, author, and an online information analyst based in suburban Washington, DC. He is the co-founder and co-editor of INFOdocket and FullTextReports.com and prior to that was founder/editor of ResourceShelf and DocuTicker for 10 years. He has worked for Blekko, Ask.com, and at Search Engine Watch where he was news editor. In 2001, Price was the co-author (with Chris Sherman) of the best-selling book The Invisible Web.

Get the must-read newsletter for search marketers.