30% Of Results For Some Competitive Searches Found To Be Spam

Researchers Track Down a Plague of Fake Web Pages from the NY Times reports on a recent paper from Microsoft Research released named “Spam Double-Funnel: Connecting Web Spammers with Advertisers” [PDF download] (also note that Gary linked to it Friday).

The report categorizes search results spam by industry category, showing that some search categories have a 30% or more rate of spam. Here is a chart covering various :

Microsoft Spam  - Spammer Targeted Categories

Read this from section 4.0:

In late September 2006, we submitted the 1,000 keywords to the Search Ranger system, which retrieved the top-50 results from all three major search engines. In total, we collected 101,585 unique URLs from 1,000x50x3=150,000 search results. With a set of approximately 500 known-spammer redirection domains and AdSense IDs at that time, the system identified 12,635 unique spam URLs, which accounted for 11.6% of all the top-50 appearances. (The actual redirection-spam density should be higher because some of the doorway pages had been deactivated, which were no longer causing URL redirections when we scanned which were no longer causing URL redirections when we scanned them.)

The NY Times summarizes the paper saying they “discovered that the average spam density — a measure of the percentage of Web pages that contain only advertisements — was 11 percent for 1,000 keywords they used in their research.”

Here are some other references for you:

- Strider URL Tracer with Typo-Patrol from Microsoft Research - Strider Typo-Patrol from Microsoft Research - Typo Domain Spotting Tool & Domain Registration Stats from SEW Blog - Google AdSense For Domains Program Overdue For Reform — And Yahoo & Microsoft Should Also Take Note from SEW Blog - MS Research: Typo-Squatters Are Gaming Google from eWeek

Related Topics: Channel: SEO | SEO: Spamming | Stats: General | Stats: Relevancy


About The Author: is Search Engine Land's News Editor and owns RustyBrick, a NY based web consulting firm. He also runs Search Engine Roundtable, a popular search blog on very advanced SEM topics. Barry's personal blog is named Cartoon Barry and he can be followed on Twitter here. For more background information on Barry, see his full bio over here.

Connect with the author via: Email | Twitter | Google+ | LinkedIn


SMX - Search Marketing Expo

SearchCap:

Get all the top search stories emailed daily!  

Like This Story? Please Share!

Other ways to share:

Like Our Site? Follow Us!

Subscribe to Our Feed! Join our LinkedIn Group Check out our Tumblr! See us on Pinterest Get Search Engine Land on your mobile device!
 

Read before commenting! We welcome constructive comments and allow any that meet our common sense criteria. This means being respectful and polite to others. It means providing helpful information that contributes to a story or discussion. It means leaving links only that substantially add further to a discussion. Comments using foul language, being disrespectful to others or otherwise violating what we believe are common sense standards of discussion will be deleted. Comments may also be removed if they are posted from anonymous accounts. You can read more about our comments policy here.

Comments are closed.

Get Our News, Everywhere!

 
  • Advertise With Us
 

Click to watch SMX conference video

Join us at an upcoming SMX event:

North America

EMEA

APAC

Search Engine Land produces SMX, the Search Marketing Expo conference series. SMX events deliver the most comprehensive educational and networking experiences - whether you're just starting in search marketing or you're a seasoned expert.

SMX Site » | SMX Difference » | SMX News »




 

Search Engine Land Periodic Table of SEO Ranking Factors

Get Your Copy
Read The Full SEO Guide