30% Of Results For Some Competitive Searches Found To Be Spam

Researchers Track Down a Plague of Fake Web Pages from the NY Times reports on a recent paper from Microsoft Research released named “Spam Double-Funnel: Connecting Web Spammers with Advertisers” [PDF download] (also note that Gary linked to it Friday).

The report categorizes search results spam by industry category, showing that some search categories have a 30% or more rate of spam. Here is a chart covering various :

Microsoft Spam  - Spammer Targeted Categories

Read this from section 4.0:

In late September 2006, we submitted the 1,000 keywords to the Search Ranger system, which retrieved the top-50 results from all three major search engines. In total, we collected 101,585 unique URLs from 1,000x50x3=150,000 search results. With a set of approximately 500 known-spammer redirection domains and AdSense IDs at that time, the system identified 12,635 unique spam URLs, which accounted for 11.6% of all the top-50 appearances. (The actual redirection-spam density should be higher because some of the doorway pages had been deactivated, which were no longer causing URL redirections when we scanned which were no longer causing URL redirections when we scanned them.)

The NY Times summarizes the paper saying they “discovered that the average spam density — a measure of the percentage of Web pages that contain only advertisements — was 11 percent for 1,000 keywords they used in their research.”

Here are some other references for you:

- Strider URL Tracer with Typo-Patrol from Microsoft Research - Strider Typo-Patrol from Microsoft Research - Typo Domain Spotting Tool & Domain Registration Stats from SEW Blog - Google AdSense For Domains Program Overdue For Reform — And Yahoo & Microsoft Should Also Take Note from SEW Blog - MS Research: Typo-Squatters Are Gaming Google from eWeek

Related Topics: Channel: SEO | SEO: Spamming | Stats: General | Stats: Relevancy


About The Author: is Search Engine Land's News Editor and owns RustyBrick, a NY based web consulting firm. He also runs Search Engine Roundtable, a popular search blog on very advanced SEM topics. Barry's personal blog is named Cartoon Barry and he can be followed on Twitter here. For more background information on Barry, see his full bio over here.

Connect with the author via: Email | Twitter | Google+ | LinkedIn


Get all the top search stories emailed daily!  


Other ways to share:

Read before commenting! We welcome constructive comments and allow any that meet our common sense criteria. This means being respectful and polite to others. It means providing helpful information that contributes to a story or discussion. It means leaving links only that substantially add further to a discussion. Comments using foul language, being disrespectful to others or otherwise violating what we believe are common sense standards of discussion will be deleted. Comments may also be removed if they are posted from anonymous accounts. You can read more about our comments policy here.

Comments are closed.

Get Our News, Everywhere!

Daily Email:

Follow Search Engine Land on Twitter @sengineland Like Search Engine Land on Facebook Follow Search Engine Land on Google+ Get the Search Engine Land Feed Connect with Search Engine Land on LinkedIn Check out our Tumblr! See us on Pinterest


Click to watch SMX conference video

Join us at one of our SMX or MarTech events:

United States


Australia & China

Learn more about: SMX | MarTech

Free Daily Search News Recap!

SearchCap is a once-per-day newsletter update - sign up below and get the news delivered to you!



Search Engine Land Periodic Table of SEO Success Factors

Get Your Copy
Read The Full SEO Guide