Tweet Showing How Google Itself Is A “Scraper Site” Goes Massively Viral


Perhaps it’s SEO’s “Oreo moment,” a tweet relating to search engine optimization that’s gained nearly as much attention as Oreo’s famous Super Bowl blackout tweet. But the subject was a perfect storm of goodness — a real-life example of Google doing the type of thing in search it seems to be telling others not to do.

Yesterday, Matt Cutts, the head of Google’s web spam team, announced a new Google Scraper Report for publishers to use if they see a site that has copied or “scraped” their content and that outranks them in Google searches.

That quickly prompted a number of people to joke in various places about how Google itself borrows content from other sites to make the direct answers it displays in its own search results. But no joke hit the mark like the one from digital marketer Dan Barker on Twitter, who tweeted back to Cutts:

It was super clever. Barker did a search for “what is a scraper site,” which brought up Google’s own web definition at the top of the results. And that definition technically outranks the original source of the content, Wikipedia, which comes right below.

Google does link to Wikipedia in its excerpt, which is in keeping with how its other search results work and has generally been on the right side of the law when these things have been challenged in various places. And by scraper site, Google’s really talking about sites that wholesale copy all of someone’s content, rather than aiming for a fair use excerpt.

But still, as Google has increased the number of web definitions, direct answers and Knowledge Graph box answers that are drawn from the content of other sites, the tensions have been rising.

With regular search listings, Google typically showed enough information for searchers to decide if they wanted to visit a website and, if so, they’d click through. But the changes over the past few years (which Bing has also made) have been to provide actual answers drawn from sites, so that there’s no need to click through.

It’s a difficult balancing act, because there are good reasons why it makes more sense for Google (or Bing) to just show the direct answer to a question, rather than having dozens of sites all fight to be number one for “What time is the Super Bowl,” as they do.

But, it’s also a fundamental change to the unwritten contract between search engines and publishers — that yes, search engines can build their “content” on the back of publisher content, but only if there’s a fair exchange of traffic.

Barker’s tweet is perhaps the biggest sign ever that publishers feel the balancing act is tipping too far to Google’s side. In 18 years of writing about search, I’ve never seen a response like this. Last year’s Oreo tweet, sent during the Super Bowl blackout, was a darling example of huge engagement.

That tweet, associated with a prime time event, has about 16,000 retweets as of today, over a year later. Barker’s tweet, not associated with any major sporting event and about an issue that’s usually only of concern to SEOs, has over 14,000 retweets as I write this, and over 12,000 favorites.




About The Author: Danny Sullivan is a Founding Editor of Search Engine Land. He’s a widely cited authority on search engines and search marketing issues who has covered the space since 1996. Danny also serves as Chief Content Officer for Third Door Media, which publishes Search Engine Land and produces the SMX: Search Marketing Expo conference series. He has a personal blog called Daggle (and keeps his disclosures page there). He can be found on Facebook and Google+, and he microblogs on Twitter as @dannysullivan.
