• Search Engine Land
  • Sections
    • SEO
    • SEM
    • Local
    • Retail
    • Google
    • Bing
    • Social
    • Resources
    • More
    • Home
  • Search Engine Land
  • SEO
  • SEM
  • Local
  • Retail
  • Google
  • Bing
  • Social
  • Resources
  • Live
  • More
  • Events
  • SUBSCRIBE

Search Engine Land

Search Engine Land
  • SEO
  • SEM
  • Local
  • Retail
  • Google
  • Bing
  • Social
  • Resources
  • More
  • Newsletters
  • Home
SEO

How to Recover When Your Content Is Stolen

Last month, I shared a case study of a client I’m currently working with on a duplicate content issue. It turns out that this particular site had significantly lost rankings over the past year because of other sites “lifting” their content, often verbatim, causing the client’s site to lose rankings through Panda updates. This is […]

Janet Driscoll Miller on September 5, 2013 at 9:10 am
  • More

Last month, I shared a case study of a client I’m currently working with on a duplicate content issue. It turns out that this particular site had significantly lost rankings over the past year because of other sites “lifting” their content, often verbatim, causing the client’s site to lose rankings through Panda updates.

This is a common problem for content producers. You produce great content only to have other sites disregard copyright and repurpose the content on their own sites — without attribution. I see this often in the case of associations and non-profits that may be producing valuable research or information that other sites may want to share. In some cases, the infringement may be unintentional… but in many cases, it isn’t.

So, how can you recover when your content is stolen? First, understand that this isn’t a quick fix or fast process. However, it’s the process that I find works best with Google.

Step 1: Find The Infringements

If you believe you’ve been hit by a Panda update and are seeing dramatic traffic losses on certain pages, I would prioritize looking for duplicate content that corresponds to those specific page losses.

To get started, copy a few lines of content from the page on your website and search for that content as an exact match search in Google by putting it in quotes. If you find pages other than your own that appear with this exact match content, it’s time to find out exactly how much of the content on these pages matches yours. You can also use Copyscape’s Web search to see if it readily identifies other copies of your content on the Web.

Copy the URL from your page and the URL from the suspect page and paste them into Copyscape’s Content Comparison tool. This tool looks at both pages of content, side-by-side, and indicates the percentage of overlap in the two pages’ content. My rule of thumb is that anything over 50% really does need to be addressed immediately. However, you’d be surprised how often we see 90-100% duplicate content.

Step 2: Log The Infringements

As you find duplicate copies, log the information in a spreadsheet, including the percentage of overlap. If you find a site that seems to have copied any of your content nearly verbatim, focus in on these sites and see what else you can find on them. I’ve typically found that sites which duplicate one page of your site don’t stop with just one page. Log all of the pages you can find from the sites that have duplicated your site content.

Also check the Wayback Machine to see how long the infringing site has been using the copyrighted content. Go back as far as you can to get a full understanding and log of information about this infringement (you may need the information later).

Step 3: Reach Out To The Infringing Site Owners

Next, you’ll want to reach out to the infringing site owner(s). I generally start with a friendly email alerting the site owner about the infringement and politely asking the site owner to take the pages down. I also request that the site owner respond to the email, letting me know that the content is down, by a certain date. List out all of the pages on the owner’s site that are in violation and that you would like removed.

How can you find out who owns a site? If the site itself does not provide contact information, check out who owns the domain through the WhoIs lookup. The site owner may have his/her contact information hidden; but if not, you can see the individual to contact and an email and mailing address.

Generally, the email is enough to get the owner to take the infringing content down. However, if it’s not, you may want to send a more strongly-worded letter via postal mail. In this case, it may also be helpful to have the services of an attorney who can send a legal letter on your behalf.

dmca-google

Step 4: If the Pages Are Not Removed

After all of your efforts, if the site pages are not removed and you cannot resolve the conflict with the site owner, it’s time to hard ball.

Copyright infringement on the Web is a violation of the Digital Millennium Copyright Act (DMCA) (pdf). Google and Bing will take down content that is in violation of DMCA, but they do request that you attempt to contact the site owner to resolve the issue first. The forms you’ll need to fill out are:

  • Google DMCA Form
  • Bing DMCA Form

The Wrinkle Of Content Syndication

If you’re syndicating content or giving another site permission to copy your content, it can be tricky. If the content is completely duplicated, you (and the site using the content) risk being affected by Panda updates. Sometimes, too, Google misinterprets the original content creator or doesn’t rank the preferred version highest. Google’s advice on syndicated content is:

Syndicate carefully: If you syndicate your content on other sites, Google will always show the version we think is most appropriate for users in each given search, which may or may not be the version you’d prefer. However, it is helpful to ensure that each site on which your content is syndicated includes a link back to your original article. You can also ask those who use your syndicated material to use the noindex meta tag to prevent search engines from indexing their version of the content.

Protecting Your Content Long Term

While I know that all of this may seem daunting, ultimately it is the responsibility of the copyright owner to protect his/her copyrighted material. One of the first ways you can do this is by making your copyright very clear — add the copyright icon and year to each page of your website. I also prefer to see the beginning year to the present year following the copyright so that it is very clear when the copyright was established.

Another tool you can use to catch duplicate pages before they cause you Panda issues is the CopySentry tool from CopyScape. This tool, for a small fee, will continuously monitor certain pages you identify on your site and notify you when duplicates are found. If you have pages that tend to be more popular or have been duplicated in the past, I’d prioritize these pages for monitoring.

All in all, the process takes time. Time to research the infringing pages, time to document, time to contact site owners, time to report to search engines and time to see recovery (even when infringing pages have been removed). It can be frustrating, but it’s a necessary process to protect your content and keep your organic rankings strong.


Opinions expressed in this article are those of the guest author and not necessarily Search Engine Land. Staff authors are listed here.



About The Author

Janet Driscoll Miller
Janet Miller is the President and CEO of Marketing Mojo. She regularly blogs on a variety of search engine marketing topics, often focusing on technical solutions. You can find her on Twitter @janetdmiller.

Related Topics

All Things SEO ColumnChannel: SEOGoogle: Panda Update 🐼Google: SEOLegal: CopyrightPanda Update Tips

We're listening.

Have something to say about this article? Share it with us on Facebook, Twitter or our LinkedIn Group.

Get the daily newsletter search marketers rely on.

Processing...Please wait.

See terms.

ATTEND OUR EVENTS

Lorem ipsum doler this is promo text about SMX events.

June 15-16, 2021: SMX Advanced

June 21-22, 2021: SMX Advanced Europe

August 17, 2021: SMX Convert

November 9-10, 2021: SMX Next

December 14, 2021: SMX Code

Available On-Demand: SMX

Available On-Demand: SMX Report

Available On-Demand: SMX Create

×


Learn More About Our SMX Events

Discover actionable tactics that can help you overcome crucial marketing challenges. Our next conference will be held:

Next Event: Sept. 14-15, 2021

Available On-Demand: March 2021

Available On-Demand: October 2020

×

Attend MarTech - Click Here


Learn More About Our MarTech Events

White Papers

  • SEO Wars: How to Resist the Dark Side and Earn Links Organically
  • Data & Organizational Roadblocks? Your Path to Frictionless Revenue Optimization
  • Converting with Conversational AI
  • 4 Ways Chatbot Marketing Can Drive Sales
  • Client Reporting Best Practices Guide
See More Whitepapers

Webinars

  • Drive Customer Engagement with the Power of Personalization
  • 7 Use Cases That Prove Why You Should Implement DAM
  • Accelerate Your SEO & Content Marketing Program with 4 Key Milestones
See More Webinars

Research Reports

  • Local Marketing Solutions for Multi-Location Businesses
  • Enterprise Digital Asset Management Platforms
  • Identity Resolution Platforms
  • Customer Data Platforms
  • B2B Marketing Automation Platforms
  • Call Analytics Platforms
See More Research

Attend SMX For Only $199

h
Receive daily search news and analysis.

Channels

  • SEO
  • SEM
  • Local
  • Retail
  • Google
  • Bing
  • Social

Our Events

  • SMX
  • MarTech

Resources

  • White Papers
  • Research
  • Webinars

About

  • About Us
  • Contact
  • Privacy
  • Marketing Opportunities
  • Staff

Follow Us

  • Facebook
  • Twitter
  • LinkedIn
  • Newsletters
  • RSS
  • Youtube

© 2021 Third Door Media, Inc. All rights reserved.

Your privacy means the world to us. We share your personal information only when you give us explicit permission to do so, and confirm we have your permission each time. Learn more by viewing our privacy policy.Ok