Search Engine Land » SEO » Everything You Wanted To Know About Blocking Search Engines

Everything You Wanted To Know About Blocking Search Engines

Last week, the three major search engines came together to say how they agree — and disagree — over the Robots Exclusion Protocol. It’s such an important standard, one every webmaster should understand. To help, Vanessa Fox has compiled an extensive and outstanding overview of it at Jane & Robot in her Managing Robot’s Access […]

Danny Sullivan on June 12, 2008 at 10:14 am | Reading time: 2 minutes

Chat with SearchBot

Please note that your conversations will be recorded.
I am trained with Search Engine Land content. Ask me anything!

SearchBot is thinking ...

Last week, the three major search engines
came together to
say how they agree — and disagree — over the Robots Exclusion Protocol.
It’s such an important standard, one every webmaster should understand. To
help, Vanessa Fox has compiled an extensive and outstanding overview of it
at Jane & Robot in her

Managing Robot’s Access To Your Website post.

The tutorial takes you through key areas such as:

A nice chart showing what you can block using either robots.txt or the
meta robots tag for each major search engine. It also covers other things
like reverse DNS lookup to verify a crawler’s identity.
Types of content you want private from search engines versus public.
Rather than private versus public, "not listed" versus "listed" might be
better terms Anything that really should be private ought to be kept
behind a password barrier. The tutorial does cover this, but it’s worth
stressing that no one should think robots exclusion is a method to keep
private/personally identifiable information out of search engines. But
there’s other info that you might want "private" in terms of not being
listed, such as printer-friendly pages, as the tutorial also explains.
How to block search engines, such as on a site-wide basis using
robots.txt, along with tips like using wildcards, specifying particular
search engines by crawler name. Page level blocking (with meta tags) is
also covered. There are lots of examples.
Common mistakes and myths are addressed, such as the idea that using
nofollow alone will keep pages from being indexed. Methods of testing
implementation are also covered.

Bookmark the guide — it’s one you’ll want to come back to time and
again.

Contributing authors are invited to create content for Search Engine Land and are chosen for their expertise and contribution to the search community. Our contributors work under the oversight of the editorial staff and contributions are checked for quality and relevance to our readers. The opinions they express are their own.

Add Search Engine Land to your Google News feed.

Related stories

New on Search Engine Land

Google Merchant Center Next fixes product download frustration

Phishing scam targets Microsoft Ad accounts via Google Search

From retro gaming glory to fighting Google: The journey of Brandon Saltalamacchia

How volatile have Google rankings really been?

How to maximize your Google Ads remarketing campaigns

About the author

Contributor

Danny Sullivan

Danny Sullivan was a journalist and analyst who covered the digital and search marketing space from 1996 through 2017. He was also a cofounder of Third Door Media, which publishes Search Engine Land and MarTech, and produces the SMX: Search Marketing Expo and MarTech events. He retired from journalism and Third Door Media in June 2017. You can learn more about him on his personal site & blog He can also be found on Facebook and Twitter.