Yahoo Updates Crawler: Introduces Yahoo Slurp 3.0

Yahoo has announced the launch of its new crawler, Yahoo! Slurp 3.0. Version 3.0 is being rolled out gradually over the “next several weeks,” along with “some infrastructure updates,” Yahoo said. I suspect the recent changes we reported with the Yahoo Search Tax Day update are related to the “infrastructure updates.”

The new spider will honor the same “Slurp” user-agent token and all existing robots.txt directives, but will be named “Slurp 3.0” as opposed to “Slurp 2.0.” Yahoo tells us to expect two changes:


(1) The crawlers will reside on a new set of IP addresses, but a reverse DNS lookup on those addresses will still resolve to the crawl.yahoo.net domain. If you are using IP-based rules, keep an eye out for the new Slurp crawler and add its IP addresses to your database.

(2) The new crawlers will return a new user agent, specifically Yahoo! Slurp/3.0. If you set directives for “Slurp” or “Yahoo! Slurp,” those will keep working, but if you specified “Slurp/2.0,” your rules may no longer match. Yahoo recommends using User-agent: Slurp in your robots.txt file.
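
To see why a version-pinned directive can stop matching, here is a minimal sketch using Python's standard urllib.robotparser module. It only approximates how a crawler reads robots.txt; Yahoo's own matching logic is not documented here and may differ. The helper name slurp_is_blocked, the two rule sets, and the /private/ path are all hypothetical and exist only for illustration.

```python
from urllib.robotparser import RobotFileParser


def slurp_is_blocked(robots_txt: str, path: str,
                     crawler_token: str = "Yahoo! Slurp/3.0") -> bool:
    """Return True if this robots.txt text disallows `path` for the crawler.

    Uses Python's standard-library parser, which is an approximation of
    crawler behavior, not Yahoo's actual implementation.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(crawler_token, path)


# Hypothetical rule sets; /private/ is an illustrative path.
GENERIC_RULES = "User-agent: Slurp\nDisallow: /private/\n"
VERSIONED_RULES = "User-agent: Slurp/2.0\nDisallow: /private/\n"

if __name__ == "__main__":
    # Rule written against the generic "Slurp" token.
    print(slurp_is_blocked(GENERIC_RULES, "/private/report.html"))
    # Rule pinned to the old "Slurp/2.0" version string.
    print(slurp_is_blocked(VERSIONED_RULES, "/private/report.html"))
```

Under this parser, the generic rule still applies to the 3.0 token, while the rule pinned to “Slurp/2.0” is skipped and the path is left open to the crawler, which is the kind of issue Yahoo is warning about.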

Remember, you can run a reverse DNS lookup on the spiders' IP addresses to confirm that this new crawler is from Yahoo. If you have any problems, Yahoo asks that you submit them at the SiteExplorer forum.
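
The reverse DNS check can be sketched in Python roughly as follows. The function name is_yahoo_crawler and its injectable lookup parameters are hypothetical. The forward-confirmation step (resolving the returned hostname back to the same IP) is a common precaution that I am adding as an assumption; Yahoo's announcement only mentions the reverse lookup.

```python
import socket


def is_yahoo_crawler(ip, reverse_lookup=None, forward_lookup=None,
                     domain="crawl.yahoo.net"):
    """Return True if `ip` reverse-resolves to a host under `domain`
    and that host resolves forward to the same IP.

    The lookup functions are injectable so the logic can be exercised
    without touching the network; by default they use the socket module.
    """
    if reverse_lookup is None:
        reverse_lookup = lambda addr: socket.gethostbyaddr(addr)[0]
    if forward_lookup is None:
        forward_lookup = lambda host: socket.gethostbyname_ex(host)[2]

    try:
        host = reverse_lookup(ip)
    except OSError:
        # No PTR record or lookup failure: cannot confirm.
        return False

    host = host.rstrip(".").lower()
    # Require the exact domain or a true subdomain, so a host like
    # "crawl.yahoo.net.example.org" does not pass.
    if not (host == domain or host.endswith("." + domain)):
        return False

    try:
        # Forward-confirm (assumed extra step): the hostname must map
        # back to the IP we started with.
        return ip in forward_lookup(host)
    except OSError:
        return False
```

To use it against live DNS, call is_yahoo_crawler with the visitor's IP address taken from your server logs. The result depends on the DNS records at the time of the call.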

This update confirmed two past posts:

(1) Yahoo Slurp being extremely hyperactive one weekend.
(2) A Yahoo Search update that we reported over the weekend.


About the author

Barry Schwartz
Staff
Barry Schwartz is a Contributing Editor to Search Engine Land and a member of the programming team for SMX events. He owns RustyBrick, a NY-based web consulting firm. He also runs Search Engine Roundtable, a popular search blog on very advanced SEM topics. Barry can be followed on Twitter here.
