Should we use Google's crawl delay setting?

lzhao

We’ve been noticing a huge uptick in Google’s spidering lately, and along with it a notable worsening of render times.

Yesterday, for example, Google spidered our site at a rate of 30:1 (google spider vs. organic traffic.) So in other words, for every organic page request, Google hits the site 30 times.

Our render times have lengthened to an avg. of 2 seconds (and up to 2.5 seconds). Before this renewed interest Google has taken in us we were seeing closer to one second average render times, and often half of that.

A year ago, the ratio of Spider to Organic was between 6:1 and 10:1.

Is requesting a crawl-delay from Googlebot a viable option?

Our goal would be only to reduce Googlebot traffic, and hopefully improve render times and organic traffic.

Thanks,

Trisha

RichardVaughan

Unfortunately you can't change crawl settings for Google in a robots.txt file, they just ignore it. The best way to rate limit them is using custom Crawl settings in Google Webmaster Tools. (look under Site configuration > Settings)

You also might want to consider using your loadbalancer to direct Google (and other search engines) to a "condomised" group of servers (app, db, cache, search) thereby ensuring your users arent inadvertantly hit by perfomance issues caused by over zealous bot crawling.

lzhao

We're a publisher, which means that as an industry our normal render times are always at the top of the chart. Ads are notoriously slow to load, and that's how we earn our keep. These results are bad, though, even for publishing.

We're serving millions of uniques a month, on a bank of dedicated servers hosted off site, load balanced, etc.

adriandg

more info on that here: http://www.robotstxt.org/

adriandg

Wow! those are really high render times. Have you considered perhaps moving to another webserver? NginX is pretty damm fast, and could probably get those render times down. Also, are you on a shared host? or is this a dedicated server?

What you're looking for is the robots.txt file though, and you want to add some lines like this:

User-agent: *
Disallow:
Crawl-Delay: 10

User-agent: ia_archiver
Disallow: /

User-agent: Ask Jeeves
Crawl-Delay: 120

User-agent: Teoma
Disallow: /html/
Crawl-Delay: 120

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Should we use Google's crawl delay setting?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Why Google crawl parameter URLs?

Good alternatives to Xenu's Link Sleuth and AuditMyPc.com Sitemap Generator

If Google's index contains multiple URLs for my homepage, does that mean the canonical tag is not working?

Medium sizes forum with 1000's of thin content gallery pages. Disallow or noindex?

How to create site map for large site (ecommerce type) that has 1000's if not 100,000 of pages.

Hard-working newbie question: benefit of moving my blog to my online store's domain?

Mobile site rank on Google S.E. instead of desktop site.

Are URL's with trailing slash seen as two different URLs