Crawl Budget and Faceted Navigation

Webpresence

Hi, we have an ecommerce website with facetted navigation for the various options available.

Google has 3.4 million webpages indexed. Many of which are over 90% duplicates.

Due to the low domain authority (15/100) Google is only crawling around 4,500 webpages per day, which we would like to improve/increase.

We know, in order not to waste crawl budget we should use the robots.txt to disallow parameter URL’s (i.e. ?option=, ?search= etc..). This makes sense as it would resolve many of the duplicate content issues and force Google to only crawl the main category, product pages etc.

However, having looked at the Google Search Console these pages are getting a significant amount of organic traffic on a monthly basis.

Is it worth disallowing these parameter URL’s in robots.txt, and hoping that this solves our crawl budget issues, thus helping to index and rank the most important webpages in less time.

Or is there a better solution?

Many thanks in advance.

Lee.

jcnotfound2083

Hello, I have also been in a similar situation. What I did was to disallow the urls with parameters using the robots.txt and place (in only the pages with parameters) the following two html tags:

This will expressly indicate to google not to index these pages. I still have some errors but I guess they will disappear in a few months.

Regards

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Crawl Budget and Faceted Navigation

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Crawl Depth improvements

Does Google make continued attempts to crawl an old page one it has followed a 301 to the new page?

How can Google index a page that it can't crawl completely?

MOZ crawl report says category pages blocked by meta robots but theyr'e not?

How to stop pages being crawled from xml feed?

Could you use a robots.txt file to disalow a duplicate content page from being crawled?

Crawl questions

How to prevent Google from crawling our product filter?