Does a sitemap override Google parameter handling?
-
This question might seem silly, but I'll ask anyway.
We have an eCommerce site with a ton of duplicate content, mostly caused by faceted navigation. In researching ways to reduce the clutter, I've decided to use Google's parameter handling to stop Googlebot from crawling pages with certain parameters, such as sort order and page number.
Now my question:
If I set all of these parameters so that Googlebot doesn't crawl the grids, how will it ever find the individual product pages? We do upload a sitemap with all of the product pages. Does this solve my issue? Or should I handle the duplicate content with a noindex, follow tag?
Or, is there an even better way?
Thanks
-
Hello John,
This is a very good question, and something people don't often think about when blocking the navigational paths on their site from being crawled.
Depending on how fast your category pages load and how many products are on each of them, you may want to consider a canonical View All page: http://googlewebmastercentral.blogspot.com/2011/09/view-all-in-search-results.html
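For illustration, here is a rough Python sketch of how a paginated category URL could be pointed at its View All version with a rel canonical tag. The `page` and `view=all` parameter names and the example.com URLs are assumptions for the example, not something your platform necessarily uses.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical parameter names; substitute whatever your platform uses.
PAGINATION_PARAMS = {"page"}
VIEW_ALL_PARAM = ("view", "all")


def view_all_canonical_tag(url: str) -> str:
    """Build a rel=canonical tag that points a paginated URL at its view-all page."""
    parts = urlsplit(url)
    # Keep every parameter except pagination and any existing view parameter.
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in PAGINATION_PARAMS and key != VIEW_ALL_PARAM[0]
    ]
    query.append(VIEW_ALL_PARAM)
    canonical = urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(query), "")
    )
    # Escape the URL so that "&" is valid inside the HTML attribute.
    return f'<link rel="canonical" href="{escape(canonical, quote=True)}">'


print(view_all_canonical_tag("https://www.example.com/shoes?page=3"))
```

The tag would go in the head of each paginated page. Whether this is a good fit still depends on how quickly the View All page loads.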
There are many different ways to handle faceted navigation problems, including JavaScript, Google Webmaster Tools (GWT) parameter handling, robots meta tags, robots.txt, rel canonical... and combinations of these. The right approach should be customized for your specific needs. When possible, I prefer to allow Google to crawl and index down to a certain level of faceting, similar to allowing them into sub-categories (though it depends entirely on your taxonomy) but not tertiary (i.e. sub-sub) categories. For the next couple of levels I might allow them to crawl, but not index. And once it gets down to 4 or 5 levels deep (e.g. /?category=1&size=5&color=blue&price=low&this=that&so-on=so-forth...) I just block them from being both indexed and crawled (i.e. meta NOINDEX,NOFOLLOW or a robots.txt block) to save crawl budget by avoiding spider traps.
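As a rough sketch of that tiered approach, the Python below picks a robots meta value from the number of distinct query parameters on a URL. The thresholds, the function name, and the idea of counting every parameter as one facet level are assumptions for illustration; your own cut-offs will depend on your taxonomy.

```python
from urllib.parse import parse_qsl, urlsplit

# Hypothetical thresholds; tune them to your own taxonomy.
MAX_INDEXABLE_FACETS = 1  # crawl and index up to this depth
MAX_CRAWLABLE_FACETS = 3  # crawl but do not index up to this depth


def robots_directive(url: str) -> str:
    """Pick a robots meta value from how many distinct parameters the URL has."""
    query = urlsplit(url).query
    # Assumption: every distinct query parameter counts as one facet level.
    facet_count = len({key for key, _ in parse_qsl(query, keep_blank_values=True)})
    if facet_count <= MAX_INDEXABLE_FACETS:
        return "index,follow"
    if facet_count <= MAX_CRAWLABLE_FACETS:
        return "noindex,follow"
    # Deep facet combinations: keep them out of the index and stop following links.
    return "noindex,nofollow"


for example in (
    "https://www.example.com/?category=1",
    "https://www.example.com/?category=1&size=5&color=blue",
    "https://www.example.com/?category=1&size=5&color=blue&price=low&this=that",
):
    print(example, "->", robots_directive(example))
```

The returned value would be written into the page's robots meta tag. For the deepest level, a robots.txt rule is the alternative mentioned above.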
With all of that said, if you are giving Google an XML sitemap that contains the indexable URLs for all of your products, they should have no problem indexing them, regardless of whether or not they can crawl all the way through your faceted navigation.
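If you don't already generate that sitemap automatically, a minimal Python sketch might look like the one below. It assumes you can produce a list of canonical product URLs; the `build_sitemap` name and the example.com URLs are made up for the example.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(product_urls):
    """Return a sitemap XML string with one <url><loc> entry per product URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in product_urls:
        entry = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as "&" when serializing.
        ET.SubElement(entry, "loc").text = url
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


# Hypothetical product URLs; list only the canonical, indexable versions.
sitemap_xml = build_sitemap([
    "https://www.example.com/products/blue-widget",
    "https://www.example.com/products/red-widget",
])
print(sitemap_xml)
```

Save the output as a UTF-8 file and submit it in Webmaster Tools. The point is that it lists only the product URLs you want indexed, not the faceted grid URLs.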
-
I would recommend using a canonical link (rel="canonical").
You can find more here: