Is a Rel Canonical Sufficient, or Should I 'NoIndex'?
-
Hey everyone,
I know there is literature about this, but I'm always frustrated by technical questions and prefer a direct answer or opinion. Right now, we've got rel canonicals set up to deal with parameters caused by filters on our ticketing site. For example, this URL:
http://www.charged.fm/billy-joel-tickets?location=il&time=day has a rel canonical pointing to...
http://www.charged.fm/billy-joel-tickets
My question is whether this is good enough to deal with the duplicate content, or whether these pages should be de-indexed. If so, is the best way to do this with robots.txt? Or do you have to 'noindex' these pages individually?
This site has 650k indexed pages, and I think the majority of these are caused by URL parameters. While they're all canonicalized to the proper place, I think it would be best to have them de-indexed to clean things up a bit.
Thanks for any input.
-
I totally agree with EGOL on this. I would like to add my two cents, since I think I am one of the few SEO people who is also a developer.
This is what I would do (in pseudo code): put a <link rel="canonical" href="<?php echo htmlspecialchars(strtok($_SERVER['REQUEST_URI'], '?')); ?>" /> tag in the page head.
This is in PHP. I don't know what platform you are on, but what it does in PHP is return the current request URI as the canonical, deleting the ? and everything after it. So basically it returns the URL minus the query string. I use this technique a lot with my clients for canonical URLs on CMSs that use query strings, and it works great.
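For anyone not on PHP, the same idea can be sketched in Python. This is only an illustration of stripping the query string, not the poster's actual code; the function names canonical_url and canonical_link_tag are made up for the example, and it assumes the full requested URL is available to the template.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_url(url: str) -> str:
    """Return the URL with its query string (and any fragment) removed."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link rel="canonical"> tag for the page head (hypothetical helper)."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}" />'


if __name__ == "__main__":
    # Example URL taken from the question above.
    print(canonical_link_tag(
        "http://www.charged.fm/billy-joel-tickets?location=il&time=day"
    ))
```

Unlike the PHP one-liner, which works on the request path, this version keeps the scheme and host so the canonical comes out as an absolute URL.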
-
Hi, just to throw in my two cents: the canonicals should do it, as Moosa says, but if you really want to de-index, then a dynamic meta robots tag is the best way to get them out of the index in my experience.
That being said, from a quick look at your site, those URL parameters don't look like the issue. A query like site:charged.fm inurl:date= only shows a few thousand results, and location= and time= show even fewer, so it looks like the rel canonicals are doing the job and will continue to with a bit of patience. URLs with /event/ in them, however, show a lot of results (300,000+), and I am guessing many of those are for past events. Google Webmaster Tools should help you identify what the bulk of those 600 thousand URLs are, so it's worth verifying where the exact issue is before attempting to fix something that isn't a problem...
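If you did go the dynamic meta robots route, a minimal sketch in Python might look like the following. It assumes the filter parameters are the three mentioned in this thread (location, time, date); the FILTER_PARAMS set and the robots_meta_tag function are hypothetical names, and your templates would need to print the result in the page head.

```python
from urllib.parse import parse_qs, urlsplit

# Assumption: these are the filter parameters that create duplicate pages.
FILTER_PARAMS = {"location", "time", "date"}


def robots_meta_tag(url: str) -> str:
    """Return a noindex meta tag for filtered URLs, or an empty string otherwise."""
    query = parse_qs(urlsplit(url).query, keep_blank_values=True)
    if not FILTER_PARAMS.isdisjoint(query):
        # "follow" keeps the links on the filtered page crawlable.
        return '<meta name="robots" content="noindex, follow">'
    return ""


if __name__ == "__main__":
    print(robots_meta_tag(
        "http://www.charged.fm/billy-joel-tickets?location=il&time=day"
    ))
```

The clean URL gets no tag at all, so only the parameterized variants are asked to drop out of the index.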
-
There are a few choices for managing parameters. I have used...
A) The URL parameter manager in the "Crawl" options of Google Webmaster Tools. I have found it to be totally unreliable.
B) Rel=canonical. It is much more reliable than WMT, but you must still rely on search engines to discover it and obey it, which can be slow to take effect and is less than 100% effective.
I have not used robots.txt because I think that it would have similar performance to rel=canonical.
I believe that you should not trust search engines to do things for you that you can do for yourself with 100% reliability. So, I am doing...
C) Managing parameters on my server with .htaccess so I have 100% control.
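The post does not say what the .htaccess rules actually do. One common server-side pattern is to 301-redirect any URL carrying unwanted parameters to a version without them. The Python sketch below only illustrates that decision logic and is not .htaccess syntax; the ALLOWED_PARAMS whitelist and the redirect_target function are assumptions for the example. Note that this only suits parameters that do not change what the visitor should see, since redirecting would otherwise break working filters.

```python
from typing import Optional
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical whitelist: parameters the site genuinely needs to keep.
ALLOWED_PARAMS = {"page"}


def redirect_target(url: str) -> Optional[str]:
    """Return the URL to 301-redirect to, or None if the URL is already clean."""
    parts = urlsplit(url)
    pairs = parse_qsl(parts.query, keep_blank_values=True)
    kept = [(key, value) for key, value in pairs if key in ALLOWED_PARAMS]
    if kept == pairs:
        return None  # nothing to strip, serve the page as usual
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), "")
    )


if __name__ == "__main__":
    print(redirect_target("http://www.example.com/widgets?src=email&page=2"))
```

Because the server answers with a redirect before any page is rendered, the parameterized URL never returns content, which is the "100% control" the post is after.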
-
I believe that if you have set up the rel canonical correctly, there should ideally be no issue. But if you really do see some of your non-preferred versions indexed in Google, you can go with the noindex idea.
When no-indexing pages you can use any approach, but in my experience it is better to do it using robots.txt.
I hope this is a direct and to-the-point opinion :)