Duplicate Content & Canonicals
-
I am a bit confused about canonicals and whether they are "working" properly on my site. In Webmaster Tools, I'm showing about 13,000 pages flagged for duplicate content, but nearly all of them are showing two pages, one URL as the root and a second with parameters. Case in point, these two are showing as duplicate content:
http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night
We have a canonical tag on each of the pages pointing to the one without the parameters. Pages with other parameters don't show as duplicates, just one root and one dupe per listing,
So, am I not using the canonical tag properly? It is clearly listed as:Is the tag perhaps not formatted properly (I saw someone somewhere state that there needs to be a /> after the URL, but that seems rather picky for Google)?Suggestions?
-
Thanks, Dr. Pete.
I'll discuss the options with our dev team and see which one will cause the least amount of developer caffeine consumption.
-
Argh... sorry, I didn't even check/see that. Yeah, that may be a real problem - you're basically sending two canonicalization signals that are in conflict. Is there any way to hide the defaults? If the canonicals point to (A), but then (A) redirects to (B), Google may just ignore the canonical.
Unfortunately, your options are to either: (1) hope for the best, (2) canonical to the uglier URL, or (3) kill the redirect and set the default parameters on the server-side (without resetting the URL).
I am primarily seeing the canonical URL in Google's index, so I'm not sure it's actually causing you harm. It's just not an ideal situation.
-
Dr. Pete:
I'm looking into it to be sure, but I believe that you are correct in that this is an ad-tracking URL.
A follow up question:
The URL that is the canonical version of each page would be in the format of
http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night
However, this exact URL redirects to one with default parameters for substrate, style and frame size:
Should we change our canonical from the first URL (without the parameters) to the second URL with the parameters? Or is that a moot point with Google?
-
While the properly closed tag should have "... />", that's generally only an issue in very isolated cases. I've never seen it interfere with a canonical tag. It's a harmless change to make (and it is more correct), but my gut reaction is that this will make no difference. Google should be honoring these canonicals.
One odd thing I'm seeing. If I dig into the index, I'm finding the following page:
This may be an ad-tracking URL (?) and it's redirecting somehow (but not with a 301 or 302) to the non-canonical URL. This may be sending a mixed signal, and ideally it would redirect to the canonical version of the URL. I'm not sure where this version is coming from, so it's a bit hard to diagnose.
-
Hi Darin
The tag is not working because if you go into Google and enter the URL: http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night?substrate_id=3&product_style_id=8&frame_id=63&size=25x20 you will see that it is being indexed on Google.
If it's being indexed, then it runs the risk of duplicate content issues.
The tag definitely does need the /> at the end, so the correct usage of the tag would be: rel="canonical" href="http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night" />
I think if you implement that small change, there shouldn't be any problems.
Hope this helps.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
NO Meta description pulling through in SERP with react website - Requesting Indexing & Submitting to Google with no luck
Hi there, A year ago I launched a website using react, which has caused Google to not read my meta descriptions. I've submitted the sitemap and there was no change in the SERP. Then, I tried "Fetch and Render" and request indexing for the homepage, which did work, however I have over 300 pages and I can't do that for every one. I have requested a fetch, render and index for "this url and linked pages," and while Google's cache has updated, the SERP listing has not. I looked in the Index Coverage report for the new GSC and it says the urls and valid and indexable, and yet there's still no meta description. I realize that Google doesn't have to index all pages, and that Google may not also take your meta description, but I want to make sure I do my due diligence in making the website crawlable. My main questions are: If Google didn't reindex ANYTHING when I submitted the sitemap, what might be wrong with my sitemap? Is submitting each url manually bad, and if so, why? Am I simply jumping the gun since it's only been a week since I requested indexing for the main url and all the linked urls? Any other suggestions?
Web Design | | DigitalMarketingSEO1 -
Mergers & Acquisitions - Website Transition Good practice
Hi everyone, I was wondering if anyone has come across good practice for maintaining websites after a merger or acquisition where there needs to be an association between two websites of the two companies involved. For an acquisition, I'm considering moving the acquired company to a sub domain of the parent company e.g. aquiredcompany.parentcompany.com. On both websites there wmay be a prominant popup so visitors can switch between the websites if they have visited the incorrect one. One worry I have is the acquired company has some good rankings, which I want to keep. I will of course manage the process through 301 redirects. But I was wondering if anyone has any thoughts on this approach or can suggest any better solutions. Thanks in advance, Stuart
Web Design | | Stuart260 -
Managing website content/keywords for wordpress site
We are in the midst of redesigning our website and have been working with freelance blog/content writers to increase the unique content on our site. We are finding it increasingly difficult to manage the topics/keywords as we continue to expand. Googledrive and google spreadsheets have been our primary tools thus far. Can anyone recommend a good tool that would allow us to manage content and blog posts for our site?
Web Design | | Tom_Carc0 -
Above the Fold Content - Use of large images
Hi All, Our designers have come to the SEO team to ask if have a large image across the top of the page taking up a large majority of the above the fold real estate will impact our SEO. Our initial thoughts are no as long as we have an optimised H1 visibal to the user landing there which informs them what the page is about. Any thoughts would be appreciated.
Web Design | | J_Sinclair1 -
Will a .com and .co.uk site (with exact same content) hurt seo
hello, i am sure this question has been asked before, but while i tried to search i could not find the right answer. my question is i have a .com and .co.uk site. both sites have exact same product, exact same product descriptions, and everything is the same. the reason for 2 sites is that .com site shows all the details for US customers and in $, and .co.uk site shows all the details to UK customers and with Pound signs. the only difference in the 2 sites might be the privacy policy (different for US and UK) and different membership groups the site belongs to (US site belong to a list of US trade groups, UK belongs to a list of UK trade groups). my question is other than the minor difference above, all the content of the site is exactly the same, so will this hurt seo for either one or both the site. Our US site much more popular and indexed already in google for 4 years, while our UK site was just started 1 month ago. (also both the sites are hosted by same hosting company, with one site as main domain and the other site as domain addon (i thought i include this information also, if it makes sense to readers)) i would appreciate a reply to the question above thanks
Web Design | | kannu10 -
Competitive Analysis: Links & Keywords
I'm noticing that for some key local search terms our company is not ranking in SERPs as I would expect considering it's size relative to the local sites that are ranking. I subscribed to SEOmoz to get a better understanding of what's going on, and haven't figured it out yet. Our site is higher in almost every metric than the sites we're competing with, but our competition consistently ranks higher in organic results for industry standard keywords. The few metrics we're being outranked in are, "Linking C Blocks" and "Page MozTrust" (we're very close to the leader in MozTrust). Are these two metrics enough to account for our companies poor SERP performance or do I need to be paying attention to something else?
Web Design | | thinkWebstoreSEO0 -
Getting a lot more duplicate content warnings than I expected.
I run WordPress on many of my sites and a site crawl has found MANY duplicate content pages on the latest domain I started a campaign for. I expected to see quite a lot on the tag pages that only had one post but even tag pages with multiple posts and author and category pages with many posts are showing as duplicate content. Is this normal for a WordPress site to have so much duplicate content warnings from the taxonomy pages? I have the option to bulk noindex, follow the category and tag pages but should I do it? I get some traffic directly to the tag pages so removing the pages from search results would dent the traffic of the site a little (generally high bounce rate, low engagement traffic anyway) but could removing the apparent duplicate content actually improve the article pages themselves? Or does anyone have any WordPress specific advice for making the pages not duplicate content? I've toyed with the idea of just displaying excerpts but creating manual excerpts for the 4 years worth of posts, some of which I have no personal knowledge of the subject matter so other suggestions are welcome.
Web Design | | williampatton0 -
Canonical Tag
I've been helping someone out with their website, and I noticed the person who built the site made the canonical tags like this:
Web Design | | StandUpCubicles
href="http://www.example.com/" rel="canonical" /> I'm use to seeing it how seomoz does it: Does this matter? Is it ok to have it inverted? They also have another canonical tag in there like this:
var hs_canonical_url = "http\x3A\x2F\x2Fwww.example.com\x2Fhome" Any idea what that is? Could it be hurting the site?0