Duplicated content detected with MOZ crawl with canonical applied
-
Hi there!
I have a slight problem.
I have a site with Joomla 3.3 that we recently migrated from 2.5.Joomla, for some reason that I don´t really get, creates hundreds of weird urls for the site like
mydomain.com/en -> joomla creates en/home/149-xxx-xxx/xxxxxx-xxxxxx that links to the first one.
The new version 3.3 knows this bug and applies a rel=canonical to the ones created "artificially", so they should not be identified as duplicated.Sample piece of code: en/home/149-all-en/xxxxxxx-xxxxxx" rel="canonical" /
MOZ crawler identifies this as duplicated and like this I have thousands of pages duplicated all with titles, content etc... all the ones created by joomla. Still my site has good SEO results and I can not see any penalties but I am a bit concerned they may come in the future....
Can anyone explain me what is happening?
Thank you in advance for your time,
-
If it's a period of 2 weeks and you're going to do it anyways, I would just make the new content and not go to the expense of setting up redirects and then taking them down, which can cause issues when you plan on recreating a URL.
-
Thank you for your time!
We are going to setup 301 redirects (one colleague suggested importing those directly in the DB of redirects) from those duplicated pages until joomla has a native solution and we have the time to make all unique content, to avoid penalties.
At least, we would solve temporaly the problem, it will take 2 weeks to make all the unique content.
Would that make sense?
Have a nice weekend!
-
I personally would not generate new language sections unless the content has been translated and localized on those pages. Right now your Spanish homepage has English content in the body, so I would view this as incomplete. Ideally you'd translate the entire page for those sections.
When you do that, you'll want to use hreflang, not canonicals, to indicate different versions of the same content.
So, my recommendation is (A) get rid of the Spanish content sections which would solve the duplication problem, or (B) finish translating the content and then install hreflang code, which would also solve the duplication problem.
Unfortunately I don't know of a good hreflang tool for Joomla specifically.
Let me know if that makes sense?
-
Thank you Kane.
I would like to keep the content in all the languages, ,as I think it is useful for customers to enter easily certain areas.
The problem that I am always having is the implementation...There are not real good canonical plugins (that would allow me to do a bulk import), and I am not that advanced as for doing an htaccess redirect with 301... still, I would like that if someone from NL or FI version would like to find the area barcelona could see it....
Anything on mind!? Just to say, I tried SH404, does all the work but rewrites the whole url structure (not possible), I tried canonical http://www.cmsplugin.com/products/components/4-canonical-url which solves the duplication by languages but not the random urls created by 3.3...
Then I decided to leave the plugin I mentioned before, it deletes all the duplicated urls generated automatically but does not solve the language problem...So, here I am
Any suggestion?
-
Also, if you decide to keep the /es/ section of the website then you'll need to look into hreflang instead of canonical tags, because /es/ and /en/ will not be duplicate content once they're translated.
Read this Q&A from Google for details - https://sites.google.com/site/webmasterhelpforum/en/faq-internationalisation#q20
-
Hey Jose,
If you have an /es/ subfolder then ideally you would be translating that content to Spanish, not canonicalizing that content back to the English version.
I can see from http://www.spain-internship.com/es/internships-in-salamanca that not all /es/ pages are translated - is this true across the entire website?
If you don't have any Spanish content, then you should just kill off the /es/ version entirely.
-
Hi there,
Thanks for the update. Now that you told me the problem I found out this is a known bug for joomla and I am working on it.
I found a plugin http://styleware.eu/store/item/26-styleware-content-canonical-plugin that sends all the duplicated urls, generated automatically with a canonical to the home.Sample:
http://www.spain-internship.com/en/home/149-all-en/placement-spain
Now with the link http://www.spain-internship.com" rel="canonical" />.This solves the problem of the core canonical bug.
Would this be a proper solution?Now I only have to change all the ones duplicated due to languages config, block then in robots or canonical but as far as I control it, it is ok.
Please, let me know if this would be a proper solution.
Thank you in advance for your help, if I can help you in some moment with something here we are!
-
Ok, the problem is your pages are all canonical to themselves, the canonical tag should point at the main page for the content, not to every page. For your first example, all pages that get their content from http://www.spain-internship.com/en need to have canonical tags to that page, instead the copy page has this:
href="http://www.spain-internship.com/fi/etusivu/186-all-fi/home-page-fi" rel="canonical" />
it should have
href="http://www.spain-internship.com/fi/" rel="canonical" />
-
I will provide few so you can look!
Detected as duplicated:
http://www.spain-internship.com/en
http://www.spain-internship.com/en/home/149-all-en/placement-spainSame here:
http://www.spain-internship.com/fi
http://www.spain-internship.com/fi/etusivu/186-all-fi/home-page-fihttp://www.spain-internship.com/en/internships-in-salamanca
http://www.spain-internship.com/es/internships-in-salamancaFirst one is the original. The rest one have canonical. Still detected as duplicated.
-
Do you have an example of one of these generated pages as well, everything looks fine on the main page.
-
Hey,
Yes, sure.
This is the duplicated from the /en
http://www.spain-internship.com/en/home/149-all-en/placement-spain
Thanks!
-
Do you have a link to one of these pages so we can look at how it is deploying the canonical onto the page.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why is my backlink not detected here, but it appears on google search console? I have added the moz robots
Why is my backlink not detected here, but it appears on google search console? I have added the moz robots
Link Explorer | | gunawanjord0 -
Need help understanding this Moz Chart comparing link metrics against competitors...
Why is my website at 60% and what does it mean? And more importantly, what needs to be done to fix (or not fix) this situation??? Please help... Image of chart attached kcd5r40
Link Explorer | | SamCitron0 -
Strange error in MOZ report
I get the following warning about our domain name in Link Explorer Moz tool You entered the URL debtacademy.com which redirects to www.hugedomains.com/domain_profile.cfm?d=debtacademy&e=com. Click here to analyze www.hugedomains.com/domain_profile.cfm?d=debtacademy&e=com instead. Please advice me. How I can fix it.
Link Explorer | | jeffreyjohnson0 -
MOZ doesn't work for .dating and .chat domain extensions
I have been a MOZ subscriber for a few years now. I don't think MOZ works for .dating and .chat domain extensions. I have 2 sites that have authority 1 despite back links. Here are the details: https://oooo.dating > DA = 1 https://talk.chat > DA = 1 oooo has 221 links (Google search console) talk has 1317 links (Google search console) May be a MOZ staff member can look into this. If you are customer and use some of the newer domain extensions please share your details if you have the same problem.
Link Explorer | | dmcubed1 -
Crawl a node js page - Why can I only see my frontpage?
Hi When i am trying to crawl my website ( https://www.doorot.com/ ) it can only find my frontpage. It's a node js page. Any one had the same problem or know how to crawl my site in order to see all my pages? Kasper
Link Explorer | | KasperClio1 -
Moz Spam Score 9/17 when there are no links
Hi I was just looking at a competitors new site in OSE and it had a DA and PA of 1 which I would expect and also no links, which again I would expect, basically the guy used to design stuff for me and thinks he can do a better job so he is giving it ago. But he has a spam score of 9/17 but no links - how is this even possible. Thanks Andy
Link Explorer | | Andy-Halliday0 -
Duplicate content, despite having correct canonical tags
Hi there We've run a report today which shows we have 157 pages with duplicate content. Out of these 157, 134 have canonical tags so I'm not sure why they're showing up in the reports as being duplicate. Here's an example, this page: https://www.havaianasaustralia.com.au/Accessories?category=beach-towels .....has an canonical of: http://www.havaianasaustralia.com.au/beach-towels-havaianas-accessories But it's still showing as duplicated in your reports. Any support would be appreciated. Thanks.
Link Explorer | | Havs10 -
A few questions regarding Moz tools + E-commerce strategy
Hi everyone 🙂 I'm currently in the midst of optimizing a Scandinavian E-commerce site. I have a few questions, that hopefully someone will be able to help me get answered. Firstly, GoogleBots should be able to recognize "ø" as "oe", "æ" as "ae" and "å" as "aa" in the URL title. I've noticed that Moz' On-page grader does not support this unfortunately - has something changed or do Scandinavians just receive a little less love than the English? Secondly, how does one avoid keyword stuffing on E-commerce sites? The products that are displayed in category pages all make use of the same keyword that is targeted for that category. As such, some pages have 40+ mentions of the keyword, although in reality there are less than 15 (the rest being in the product names). Any tips or tricks on how to get this optimized or does Google simply recognize the site as an E-commerce site and somewhat ignores keyword stuffing - as long as the website has sufficient content? Thirdly, has something happened to Moz' Open Site Explorer? It seems like something has changed and when I checked for backlinks for the site today, only 3 was found. I know for a fact that many many more exist (which other tools also confirm when they scrape the site). Looking forward to hearing from all of you! Best, Mark
Link Explorer | | osn0