Multiple Instances of the Same Article
-
Hi, I'm having a problem with duplicate article listings that I cannot solve.
As you will see from the attached images, I have a page with multiple variants of the same URL in the Google index, as well as duplicate title tags reported in the Search Console of Webmaster Tools. For several months I have been using canonical link tags to resolve the issue, i.e. declaring that all variants point to a single URL, but the problem remains. It's not just old articles that stay like that; even new articles show the same behaviour as soon as they are published, even though they are presented correctly with canonical links and sitemap entries, as you will see from the example below.
Example URLs of the attached Image
-
All URLs belonging to the same article ID have the same canonical link inside the HTML head.
-
Because I have a separate mobile site, I also include in every desktop URL an "alternate" link to the mobile site.
-
On the mobile version of the site, I have another canonical link pointing back to the original desktop URL. So the mobile site article version also has
-
Now, when it comes to the xml sitemap, I pass only the canonical URL and none of the other possible variants (to avoid multiple indexing), and I also point to the mobile version of the article.
<url>
  <loc>http://www.neakriti.gr/?page=newsdetail&amp;DocID=1300357</loc>
  <xhtml:link rel="alternate" media="only screen and (max-width: 640px)" href="http://mobile.neakriti.gr/fullarticle.php?docid=1300357" />
  <lastmod>2016-02-20T21:44:05Z</lastmod>
  <priority>0.6</priority>
  <changefreq>monthly</changefreq>
  <image:image>
    <image:loc>http://www.neakriti.gr/NewsASSET/neakriti-news-image.aspx?Doc=1300297</image:loc>
    <image:title>ΟΦΗ</image:title>
  </image:image>
</url>
Source of the above sitemap snippet: http://www.neakriti.gr/WebServices/sitemap.aspx?&year=2016&month=2
The main sitemap of the website: http://www.neakriti.gr/WebServices/sitemap-index.aspx
Despite my efforts, you can see that Webmaster Tools reports three variants of the desktop URL, and Google search reports 4 URLs (3 different desktop variant URLs and the mobile URL).
I get this when I search for the article code to see what is indexed in Google: site:neakriti.gr 1300297
So far I believe I have done all I could to resolve the issue by setting canonical links and alternate links, as well as a correct sitemap.xml entry. I don't know what else to do... This was done several months ago and there has been absolutely no improvement.
Here is a more recent example of an article added 5 days ago (10-April-2016). Just type
site:neakriti.gr 1300357
into Google search and you will see the variants of the same article in the Google cache. Open a cached page and you will see that it contains the canonical link, but Google doesn't obey the direction given there.
Please help!
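One way to double-check the setup described above is to confirm that every indexed variant really declares the same canonical URL. The following is a minimal Python sketch, not the site's actual tooling: the helper names `canonical_of` and `inconsistent_variants` are hypothetical, and the HTML is passed in as strings so the example stays self-contained (downloading the pages is left out).

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collects the href of every <link rel="canonical"> tag in a page."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        # rel may hold several space-separated tokens, in any letter case
        rels = (attr.get("rel") or "").lower().split()
        if "canonical" in rels and attr.get("href"):
            self.canonicals.append(attr["href"])


def canonical_of(html):
    """Return the list of canonical URLs declared in the given HTML."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonicals


def inconsistent_variants(pages, expected):
    """Return the variant URLs whose HTML does not declare exactly `expected`.

    `pages` maps each variant URL to its HTML source (hypothetical input shape).
    """
    return [url for url, html in pages.items() if canonical_of(html) != [expected]]
```

An empty list from `inconsistent_variants` means every variant points at the same canonical URL. A page with no canonical tag, or with more than one, is reported as inconsistent.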
-
-
Hi all,
sorry for the delay; I am away on a business trip, which is why I stopped communicating for the past few days.
I can confirm that the latest entries (those after March) come up as a single instance.
However, there are some minor exceptions like the one here.
Example of a recent article indexed under both the desktop URL (even though that desktop URL is not the canonical) and the mobile URL:
https://www.google.gr/search?q=site:neakriti.gr&biw=1527&bih=899&source=lnms&sa=X&ved=0ahUKEwiIxODGt5_MAhUsKpoKHdcUAkYQ_AUIBigA&dpr=1.1#q=site:neakriti.gr+1315539&tbs=qdr:w&filter=0
Also, I noticed that with the "alternate" and "canonical" links, the mobile version of the site doesn't get indexed anymore (with minor exceptions like the one above).
-
Hi Ioannis!
How's this going? We'd love an update.
-
Hmm, interestingly, when I followed your link, I only saw the canonical version of the article. Is this what you're seeing now?
Also, in response to your earlier question, yes, you can disallow parameters with robots.txt. If these canonical issues continue, that may be the best next step.
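On the robots.txt route mentioned above: Google's robots.txt matching does not accept full regular expressions, only the `*` wildcard and a trailing `$` end anchor. The sketch below is a simplified Python illustration of how such a pattern would match the path and query of a URL. The rule `/*page=printerfriendly` is an assumed example built from the parameter value named in this thread, the function names are hypothetical, and Allow/Disallow precedence is deliberately left out.

```python
import re


def robots_pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' wildcard, optional trailing '$')
    into a compiled regular expression anchored at the start of the path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and let each '*' match any run of characters
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_blocked(path_and_query, disallow_patterns):
    """True if any Disallow pattern matches the URL's path plus query string."""
    return any(
        robots_pattern_to_regex(p).match(path_and_query) for p in disallow_patterns
    )


# Assumed rule, equivalent to "Disallow: /*page=printerfriendly" in robots.txt
DISALLOW = ["/*page=printerfriendly"]

print(is_blocked("/?page=printerfriendly&DocID=1300357", DISALLOW))
print(is_blocked("/?page=newsdetail&DocID=1300357", DISALLOW))
```

Keep in mind that a URL blocked in robots.txt is not crawled, so Google can no longer read a canonical link or noindex meta tag placed on that page.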
-
Thank you for your response, I will take a look at this.
However, I have a few questions regarding your suggestion:
- Since I have canonical links at the loading page, doesn't that resolve the issue?
- The printerfriendly variation has a noindex meta tag in the head; shouldn't that be taken into account?
- Can I put regular expressions in my robots.txt? How can I block URL params? I ask because printerfriendly and newsdetailsports are values of the "page" GET param.
In fact, the printerfriendly page contains a canonical link and a noindex meta tag to tell search engines not to index the content and to let them know where the original content lives.
-
Hi there
The printer friendly URL is coming from the print this article button (attached) and the /default.aspx URL is coming from the ^ TOP button (attached).
What you could do is use your robots.txt to ignore these URLs. You can also tell Google which URL parameters to ignore, but please be EXTREMELY careful doing this. It's not a fine comb tool, not a hatchet.
Let me know if you have any questions or comments, good luck!
Patrick