What is the best method to solve duplicate page content?
-
The issue I am having is an overwhelmingly large number of pages on cafecartel.com show that they have duplicate page content.
But when I check the errors on SEOmoz it shows that the duplicate content is from www.cafecartel.com not cafecartel.com.
So first of all, does this mean that there are two sites? And is this a problem I can fix easily (i.e., by redirecting the URL and deleting the extra pages)?
Is this going to make all other SEO useless due to the fact that it shows that nearly every page has duplicate page content?
Or am I just completely reading the data wrong?
-
WordPress just has a setting under General Settings for www or non-www.
-
I had the .htaccess redirect, but ccsnews is a WordPress blog. When I had that redirect going, the blog complained of too many redirects. I've seen this happen before, even on SEOmoz.
So I'm using a Joomla redirect plugin. I think WordPress has a redirect plugin too; I just haven't installed it yet.
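One possible explanation for the "too many redirects" complaint, offered as a guess rather than a diagnosis: the .htaccess rule forces www while the blog's own address setting still points at the non-www host, so each one undoes the other. The Python sketch below only simulates that situation; `force_host`, `follow`, and the three rule names are hypothetical helpers, not anything WordPress or Apache provides.

```python
from urllib.parse import urlsplit, urlunsplit


def force_host(url, wanted_host):
    """Return the redirect target if the URL's host differs from wanted_host, else None."""
    parts = urlsplit(url)
    if parts.netloc == wanted_host:
        return None
    return urlunsplit(parts._replace(netloc=wanted_host))


def follow(url, rules, max_hops=10):
    """Apply the first rule that fires, repeatedly; return (final_url, hops, looped)."""
    hops = 0
    while hops < max_hops:
        target = next((t for t in (rule(url) for rule in rules) if t), None)
        if target is None:
            return url, hops, False
        url = target
        hops += 1
    # Gave up after max_hops, which is how a browser reports "too many redirects".
    return url, hops, True


# Assumed setup: .htaccess forces www; the blog setting is either non-www or www.
htaccess = lambda u: force_host(u, "www.cafecartel.com")
blog_non_www = lambda u: force_host(u, "cafecartel.com")
blog_www = lambda u: force_host(u, "www.cafecartel.com")

start = "http://cafecartel.com/ccsnews/"
conflicting = follow(start, [htaccess, blog_non_www])  # the two rules fight each other
aligned = follow(start, [htaccess, blog_www])          # both rules agree on www
print(conflicting[2], aligned)
```

If that is what happened, pointing the blog's address setting at the same host as the .htaccess rule should end the loop without needing a separate redirect plugin.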
-
The internal crawl report from SEOmoz is based on your internal links, not external inbound links. So if there are any errors, it is in your site.
At a quick glance, I see that you have set up the 301 to www, but if you click into the blog (news), you aren't on the www version anymore: http://cafecartel.com/ccsnews/ (if it's WordPress, then it's just a simple settings change).
Run a crawl test on it (http://pro.seomoz.org/tools/crawl-test) and keep on plugging away and fixing every issue until there are no more.
And make sure you use rel=canonical tags. This will help out with the duplicate content as well. http://www.seomoz.org/learn-seo/canonicalization
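To make the canonical idea concrete, here is a rough Python sketch that groups crawled URLs by the address they should resolve to and builds the matching rel=canonical tag. It assumes www.cafecartel.com is the preferred host; `canonical_url`, `canonical_tag`, `duplicate_groups`, and the sample `crawled` list are made up for illustration and are not taken from the crawl report.

```python
from collections import defaultdict
from html import escape
from urllib.parse import urlsplit, urlunsplit

PREFERRED_HOST = "www.cafecartel.com"  # assumption: www is the version to keep
KNOWN_HOSTS = {"cafecartel.com", "www.cafecartel.com"}


def canonical_url(url):
    """Map either host variant of a page to the preferred host, dropping any fragment."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if host in KNOWN_HOSTS:
        host = PREFERRED_HOST
    return urlunsplit((parts.scheme, host, parts.path or "/", parts.query, ""))


def canonical_tag(url):
    """Build the <link> element that would go in the page's <head>."""
    return '<link rel="canonical" href="%s" />' % escape(canonical_url(url), quote=True)


def duplicate_groups(urls):
    """Return only the canonical URLs that are reachable under more than one address."""
    groups = defaultdict(set)
    for u in urls:
        groups[canonical_url(u)].add(u)
    return {c: sorted(v) for c, v in groups.items() if len(v) > 1}


# Hypothetical crawl output.
crawled = [
    "http://cafecartel.com/ccsnews/",
    "http://www.cafecartel.com/ccsnews/",
    "http://www.cafecartel.com/",
]
print(duplicate_groups(crawled))
print(canonical_tag("http://cafecartel.com/ccsnews/"))
```

Every group this prints is one page that the crawler can reach under two addresses, which is the pattern a www/non-www duplicate report tends to show.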
-
Thank you Brent, and Mark...
So taking your advice this is what happened...
At the tail end of last week, we implemented a 301 redirect to www.cafecartel.com by adjusting the .htaccess file. It worked as far as always landing on www.cafecartel.com, BUT the errors didn't change after the crawl.
I fear that, because links to both cafecartel.com and www.cafecartel.com exist, each page may need to be redirected manually.
The pages that are showing the highest errors are the blog article pages, quote request pages, and the free download pages. These same pages have links going between pages on www.cafecartel.com and other blog sites, which we did as an organic SEO tactic. Is this possibly something that is causing errors?
Thank you all for your advice!
-
You need to set up canonicalization for your site so that you don't have the duplicates. SEOmoz has a great article here: http://www.seomoz.org/learn-seo/canonicalization
Since you are hosted on an Apache server, you will need to modify the .htaccess file in your root directory to take care of these.
Make sure you also set up the www or non-www preference in Google Webmaster Tools (GWT).
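Once the .htaccess change is in place, it may be worth confirming that inner pages, and not only the home page, answer with a single 301 to the same path on the preferred host. The Python sketch below checks one observed response; `check_redirect` is a hypothetical helper, www.cafecartel.com is assumed to be the preferred host, and the status codes and Location values in the example are invented, so substitute what your own header checker reports.

```python
from urllib.parse import urlsplit


def check_redirect(requested, status, location, preferred_host="www.cafecartel.com"):
    """Return a list of problems with one observed redirect response (empty list = OK)."""
    problems = []
    if status != 301:
        problems.append("expected 301, got %s" % status)
    req = urlsplit(requested)
    loc = urlsplit(location or "")
    if loc.netloc != preferred_host:
        problems.append("redirects to host %r" % loc.netloc)
    # A host-level rule should keep the path and query string intact.
    if (loc.path or "/") != (req.path or "/") or loc.query != req.query:
        problems.append("path or query changed")
    return problems


# Invented observations: (requested URL, status code, Location header).
good = check_redirect("http://cafecartel.com/ccsnews/", 301,
                      "http://www.cafecartel.com/ccsnews/")
bad = check_redirect("http://cafecartel.com/ccsnews/", 302,
                     "http://www.cafecartel.com/")
print(good)
print(bad)
```

An empty list means the page redirects the way a host-level rule should; anything else names what to look at in the rule.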
-
You are reading the data correctly. You should be redirecting the pages to cafecartel.com/.... This will eliminate the duplicate content issues. You also might be able to see the issue in the sitemap: if the website was converted from another website, then the old pages might still be attached.
Another option, less SEO-favorable but one that will also eliminate the duplicate content, is figuring out where the pages are and then installing robots nofollows on them.
This will help your SEO, not hurt it. You are being penalized for the duplicate content.
Hope this helps.