Duplicate page content
-
Hi guys the feedback form my campaign suggests I have to much duplicate page content. I’ve had a look at the CSV file but it doesn’t seem to be abundantly clear as to which pages on my site have the duplicate content. Can anyone tell which columns I need to refer to on the sheet, to ascertain this information.
Also if the content is only slightly different, will Google still consider it to be duplicate? I look forward to hearing from you
-
When you download your crawl diagnostics as a csv, column A is "URL", column L is the true/flase column for "Duplicate Page Content", and column AF "duplicate_page_content" contains the urls of duplicates to the url in column A.
To look at duplicate content, I sort by column L, delete all of the false rows (because they don't have duplicate content), then I delete all of the columns except column A (URL) and column AF (duplicate_page_content), save the spreadsheet as "yyyymmdd-duplicate-content" and work from that. (Easier to see what you are doing without all the other data in the way.)
Also note that column AF "duplicate_page_content" can have more than one url in it if you have multiple versions of the same content. In this case I use Excel's "Text to Columns" function (under "Data" in the ribbon) to put each url into its own column so I can deal with them individually.
And yes, if there are just small differences Google is likely to see pages as duplicates.
-
I really don't know which columns in the CSV, but you can see those pages in the campaigns page.
Anyway, if the content is slightly different, you could consider using noindex on those pages that could generate conflicts. For example, that happens quite often on a blog, duplicate content reports on tag/category pages, in those cases it would be considered a good practice to noindex tag pages.
Just my 2 cents
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How can a page have high Google/ organic traffic but show no ranking keywords in Moz?
We have a page on our website with a higher than average number of pageviews, 85% of which came from Google organic search. When I research this page by entering the URL into the "exact page" keyword research tool, Moz says it has no ranking keywords. How can a page be earning organic traffic without ranking for any keywords?
Moz Bar | | baystatemarketing0 -
Duplicate Page Content
The site crawl is registering duplicate page content for our storefront site, but the pages aren't the same. They're ascending pages within the same category (ex: Featured, Featured pg2, Featured pg3, and so on). What can be done to fix these errors or prevent them in the future?
Moz Bar | | MGuid550 -
Too Many On-Page Links Notice
When calculating the number of links on a page, are navigation links included in the total? I have all of my navigation links within the <nav>element. I would think that there are a lot of sites out there that easily exceed the 100 link recommendation if you add up nav and footer links. </nav>
Moz Bar | | Brando160 -
605 : Page banned by robots.txt
Hello everyone, I need experts help here, Please suggest, I am receiving crawl errors for my site that is , X-Robots-Tag: header, or tag. my robots.txt file is: User-agent: * Disallow:
Moz Bar | | bhomes0 -
What is Considered Duplicate Content by Crawlers?
I am asking this because I have a couple of site audit tools that I use to crawl a site I work on every week and they are showing duplicate content issues (which I know there is a lot on this site) but some of what is flagged as duplicate content makes no sense. For example, the following URL's were grouped together as duplicate content: | https://www.firefold.com/contact-us | https://www.firefold.com/gabe | https://www.firefold.com/sale | | | How are these pages duplicate content? I am confused on what site audit tools are considering duplicate content. Just FYI, this is data from Moz crawl diagnostics but SEMrush site auditor is giving me the same type of data. Any help would be greatly appreciated. Ryan
Moz Bar | | RyanRhodes0 -
Duplicate content reported for totally different pages
Hi, The Moz report is showing just over 21,500 duplicate page issues on our site. This is more or less every page we have. However when I look at the pages it says are duplicates they are totally different (it could for example report that a news page for 2009 is the same as a product page just added which has no relation when you read the content or view the page). What sort of thing could it be picking up as duplicate content? I assume it must be something in the HTML for the site rather than the actual page content as there is no cross over at all on the pages highlighted. The only issue I can currently identify is that the menu for the mobile version of the site has a huge number of internal links which I will cut down. If the tools purely look at HTML content this could be seen as duplicate but shouldn't it be clever enough to realise what is content and what is site structure? Thanks,
Moz Bar | | TW-Steve0 -
How does an index page have a higher Authority than the root domain?
So just curious, but on my domain, http://www.bulwarkpestcontrol.com the Page Authority is 59 and the root domain authority 52. That seems odd as it is the same page. Explanation?
Moz Bar | | Thos0030 -
Probs with campaign overview page ??
Hi As of today my overview pages are devoid of formatting and just text and basic display, anyone else getting this ? Cheers Dan
Moz Bar | | Dan-Lawrence1