Duplicate Content
-
Hello All, my first web crawl has come back with a duplicate content warning for
and
slightly mystified!
thanks
paul
-
If you're still in contact with a web developer, that would be great. If you're not, a note to everyone else on this thread: the website in question is running IIS 6.0, so Apache-specific advice isn't going to help in this case.
-
Just a 301 from /index to /
-
Hi Cesar,
There is no drawback. For the root, www.simodal.com and www.simodal.com/ end up as the same request, because an empty path is always sent as "/". Below the root it is different: www.simodal.com/randompage and www.simodal.com/randompage/ are distinct URLs. Most people would consider /randompage a page and /randompage/ a directory. So from an SEO perspective .com and .com/ are equally good.
What you should do is decide whether you want to use a trailing slash or not and stick to it. If you decide not to use / on your site's root page, apply that choice consistently everywhere.
Generally speaking, there are three common schemes: .html for pages and / for directories; no suffix for pages (domain.tld/page) and / for directories; or / for all pages and directories (WordPress uses this, AFAIK). It doesn't really matter much; pick one and stick to it.
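To make "pick one and stick to it" checkable, here is a minimal Python sketch that flags internal links which break a chosen trailing-slash convention. The function names and the rule that anything with a file extension counts as a page are assumptions made for illustration, not something from this thread.

```python
import posixpath


def breaks_convention(path, trailing_slash=True):
    """True if a site-relative path does not follow the chosen slash convention."""
    if path == "/":
        return False  # the root is always requested as "/"
    last_segment = path.rstrip("/").rsplit("/", 1)[-1]
    has_extension = bool(posixpath.splitext(last_segment)[1])
    if has_extension:
        # Assumption: file-style URLs such as /page.html never end with a slash.
        return path.endswith("/")
    return path.endswith("/") != trailing_slash


def inconsistent_links(paths, trailing_slash=True):
    """Return the paths that should be changed to match the convention."""
    return [p for p in paths if breaks_convention(p, trailing_slash)]
```

Run over the internal links from a crawl, it returns the ones to change, for example inconsistent_links(["/randompage", "/randompage/"], trailing_slash=True) reports only the slash-less one.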
-
What is the drawback of the 301 redirect to the version without the "/"?
-
Hi Paul
I can fully identify with your frustrations - been there!
A simple question may help you: did you have a web developer, and are you still in touch with him/her? If so, get them to do a 301 redirect from www.simodal.com/index.htm to your chosen version. Most seem to go with www.simodal.com/ (with a trailing forward slash at the end). Someone else might like to comment on that.
Also, as Aaron says, do the same for the version without the www, i.e. http://simodal.com/, and 301 it to exactly the same URL as above.
If you haven't got a developer, there is some info around that tells you exactly how to do it.
Hope this helps
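
In case it helps to see the redirect rules spelled out, here is a rough Python sketch of the decision only. On IIS 6.0 the redirect itself would be set up on the server, so this is not the implementation. It assumes www.simodal.com with a trailing slash is the chosen version and that index.htm is the only default document; redirect_target is a made-up name.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumptions for illustration: www is the chosen host and index.htm is
# the only default document the server hands out for a directory.
CANONICAL_HOST = "www.simodal.com"
BARE_HOST = "simodal.com"
DEFAULT_DOCUMENTS = ("index.htm",)


def redirect_target(url):
    """Return the URL to 301 to, or None if the URL is already the chosen version."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    path = parts.path or "/"

    # Fold the non-www domain into the www host.
    if host == BARE_HOST:
        host = CANONICAL_HOST

    # Fold /index.htm (in any directory) into the directory URL.
    head, _, last = path.rpartition("/")
    if last.lower() in DEFAULT_DOCUMENTS:
        path = head + "/"

    target = urlunsplit((parts.scheme, host, path, parts.query, parts.fragment))
    return None if target == url else target
```

Both http://simodal.com/ and http://www.simodal.com/index.htm come out as http://www.simodal.com/, while the chosen URL itself returns None, so there is no redirect loop.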
-
Hey Paul,
here is the explanation:
www.simodal.com and www.simodal.com/index.htm are considered separate pages by Google, although both are your site's "starting point". Some Content Management Systems (CMS) make this mistake, i.e. delivering the same page and not distinguishing between simodal.com/ and simodal.com/index.htm.
As said before, you should decide whether all your pages should be on www.simodal.com or just simodal.com. There is a great Whiteboard Friday video by Rand on this topic. Then you should rewrite your URLs to whichever version you chose.
Additionally, you might want to add a rel canonical to your page, maybe just to your starting page. A
<link rel="canonical" href="http://www.simodal.com/" />
on your starting page would tell Google to ignore the /index.htm and use /
But watch out, rel canonical is somewhat tricky... but there are good tutorials here.
To be honest, I know quite a lot of sites that make this mistake. Google should be able to correct for it, so don't worry about rankings. You should, however, set up the www redirect (or the opposite), as leaving both versions live can trigger Google's duplicate content (DC) filter. Also, if you plan to use SSL (https://), make sure those pages are not indexed as duplicates either, best by using rel canonical.
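Since rel canonical is easy to get wrong, a quick way to check what a page actually declares is to parse its HTML. This is only a sketch using Python's standard html.parser; the class and function names are made up for illustration, and fetching the page is left out.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collects the href of every <link rel="canonical"> in a page."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <link ... />.
        if tag != "link":
            return
        attr = dict(attrs)
        rel_values = (attr.get("rel") or "").lower().split()
        if "canonical" in rel_values and attr.get("href"):
            self.canonicals.append(attr["href"])


def find_canonicals(html):
    """Return all canonical URLs declared in the given HTML string."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonicals
```

A page should come back with exactly one entry. An empty list means the tag is missing, and more than one entry means the page declares conflicting canonicals, which is worth fixing.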
-
Hello Paul!
Because the URLs are different, crawlers see them as different pages, but as you know, they're not! They're just two ways to get to the same place!
To solve this, you have to redirect the /index page to the non-/index version using a 301 redirect.
Tutorial here: http://www.tamingthebeast.net/articles3/spiders-301-redirect.htm
Got it?
Hope it helps! =]
-
Thanks Aaron, this is very new to me and you will have to forgive my DOH! moments.
I still don't get it. Can you point me in a direction that would help me understand?
best
paul
-
It is indeed duplicate content! You might want to consider doing a redirect. I also noticed that you haven't set up a redirect from the non-www domain either!