Duplicate Content
-
Hello All, my first web crawl has come back with a duplicate content warning for
and
slightly mystified!
thanks
paul
-
If you're still in contact with a web developer, that would be great. If you're not, a note to everyone else on this thread that the website in question is using IIS 6.0, so apache info isn't going to help in this case.
-
Just a 301 from 7index to /
-
Hi Cesar,
there is no drawback. technically www.simodal.com and www.simodal.com/ are different pages just like www.simodal.com/randompage and www.simodal.com/randompage/ would be considered different. Most people would consider /randompage a page and /randompage/ a directory. But from a SEO perspective .com and .com/ are equally good.
What you should do is to decide whether you want to use a trailing slash or not and stick to it. if you dedide not to use / on your sites root page use it consistently everywhere.
Generally speaking there are 3 often seen ways: use .html for pages and / for directorys vs. no suffix for pages (domain.tld/page ) and / for directorys vs. / for all pages and directories (wordpress uses / AFAIK). It doesnt realy matter much, take one and stick to it.
-
which is the drawback of the 301 redirect without the "/"?
-
Hi Paul
I can fully identify with your frustrations - been there!
A simple question may help you. Did you have a web developer, and are you still in relationship with him/her. If so, get them to do a 301 redirect from the www.simodal.com/index.htm to your chosen version. Most seem to do www.simodal.com/ - but with a trailing forward slash at the end. Someone else might like to comment on that.
Also as Aaron says also do it for the version without the www's ie: http://simodal.com/ and do a 301 to exactly the same URL as the above.
If you haven't got a developer there is some info around telling you exactly how to do it.
Hope this helps
-
Hey Paul,
here is the explanation:
www.simodal.com and www.simodal.com/index.htm are considered separate pages by google, although both are your sites "starting point". Some Content Management Systems (CMS) make thiis mistake, i.e. delivering the same page and not distinguishing between simodal.com/ and simodal.com/index.htm.
As said before, you should decide whether all your pages should be www.simodal oder just simodal.com. There is a great Whiteboard-Friday Video by Rand on this toppic. Then you should rewrite your URLs to either version.
Additionally you might want to add a rel canonical to your page, maybe just to your starting page. a
<link rel="canonical" href="http://www.simodal.com/" />
on your starting page would tell google to ignore the /index.htm and use /
But watch out, rel canonical is somewhat tricky...but there are good tutorials here.
To be honest: I know quiet a lot of pages, that make this mistake. Google should be able to correct this, so dont qorry about rankings. You should however do the redirect www. (or the opposite) as this will trigger googles DC filter. Also: if you plan to use SSL (https:// ) make sure that these pages are also not indexed, best by using rel canonical.
-
Hello Paul!
Because the URL is different, the crawlers look them as different pages, but as you know, they're not! It's just two ways to get there!
To solve this, you have to redirect the /index page to the non-/index, using the 301 redirection code.
Tutorial here: http://www.tamingthebeast.net/articles3/spiders-301-redirect.htm
Got it?
Hope it helps! =]
-
ThanksAaron, this is very new to me and you will have to forgive my DOH! moments.
Still don't get it. Can you point me in any direction so I can understand.
best
paul
-
It is indeed duplicate content! You might want to consider doing a redirect. I also noticed that you haven't done a redirect from the non www. domain either!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do you think my client is being hit for duplicate content?
Wordpress website. The client's website is http://www.denenapoints.com/ The URL that we purchase so that we could setup the hosting account is http://houston-injury-lawyers.com, which shows 1 page indexed in Google when I search for site:http://houston-injury-lawyers.com On http://www.denenapoints.com/ there is <link rel="<a class="attribute-value">canonical</a>" href="http://houston-injury-lawyers.com/"> But on http://houston-injury-lawyers.com it says the same thing, <link rel="<a class="attribute-value">canonical</a>" href="http://houston-injury-lawyers.com/" /> Is this how it should be setup, assuming that we want everything to point to http://denenapoints.com/? Maybe we should do a 301 redirect to be 100% Sure? Hopefully I explained this well enough. Please let me know if anyone has any thoughts, thanks!
Technical SEO | | georgetsn0 -
Duplicate Content Question
I have a client that operates a local service-based business. They are thinking of expanding that business to another geographic area (a drive several hours away in an affluent summer vacation area). The name of the existing business contains the name of the city, so it would not be well-suited to market 'City X' business in 'City Y'. My initial thought was to (for the most part) 'duplicate' the existing site onto a new site (brand new root domain). Much of the content would be the exact same. We could re-word some things so there aren't entire lengthy paragraphs of identical info, but it seems pointless to completely reinvent the wheel. We'll get as creative as possible, but certain things just wouldn't change. This seems like the most pragmatic thing to do given their goals, but I'm worried about duplicate content. It doesn't feel as though this is spammy though, so I'm not sure if there's cause for concern.
Technical SEO | | stevefidelity0 -
Duplicate Content
We have a ton of duplicate content/title errors on our reports, many of them showing errors of: http://www.mysite.com/(page title) and http://mysite.com/(page title) Our site has been set up so that mysite.com 301 redirects to www.mysite.com (we did this a couple years ago). Is it possible that I set up my campaign the wrong way in SEOMoz? I'm thinking it must be a user error when I set up the campaign since we already have the 301 Redirect. Any advice is appreciated!
Technical SEO | | Ditigal_Taylor0 -
Duplicate content issue with trailing / ?
Hi ,I did a SEOmoz Crawl Test and found most pages show twice, for example: A: www.website.com/index.php/dog/walk B: www.website.com/index.php/dog/walk/ I've checked Google Analytics and 90% of organic search traffic arrives on the URLs with the trailing slash (B). Question 1: Can I assume I've a duplicate content problem? Question 2: Is it best to do 301 redirects from the 'non trailing slash' pages to the 'trailing slash pages'? Question 3: For some reason every web page has a '/index.php' in it (see A&B) above. No idea why. Should it be a SEO concern? Kind regards and thank you in advance Nigel
Technical SEO | | Richard5550 -
Url rewrites / shortcuts - Are they considered duplicate content?
When creating a url rewrite or shortcut, does this create duplicate content issues? split your rankings / authority with google/search engines? Scenario 1 wwwlwhatthehellisahoneybooboo.com/dqotd/ -> www.whatthehellisahoneybooboo.com/08/12/2012/deep-questions-of-the-day.html Scenario 2 bitly.com/hbb -> www.whatthehellisahoneybooboo.com/08/12/2012/deep-questions-of-the-day.html (or to make it more compicated...directs to the above mentioned scenario 1 url rewrite) www.whatthehellisahoneybooboo.com/dqotd/ *note well- there's no server side access so mentions of optimizing .htacess are useless in this situation. To be clear, I'm only referring to rewrites, not redirects...just trying to understand the implications of rewrites. Thanks!
Technical SEO | | seosquared0 -
Pages with different content and meta description marked as duplicate content
I am running into an issue where I have pages with completely different body and meta description but they are still being marked as having the same content (Duplicate Page Content error). What am I missing here? Examples: http://www.wallstreetoasis.com/forums/what-to-expect-in-the-summer-internship
Technical SEO | | WallStreetOasis.com
and
http://www.wallstreetoasis.com/blog/something-ventured http://www.wallstreetoasis.com/forums/im-in-the-long-run
and
http://www.wallstreetoasis.com/image/jhjpeg0 -
Noindex duplicate content penalty?
We know that google now gives a penalty to a whole duplicate if it finds content it doesn't like or is duplicate content, but has anyone experienced a penalty from having duplicate content on their site which they have added noindex to? Would google still apply the penalty to the overall quality of the site even though they have been told to basically ignore the duplicate bit. Reason for asking is that I am looking to add a forum to one of my websites and no one likes a new forum. I have a script which can populate it with thousands of questions and answers pulled direct from Yahoo Answers. Obviously the forum wil be 100% duplicate content but I do not want it to rank for anyway anyway so if I noindex the forum pages hopefully it will not damage the rest of the site. In time, as the forum grows, all the duplicate posts will be deleted but it's hard to get people to use an empty forum so need to 'trick' them into thinking the section is very busy.
Technical SEO | | Grumpy_Carl0 -
Duplicate Content issue
I have been asked to review an old website to an identify opportunities for increasing search engine traffic. Whilst reviewing the site I came across a strange loop. On each page there is a link to printer friendly version: http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes That page also has a link to a printer friendly version http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes&printfriendly=yes and so on and so on....... Some of these pages are being included in Google's index. I appreciate that this can't be a good thing, however, I am not 100% sure as to the extent to which it is a bad thing and the priority that should be given to getting it sorted. Just wandering what views people have on the issues this may cause?
Technical SEO | | CPLDistribution0