Best way to address duplicate news sections within site
-
A client has a news section at www.clientsite.com/news and also at subdomain.clientsite.com/news. The stories within each section are identical:
www.clientsite.com/news/story-11-5-2011
subdomain.clientsite.com/news/story-11-5-2011
What's the best way to avoid a duplicate content issue within the site? A 301 redirect doesn't seem appropriate from the user experience point of view.
Is applying a rel=canonical <www.clientsite.com news="" story-a-b-c="">to each story within the subdomain news section the best option? They have 100's of stories, wondering if there might be an easier way?</www.clientsite.com>
Also, the news pages list the story headline and the first 3 lines of copy. Do these summaries present duplicate content issues with the full story page?
Thank you!
-
Alan, I appreciate your effort here. These are the sources I already shared
A complete summary of everything shared in those articles you quote:
1. It doesn't make a difference to google which method is used. When I examine all the information and analysis, it seems to indicate Google will index the content either way. How well that content will rank in Google is a different topic. There are reasons to keep content separate, such as when discussing topics unrelated to the main site, in which case a subdomain would be best.
2. Matt uses the directory approach, and he recommends for others to do the same.
AT BEST you can get that it is close to even with a slighter preference towards subfolders based on that information.
The Rand offers outstanding analysis as to why subfolders are the superior choice. Rand's analysis is in 2009, 2 years after the original articles quoted from Matt. http://www.seomoz.org/blog/understanding-root-domains-subdomains-vs-subfolders-microsites
The bottom line, it's up to you how much you care about your site and it's performance. Personally, I am a fighter. I also micro-manage website architecture because in many aspects, it is a one-time set it and forget it type of thing. Whether to use subdirectories vs subfolders, whether to use underscores in URLs vs dashes, etc. are things you do one time and then it is automated forever.
A detailed list of reasons supporting the subfolder approach has been offered. The DA, time, costs, etc. all support subfolders. If you wish to ignore all those strong, positive benefits and go with a subdomain then that is your choice.
Good luck.
-
The originals
http://googlewebmastercentral.blogspot.com/2008_01_01_archive.htmlhttp://www.mattcutts.com/blog/subdomains-and-subdirectories/
here is a better example from Matt
Deb December 11, 2007 at 1:01 am
<dd class="comment odd alt thread-odd thread-alt depth-1">
Matt thanks for your reply, just a query (if you don’t mind) if I add content in mattcutts.com/blog – it effect in seo because I add directly content in the domain mattcutts.com but if I add content in blog.mattcutts.com is the effect is same? I don’t think so – because this is a subdomain not directly related with the domain?
If I disturb you please don’t mindThanks
Deb</dd>
<dd class="comment odd alt thread-odd thread-alt depth-1">Matt Cutts December 10, 2007 at 10:55 am</dd>
<dd class="comment byuser comment-author-matt-cutts bypostauthor odd alt thread-odd thread-alt depth-1">
Deb, it really is a pretty personal choice. For something small like a blog, it probably won’t matter terribly much. I used a subdirectory because it’s easier to manage everything in one file storage space for me. However, if you think that someday you might want to use a hosted blog service to power your blog, then you might want to go with blog.example.com just because you could set up a CNAME or DNS alias so that blog.example.com pointed to your hosted blog service.
</dd>
I was trying to find video matt made where he makes a simular claim. but i have to get back to work
-
Alan,
We will have to agree to disagree on this one.
There is a ton of what can only be referred to as "SEO bullshit" published. When I quote a source it will usually be Matt Cutts directly, or Google, or a highly respected SEO who shares an opinion on a topic AND who offers very solid research to back up that opinion. In short, credibility is everything when quoting a source to support a given position.
You are quoting a site I have never heard of, alexander.holbreich.org. Is it just me? Do others know and recognize this site as a reputable source of SEO information?
The author's About page is a total of 4 lines of text. Line 1 = his name, Line 3 & 4 is where he lives. Line 2 = he has a degree in "Business Information" but doesn't even state where or when he received this degree. This web page is a solid example of a page that has absolutely zero trust on SEO.
I think it is great that you read various sources of SEO for ideas, but that is a big difference from depending on those sources as credible information.
If you want to quote, try the main source article. Doing such would add higher credibility to your position. I can agree there is a lot of confusion on this topic, but it is propagated mostly by pages like the one you linked which should probably never be read.
Using the source you quoted and some common ground I would share the following:
-
Matt Cutts stated he uses folders "My personal preference on subdomains vs. subdirectories is that I usually prefer the convenience of subdirectories for most of my content. A subdomain can be useful to separate out content that is completely different."
-
Matt Cutts recommended for others to use folders "If you’re a newer webmaster or SEO, I’d recommend using subdirectories until you start to feel pretty confident with the architecture of your site."
-
Matt shared a specific example of when a subdirectory would be appropriate, and it is an example I had shared as well in response to the original question "A subdomain can be useful to separate out content that is completely different. Google uses subdomains for distinct products such news.google.com or maps.google.com, for example."
The above aside, one site is easier to maintain then two. There are lower costs all around (software, trust badges, SSL, etc). There is less time involved as well. All that time and money can be put into other aspects of SEO such as link building and creating great content.
Further, by combining your content into one site, all your content benefits from the higher DA of your site.
I hope you take the information I am sharing the right way Alan. My professional experience leads me to almost always use a folder unless there is a clear and specific reason to use a subdomain such as trying to separate out content which is not related to the main site. The difference is strong enough to where I would recommend for most clients who have a subdomain to delete it and move to the subfolder structure.
If you find a differing opinion, I would love to hear it. All I ask is for it to be from a highly credible SEO source who preferably shares detailed examples or logic to support the position.
Best Regards,
-
-
"With respect to the general subfolder vs domain discussion, as far as I have seen most of the "debate" ended with subfolders being the winner."
For what reasons is it the winner? I use subdomains a lot, thats why I have looked for evidence, and Matt Cutts has stated it makes no difference.
Rand states, it is his personal belief, but google and Matt Cutts have stated many times it makes no difference to rankings
http://alexander.holbreich.org/2008/01/subdomains-vs-subdirectories/" otherwise irrelevant change during this discussion only serves to confuse an otherwise muddy topic"
I dont think its confusion, it is information clearly stated (not to do with rankings) for one to consider. it is an indication of googles thinking. It is stated correcly and all informmation should be considered. One could say that stating rands personal belief is confusing.
-
I take a different view on this topic then Alan.
As Alan mentioned, the recent Google change sole effect is how links to sub-domains from the root domain visually appear in Google WMT. They have absolutely no ranking weight difference. Bringing up that otherwise irrelevant change during this discussion only serves to confuse an otherwise muddy topic.
With respect to the general subfolder vs domain discussion, as far as I have seen most of the "debate" ended with subfolders being the winner.
There are a couple situations where a subdomain would be preferable to a folder. One example is when a different, unrelated topic or product is being offered. Keith, you brought up the example of Google Maps. A few comments I would share:
-
Google Maps is a different product then Google search. Really the main thing they have is they are being offered by the same company. The idea of providing satellite images and driving directions is really quite different then providing the best search results. These two products happen to be offered by the same company but if you think about it, they are really very distinct products. It would be the same idea if Ford created their own version of Sirius radio. Yes, the radios would be offered in Ford cars but the product is truly distinct of the cars and can stand completely alone.
-
Google's site was set up years ago before this topic was analyzed to this depth. Many changes have been made over the years.
A couple great discussions on this topic:
http://www.seomoz.org/blog/understanding-root-domains-subdomains-vs-subfolders-microsites
A quote Rand shared in a different article "99.9% of the time, if a subfolder will work, it's the best choice for all parties." I agree for the overwhelming majority of cases, a subfolder is preferred. There are some corner cases but normally speaking the subfolder is the preferred approach.
-
-
Subdomains or folder is an old debaiting point, but matt cutts has said it makes no difference.
I have also noticed that google includes subdomain links in its site links, as well as google WMT now shows subdomain links as internal(I know this is seperate to ranking, but it makes but with the other evidence it gives weight to what matt cutts stated). -
Good catch on the subdomains! That is a separate issue, and I am recommending they move everything to a clientsite.com/folder setup. The sub-domains do have unique content (except for the news) and they set it up that way because they've seen other sites, like Google, set up sub-domains for maps and their other products.
What's a good explanation to the client for why other large sites like Google set up different content sections as subdomains vs. the folder approach I am recommending?
-
the news pages list the story headline and the first 3 lines of copy. Do these summaries present duplicate content issues with the full story page?
No
With respect to the subdomain, what is the purpose of having the subdomain? It seems likely the best course of action would be to merge any unique content from the subdomain into the main site, then remove the subdomain. Your articles would benefit from the (presumably) stronger DA on the main site. Also your efforts would be reduced by allowing you to fully focus on one site rather then maintain two sites.
How does this subdomain benefit anyone?
If you insisted on keeping the subdomain, then yes the canonical meta tag would work.
-
canonical would be best here. but you would want to do it with code, or use rewrite outbound rules on the server
I would not worry about the sumery problem
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content for Locations on my Directory Site
I have a pretty big directory site using Wordpress with lots of "locations", "features", "listing-category" etc.... Duplicate Content: https://www.thecbd.co/location/california/ https://www.thecbd.co/location/canada/ referring URL is www.thecbd.co is it a matter of just putting a canonical URL on each location, or just on the main page? Would this be the correct code to put: on the main page? Thanks Everyone!
Technical SEO | | kay_nguyen0 -
What is best practice for fixing urls that have duplicate content, non-static and other issues?
Hi, I know there are several good answers regarding duplicate content issues on this website already, however I have a question that involves the best way to avoid negative SEO impacts if I change the urls for an ecommerce site. Basically a new client has the following website http://www.gardenbeauty.co.uk and I notice that it suffers from duplicate content due to the http://www version and the non www version of the pages - this seems quite easy to fix using the guidance on this website. However I notice that the product page urls are far from ideal in that they have several issues including:- (a) they are mostly too long (b) don't include the keyword terms (in terms of best practice) (c) they don't use Static URLS An example of one these product urls would be http://www.gardenbeauty.co.uk/plant-details.php?name=Autumn Glory&p_genus=Hebe&code=heagl&category=hebe I'd like to address these issues, but the pages rank highly for the products themselves, therefore my question is what would you recommend I do to fix the urls without risking the high positions that many of these product pages have? thanks, Ben
Technical SEO | | bendyman0 -
Duplicate content on job sites
Hi, I have a question regarding job boards. Many job advertisers will upload the same job description to multiple websites e.g. monster, gumtree, etc. This would therefore be viewed as duplicate content. What is the best way to handle this if we want to ensure our particular site ranks well? Thanks in advance for the help. H
Technical SEO | | HiteshP0 -
I noticed all my SEOed sites are getting attacked constantly by viruses. I do wordpress sites. Does anyone have a good recommendation to protect my clients sites? thanks
We have tried all different kinds of security plugins but none seem to work long term.
Technical SEO | | Carla_Dawson0 -
Are aggregate sites penalised for duplicate page content?
Hi all,We're running a used car search engine (http://autouncle.dk/en/) in Denmark, Sweden and soon Germany. The site works in a conventional search engine way with a search form and pages of search results (car adverts).The nature of car searching entails that the same advert exists on a large number of different urls (because of the many different search criteria and pagination). From my understanding this is problematic because Google will penalize the site for having duplicated content. Since the order of search results is mixed, I assume SEOmoz cannot always identify almost identical pages so the problem is perhaps bigger than what SEOmoz can tell us. In your opinion, what is the best strategy to solve this? We currently use a very simple canonical solution.For the record, besides collecting car adverts AutoUncle provide a lot of value to our large user base (including valuations on all cars) . We're not just another leech adword site. In fact, we don't have a single banner.Thanks in advance!
Technical SEO | | JonasNielsen0 -
Mitigating duplicate page content on dynamic sites such as social networks and blogs.
Hello, I recently did an SEOMoz crawl for a client site. As it typical, the most common errors were duplicate page title and duplicate content. The client site is a custom social network for researchers. Most of the pages that showing as duplicate are simple variations of each user's profile such as comment sections, friends pages, and events. So my question is how can we limit duplicate content errors for a complex site like this. I already know about the rel canonical tag, and rel next tag, but I'm not sure if either of these will do the job. Also, I don't want to lose potential links/link juice for good pages. Are there ways of using the "noindex" tag in batches? For instance: noindex all urls containing this character? Or do most CMS allow this to be done systematically? Anyone with experience doing SEO for a custom Social Network or Forum, please advise. Thanks!!!
Technical SEO | | BPIAnalytics0 -
Best way to use affiliate links
What is the best practice to use amazon affliate links in blog posts? I have read different opinions on this, and want to be sure I'm using best practices. I sometimes link to amazon with an affiliate link on some of my posts, and am working on a top ten Christmas gift ideas for Children born with Down syndrome with lots of affiliate links on it. I'm want to be sure I'm using best practices when adding links like this. Tanks!
Technical SEO | | NoahsDad0 -
Duplicate content
I have to sentences that I want to optimize to different pages for. sentence number one is travel to ibiza by boat sentence number to is travel to ibiza by ferry My question is, can I have the same content on both pages exept for the keywords or will Google treat that as duplicate content and punish me? And If yes, where goes the limit/border for duplicate content?
Technical SEO | | stlastla0