Best way to address duplicate news sections within site
-
A client has a news section at www.clientsite.com/news and also at subdomain.clientsite.com/news. The stories within each section are identical:
www.clientsite.com/news/story-11-5-2011
subdomain.clientsite.com/news/story-11-5-2011
What's the best way to avoid a duplicate content issue within the site? A 301 redirect doesn't seem appropriate from the user experience point of view.
Is applying a rel=canonical pointing to www.clientsite.com/news/story-a-b-c on each story within the subdomain news section the best option? They have hundreds of stories, so I am wondering if there might be an easier way.
Also, the news pages list the story headline and the first 3 lines of copy. Do these summaries present duplicate content issues with the full story page?
Thank you!
-
Alan, I appreciate your effort here. These are the sources I already shared
A complete summary of everything shared in those articles you quote:
1. It doesn't make a difference to Google which method is used. When I examine all the information and analysis, it seems to indicate Google will index the content either way. How well that content will rank in Google is a different topic. There are reasons to keep content separate, such as when discussing topics unrelated to the main site, in which case a subdomain would be best.
2. Matt uses the directory approach, and he recommends for others to do the same.
AT BEST, you can conclude from that information that it is close to even, with a slight preference towards subfolders.
Rand offers outstanding analysis as to why subfolders are the superior choice. Rand's analysis is from 2009, 2 years after the original articles quoted from Matt. http://www.seomoz.org/blog/understanding-root-domains-subdomains-vs-subfolders-microsites
The bottom line: it's up to you how much you care about your site and its performance. Personally, I am a fighter. I also micro-manage website architecture because in many aspects, it is a one-time, set-it-and-forget-it type of thing. Whether to use subdomains vs subfolders, whether to use underscores vs dashes in URLs, etc. are things you decide one time and then it is automated forever.
A detailed list of reasons supporting the subfolder approach has been offered. The DA, time, costs, etc. all support subfolders. If you wish to ignore all those strong, positive benefits and go with a subdomain then that is your choice.
Good luck.
-
The originals
http://googlewebmastercentral.blogspot.com/2008_01_01_archive.html
http://www.mattcutts.com/blog/subdomains-and-subdirectories/
Here is a better example from Matt:
Deb December 11, 2007 at 1:01 am
Matt thanks for your reply, just a query (if you don’t mind) if I add content in mattcutts.com/blog – it effect in seo because I add directly content in the domain mattcutts.com but if I add content in blog.mattcutts.com is the effect is same? I don’t think so – because this is a subdomain not directly related with the domain?
If I disturb you please don’t mind
Thanks
Deb
Matt Cutts December 10, 2007 at 10:55 am
Deb, it really is a pretty personal choice. For something small like a blog, it probably won’t matter terribly much. I used a subdirectory because it’s easier to manage everything in one file storage space for me. However, if you think that someday you might want to use a hosted blog service to power your blog, then you might want to go with blog.example.com just because you could set up a CNAME or DNS alias so that blog.example.com pointed to your hosted blog service.
I was trying to find a video Matt made where he makes a similar claim, but I have to get back to work.
-
Alan,
We will have to agree to disagree on this one.
There is a ton of what can only be referred to as "SEO bullshit" published. When I quote a source it will usually be Matt Cutts directly, or Google, or a highly respected SEO who shares an opinion on a topic AND who offers very solid research to back up that opinion. In short, credibility is everything when quoting a source to support a given position.
You are quoting a site I have never heard of, alexander.holbreich.org. Is it just me? Do others know and recognize this site as a reputable source of SEO information?
The author's About page is a total of 4 lines of text. Line 1 = his name, Lines 3 & 4 = where he lives. Line 2 = he has a degree in "Business Information" but doesn't even state where or when he received this degree. This web page is a solid example of a page that has absolutely zero trust on SEO.
I think it is great that you read various sources of SEO for ideas, but that is very different from depending on those sources as credible information.
If you want to quote, try the main source article. Doing so would add credibility to your position. I can agree there is a lot of confusion on this topic, but it is propagated mostly by pages like the one you linked, which should probably never be read.
Using the source you quoted and some common ground I would share the following:
-
Matt Cutts stated he uses folders: "My personal preference on subdomains vs. subdirectories is that I usually prefer the convenience of subdirectories for most of my content. A subdomain can be useful to separate out content that is completely different."
-
Matt Cutts recommended that others use folders: "If you’re a newer webmaster or SEO, I’d recommend using subdirectories until you start to feel pretty confident with the architecture of your site."
-
Matt shared a specific example of when a subdomain would be appropriate, and it is an example I had shared as well in response to the original question: "A subdomain can be useful to separate out content that is completely different. Google uses subdomains for distinct products such news.google.com or maps.google.com, for example."
The above aside, one site is easier to maintain than two. There are lower costs all around (software, trust badges, SSL, etc.). There is less time involved as well. All that time and money can be put into other aspects of SEO such as link building and creating great content.
Further, by combining your content into one site, all your content benefits from the higher DA of your site.
I hope you take the information I am sharing the right way, Alan. My professional experience leads me to almost always use a folder unless there is a clear and specific reason to use a subdomain, such as trying to separate out content which is not related to the main site. The difference is strong enough that I would recommend most clients who have a subdomain delete it and move to the subfolder structure.
If you find a differing opinion, I would love to hear it. All I ask is for it to be from a highly credible SEO source who preferably shares detailed examples or logic to support the position.
Best Regards,
-
-
"With respect to the general subfolder vs domain discussion, as far as I have seen most of the "debate" ended with subfolders being the winner."
For what reasons is it the winner? I use subdomains a lot, that's why I have looked for evidence, and Matt Cutts has stated it makes no difference.
Rand states it is his personal belief, but Google and Matt Cutts have stated many times it makes no difference to rankings.
http://alexander.holbreich.org/2008/01/subdomains-vs-subdirectories/
"... otherwise irrelevant change during this discussion only serves to confuse an otherwise muddy topic"
I don't think it's confusion; it is information clearly stated (not to do with rankings) for one to consider. It is an indication of Google's thinking. It is stated correctly and all information should be considered. One could say that stating Rand's personal belief is confusing.
-
I take a different view on this topic than Alan.
As Alan mentioned, the sole effect of the recent Google change is how links to sub-domains from the root domain visually appear in Google WMT. They have absolutely no ranking weight difference. Bringing up that otherwise irrelevant change during this discussion only serves to confuse an otherwise muddy topic.
With respect to the general subfolder vs domain discussion, as far as I have seen most of the "debate" ended with subfolders being the winner.
There are a couple situations where a subdomain would be preferable to a folder. One example is when a different, unrelated topic or product is being offered. Keith, you brought up the example of Google Maps. A few comments I would share:
-
Google Maps is a different product than Google search. Really the main thing they have in common is that they are offered by the same company. The idea of providing satellite images and driving directions is really quite different than providing the best search results. These two products happen to be offered by the same company but if you think about it, they are really very distinct products. It would be the same idea if Ford created their own version of Sirius radio. Yes, the radios would be offered in Ford cars but the product is truly distinct from the cars and can stand completely alone.
-
Google's site was set up years ago before this topic was analyzed to this depth. Many changes have been made over the years.
A couple great discussions on this topic:
http://www.seomoz.org/blog/understanding-root-domains-subdomains-vs-subfolders-microsites
A quote Rand shared in a different article "99.9% of the time, if a subfolder will work, it's the best choice for all parties." I agree for the overwhelming majority of cases, a subfolder is preferred. There are some corner cases but normally speaking the subfolder is the preferred approach.
-
-
Subdomains or folders is an old debating point, but Matt Cutts has said it makes no difference.
I have also noticed that Google includes subdomain links in its sitelinks, and Google WMT now shows subdomain links as internal. (I know this is separate from ranking, but together with the other evidence it gives weight to what Matt Cutts stated.)
-
Good catch on the subdomains! That is a separate issue, and I am recommending they move everything to a clientsite.com/folder setup. The sub-domains do have unique content (except for the news) and they set it up that way because they've seen other sites, like Google, set up sub-domains for maps and their other products.
What's a good explanation to the client for why other large sites like Google set up different content sections as subdomains vs. the folder approach I am recommending?
-
the news pages list the story headline and the first 3 lines of copy. Do these summaries present duplicate content issues with the full story page?
No
With respect to the subdomain, what is the purpose of having the subdomain? It seems likely the best course of action would be to merge any unique content from the subdomain into the main site, then remove the subdomain. Your articles would benefit from the (presumably) stronger DA on the main site. Also your efforts would be reduced by allowing you to fully focus on one site rather than maintain two sites.
How does this subdomain benefit anyone?
If you insist on keeping the subdomain, then yes, the canonical link tag would work.
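Since the story paths are identical on both hosts, the tag does not have to be added by hand to hundreds of stories; it can be generated in the page template. Below is a minimal sketch in Python, assuming the www host from the question is the preferred one. The function names are hypothetical, and dropping the query string is my own assumption about what the canonical form should look like.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit

# Assumption: the www host from the question is the preferred (canonical) one.
CANONICAL_HOST = "www.clientsite.com"


def canonical_url(url: str) -> str:
    """Return the same URL with the host swapped for the canonical host."""
    parts = urlsplit(url)
    # Assumption: query string and fragment are dropped so that each story
    # has exactly one canonical form.
    return urlunsplit((parts.scheme, CANONICAL_HOST, parts.path, "", ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link> element to place in the <head> of the duplicate page."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}">'


print(canonical_link_tag("http://subdomain.clientsite.com/news/story-11-5-2011"))
# prints: <link rel="canonical" href="http://www.clientsite.com/news/story-11-5-2011">
```

Called once from the shared story template, this covers every story on the subdomain without per-page work.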
-
Canonical would be best here, but you would want to do it with code, or use outbound rewrite rules on the server.
I would not worry about the summary problem.
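As a rough illustration of doing it on the server rather than in each page, the canonical can also be sent as an HTTP `Link` header with `rel="canonical"`. The sketch below is a small Python WSGI middleware, which is my own stand-in for a server-side outbound rule and not necessarily what the client's server uses. The class name, the host constants and the `/news/` prefix check are assumptions taken from the URLs in the question.

```python
# Assumptions: host names and the /news/ prefix come from the question;
# the scheme of the canonical origin is assumed to be http.
CANONICAL_ORIGIN = "http://www.clientsite.com"
DUPLICATE_HOST = "subdomain.clientsite.com"


class CanonicalHeaderMiddleware:
    """Hypothetical WSGI middleware that adds a canonical Link header
    to news pages served from the duplicate subdomain."""

    def __init__(self, app):
        self.app = app

    def __call__(self, environ, start_response):
        # Strip any port and normalise case before comparing hosts.
        host = environ.get("HTTP_HOST", "").split(":")[0].lower()
        path = environ.get("PATH_INFO", "")

        def patched_start_response(status, headers, exc_info=None):
            if host == DUPLICATE_HOST and path.startswith("/news/"):
                link = f'<{CANONICAL_ORIGIN}{path}>; rel="canonical"'
                headers = list(headers) + [("Link", link)]
            return start_response(status, headers, exc_info)

        return self.app(environ, patched_start_response)
```

Wrapping the existing application (`app = CanonicalHeaderMiddleware(app)`) leaves pages on the www host untouched and only annotates the duplicate news URLs.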