Philosophy & Deep Thoughts On Tag/Category URLs
-
Hello, SEO Gurus!
First off, my many thanks to this community for all of your past help and perspective. This is by far the most valuable SEO community on the web, and it is precisely because of all of you being here. Thanks!
I've recently kicked off a robust niche biotech news publishing site for a client, and in the first 6 weeks, we've generated 15K+ views and 9300 visits. The site is built on the WordPress platform.
I'm well aware that a best practice is to noindex tag and category pages, as I've heard SEOs say that they potentially lead to duplicate content issues. We're using tags and categories heavily, and to date, we've had just 282 visits from tag & category pages. So, that's 2.89% of our traffic; the vast majority of traffic has landed on the homepage or article pages (we are using author markup).
Here's my question, though, and it's more philosophical: do these pages really cause a duplicate content issue? Isn't Google able to determine that said page is a tag page, and thus not worthy of duplicate content penalties? If not, then why not?
To me, tag/category pages are sometimes better content pages to have ranked than article pages, since, for news especially, they potentially give searchers a better search result (particularly for short tail keywords). For example, if I write articles all the time about the Mayo Clinic," I'd rather have my evergreen "Mayo Clinic" tag page rank on page one for the keyword "mayo clinic" than just one specific article that very quickly drops out of the news cycle. Know what I mean?
So, to summarize:
1. Are doindexed tag/category pages really a duplicate content problem, and if so, why the heck?
2. Is there a strategy for ranking tag/category pages for news publishing sites ahead of article pages?
Thanks as always for your time and attention.
Kind Regards,
Mike
-
Hey Mike
Great question(s)!
1. Are indexed tag/category pages really a duplicate content problem, and if so, why the heck**?**
Since we are getting philosophical - let's define "what is duplicate content"? in the first place. There's two different types really;
- technical duplicate content - this is the kind we're referring to here. It's not real duplicate content (like you're trying to copy the same article or something over and over, it's not even cross domain). Technical duplicate content is there as a result of a function of the CMS or web development. Like tracking parameters, non-canonical homepages (www, non-www, /index.heml all loading etc), sorting functions on ecommerce sites.
- actual duplicate content - this is more like when someone has scraped an article from one domain to another, or copied an article on purpose - to actually try and pass it off as "unique" when it's totally copied.
Tags & categories sort of cause "technical duplicate content" but not always. It depends how you have WordPress set up. Most commonly, I see them create duplicate content in the sense that a tag archive might look almost exactly the same as the article page its self - or very similar.
OR what a lot of people are referring to and don't even realize it (which is a bit of a pet peeve) is the subpages off of tags and categories. When tag and/or category pages paginate (again, depending on how it's set up) the title tags will look like duplicates.
ie:
/tag/exercise-and-nutrition/ has the title tag: Exercise and Nutrition - Healthblog.com
/tag/exercise-and-nutrition/page/2 etc _still has the title tag: _Exercise and Nutrition - Healthblog.com
So the question really is - if tag/categories are "technical duplicate content" is THAT type of "duplicate content" an issue.
I've heard Google say: NO. John Mueller from Google has said multiple times in Webmaster Central Hangout Help Videos - "Google can distinguish this sort of accidental duplicate from real duplicate content".
BUT - not so fast - tags and categories can still be an issue, just NOT because of "duplicate content."
It really all depends how you have them set up.
1. I first recommend understanding the distinctions between tags and categories (image from my WordPress article)
2. I do recommend indexation in categories by default in most cases. Not sure where you've also heard to noindex categories. That's IF they are used correctly per #1 above. If you use 5-8 well constructed and chosen categories there should not be a problem with indexing categories.
3. Noindex subpages of archives - this kills 95% of what some folks mistakingly call "duplicate content" and is really just duplicate title tags from the pagination of subpages.
4. I highly advocate leaving some tags indexed (using the Yoast SEO plugin) that are bringing traffic - here's how I do that analysis when de-indexing tags.
Here are the REAL issues that tags and subpages CAN create;
- index bloat - lots of pages getting indexed that fill up the index and distract from what you might prefer to rank for instead
- poor user metrics from Google results - users tend to bounce off of tag archives, creatig lower user metrics, which can feed back into rankings
- dilution of content - so while this isn't "duplicate content" is is content dilution: multiple pages that all sort of overlap in topics.
2. Is there a strategy for ranking tag/category pages for news publishing sites ahead of article pages?
Totally! Check out Kane's comment on my WordPress post - essentially he is saying to customize your category archives with some unique content on them, as to distinguish them from being posts. Also, only display excerpts of your posts on archive pages.
We always cite SugarRae's blog as a great example. Check out her category page here. It has totally unique content at the top, and the posts below.
-
- -
To conclude, and keep it philosophical
I think what you're also getting at here, is an important part of SEO (or anything) that people don't talk about as much - but that's the idea of keeping an open mind, analyzing your specific situation, testing, testing the limits of "rules" - and really applying your own brain. Validate things for yourself.
One of the biggest issues, is that most people do not use tags in a deliberate way or really understand how they fully function. They just slap 20 tags on every post (which they think is a magic SEO trick) and end up with thousands of tag pages (I've seen sites with 7,000+ tag archives!) - at the beginning this might not be an issues, but over time if done recklessly like that, it can cause some of the problems noted above.
Great question!
-Dan
-
Well..., since you opened the Philosophy & Deep Thoughts topic, think of it like this: the answer to your question lies in the engagement strategy you develop for those pages. There is no rule here. How can you formulate those pages to effectively entice likes, shares, retweets, comments, +1's?--that's the question.
For the category pages, formulate and execute a strategy that will leverage the philosophy of your product mix/editorial calendar (the two should mesh). (You have formulated those two things based on business objectives and target audience, right?) There is a reason you sell the specific products that you sell, right?---make that a fundamentally obvious part of your category page content and provide an rss feed specific to the audience of that philosophy--even if it's a small target audience. Produce content for that feed on a regular basis.
If you structure the content of your category pages around a curation philosophy there will be a fundamental difference between those pages and the content on your product pages. At that point, your duplicate content will disappear.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should I noindex my categories?
Hello! I have created a directory website with a pretty active blog. I probably messed this up, but I pretty much have categories (for my blog) and custom taxonomy (for different categories of services) that are very similar. For example I have the blog category "anxiety therapists" and the custom taxonomy "anxiety". 1- is this a problem for google? Can it tell the difference between archive pages in these different categories even though the names are similar? 2- should I noindex my blog categories since the main purpose of my site is to help people find therapists ie my custom taxonomy?
Intermediate & Advanced SEO | | angelamaemae0 -
How To Shorten Long URLS
Hi I want to shorten some URLs, if possible, that Moz is reporting as too long. They are all the same page but different categories - the page advertises jobs but the client requires various links to types of jobs on the menu. So the menu will have: Job type 1
Intermediate & Advanced SEO | | Ann64
Job type 2
Job Type 3 I'm getting the links by going to the page, clicking a dropdown to filter the Job type, then copying the resulting URL from the address bar. Bu these are really long & cumbersome. I presume if I used a URL shortener, this would count as redirects and alsonot be good for SEO. Any thoughts? Thanks
Ann0 -
Reviewing Category & Tag policy - Update
I recently (http://moz.com/community/q/less-tags-better-for-seo) started reviewing my category and tag policy, and things have been going very well. I thought I would share what I have done: Removed all tags from site Added unique descriptions for each post for the category excerpt. Only had the category description on the first page and use the description like a post to summarise and interlink to sub-categories or posts. This keeps pages from slipping down the number of clicks until it can be reached, improving link juice distribution. I also reduced the number of posts showing to 5, to allow more focus on the description (main part) of the category post. To add the category description on the first category page only in Wordpress, you need to go to the category.php or archive.php and change: to The overall aim was to have a hierarchal resource contained in the category page description. Whilst this is still a work in progress, you can see an example of what I am trying to achieve here: https://www.besthostnews.com/web-hosting-tutorials/cpanel/ https://www.besthostnews.com/web-hosting-tutorials/cpanel/mail/ If you have any further tips and advice as I continue to implement this (with good results so far), please feel free. Also, you can use the Visual Term Description Editor plugin to allow the wysiwyg editor for the category descriptions.
Intermediate & Advanced SEO | | TheWebMastercom1 -
Bingpreview/1.0b Useragent Using Adding Trailing Slash to all URLs
The Bingpreview crawler, which I think exists in order to take snapshots of mobile friendly pages, crawled my pages last night for the first time. However, it is adding a trailing slash to the end of each of my dynamic pages. The result is my program is giving the wrong page--my program is not expecting a trailing slash at the end of the urls. It was 160 pages, but I have thousands of pages it could do this to. I could try doing a mod rewrite but that seems like it should be unnecessary. ALL the other crawlers are crawling the proper urls. None of my hyperlinks have the slash on the end. I have written to Bing to tell them of the problem. Is anyone else having this issue? Any other suggestions for what to do? The user agent is: Mozilla/5.0 (iPhone; CPU iPhone OS 7_0 like Mac OS X) AppleWebKit/537.51.1 (KHTML, like Gecko) Version/7.0 Mobile/11A465 Safari/9537.53 BingPreview/1.0b
Intermediate & Advanced SEO | | friendoffood0 -
Domain.com/keyword1.keyword2.html vs doamin.com/keyword1-keyword2.html
I was doing some research and saw this url structure in a website that was not ranking well and can't help but wonder was the url structure part of the problem as well it looks like this with a period between keywords. domain.com/keyword1.keyword2.html and was wondering if that is acceptable for search engines as opposed to the normal dashes like this expample ... domain.com/keyword1-keyword2-keyword3.html I have never noticed a period to separate words in a url before. Anyone have any experience with this ? Is this going to hurt possible rankings ? Thank you in advance, Joe
Intermediate & Advanced SEO | | jlane91 -
What is the best URL structure for categories?
A client's site currently uses the URL structure: www.website.com/�tegory%/%postname% Which I think is optimised fairly well, as the categories are keywords being targeted. However, as they are using a category hierarchy, often times the URL looks like this: www.website.com/parent-category/child-category/some-post-titles-are-quite-long-as-they-are-long-tail-terms Best practise often dictates (such as point 3 in this Moz article) that shorter URLs are better for several reasons. So I'm left with a few options: Remove the category from the URL Flatten the category hierarchy Shorten post titles two a word or two - which would hurt my long tail search term traffic. Leave it as it is What do we think is the best route to take? Thanks in advance!
Intermediate & Advanced SEO | | underscorelive0 -
Does having a file type on the end of a url affect rankings (example www.fourcolormagnets.com/business-cards.php VS www.fourcolormagnets.com/business-cards)????
Does having a file type on the end of a url affect rankings (example www.fourcolormagnets.com/business-cards.php VS www.fourcolormagnets.com/business-cards)????
Intermediate & Advanced SEO | | JHSpecialty0 -
Our site is recieving traffic for both .com/page and .com/page/ with the trailing slash.
Our site is recieving traffic for both .com/page and .com/page/ with the trailing slash. Should we rewrite to just the trailing slash or without because of duplicates. The other question is, if we do a rewrite, google has indexed some pages with the slash and some without - i am assuming we will lose rank for one of them once we do the rewrite, correct?
Intermediate & Advanced SEO | | Profero0