What is with WordPress Dupe issues?
-
Hi,
Just wondering if anyone can explain for me why it seems every tag that is entered in WP blog posts on a site creates a duplicate page (identified by ROGER and friends in SEOmoz crawl)?
Obviously if you can offer a solution (apart from the extremely obvious "don't use tags") I would be immensely grateful.
Thanks so much,
-
Good Answer and a great article on your site too! Kudos!
-
That's great!
I'll keep an eye out for news on the subject
thanks again,
-
Thank you Marcus! And I just saw your tweet. That's super awesome of you.
I'll let the owner of the site know.
-
You're welcome Sha.
I wish I knew Matt Mullenweg so I could ask him the same question. I plan to be at the WordPress Camp here in San Diego in July, so maybe I'll ask around and try to get to the root of why there has never been an inherent "canonical setting."
Maybe they figured Google would sort out which is the canonical (which it will). But I would much rather say what path, or what keyword anchor Google has to crawl through to get to the post.
-
Hey William
That's a great answer, thumbs up.
Marcus
-
Thanks William,
I figured this was what was happening, but I suppose I imagined since it is obviously going to create duplicate content issues that the WP people would have been smart enough to make it add a canonical tag or something.
Thanks for the info on the plugin - much appreciated!
-
What happens when you use tags is that you create a page that is called www.your-domain.com/tag/tag-name. You also have the original blog post which is www.your-domain.com/blog-post.
They have two completely separate URLs, but they have the exact same content.
This is exacerbated when you use multiple tags because you increase the amount of chances for duplicate content to be created. And again it's even more expounded when you take into account the Categories, and Date Based Archives.
The solution is to use something like the Yoast WordPress SEO plugin and use no-index for everything that you don't want indexed. In my case I want Categories indexed, so I no-index Tags and Date Based Archives.
If you use Yoast's plugin I have put together a fairly easy article on how to initially set it up, and it is pretty helpful in understanding some of the settings. It's geared mainly for photographers running their own site, but should be just as valuable to you.
You can find it here http://www.themoderntog.com/how-to-configure-wordpress-seo-plugin-by-yoast-for-maximum-online-exposure
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google search console image indexing issue
Google search console tells that only '58 out of the 3553' images in the images sitemap are indexed. But if I search "site:example.com" in Google images there seem to be lots of images. There are no errors in the sitemap and I am still getting reasonable number of image search hits daily. Are the webmaster tools stats for images indexed accurate? When I click on the Sitemap Errors & Index Errors this is what i get - Error details: No errors found. https://www.screencast.com/t/pqL62pIc
Technical SEO | | 21centuryweb0 -
Http v https Duplicate Issues
Hello, I noticed earlier an issue on my site. http://mysite.com and https://mysite.com both had canonical links pointing to themselves so in effect creating duplicate content. I have now taken steps to ensure the https version has a canonical that points to the http version but I was wondering what other steps would people recommend? Is it safe to NOINDEX the https pages? Or block them via robots.txt or both? We are not quite ready to go fully HTTPS with our site yet (I know Google now prefers this) Any thoughts would be very much appreciated.
Technical SEO | | niallfred0 -
Wordpress tags and duplicate content?
I've seen a few other Q&A posts on this but I haven't found a complete answer. I read somewhere a while ago that you can use as many tags as you would like. I found that I rank for each tag I used. For example, I could rank for best night clubs in san antonio, good best night clubs in san antonio, great best night clubs in san antonio, top best night clubs in san antonio, etc. However, I now see that I'm creating a ton of duplicate content. Is there any way to set a canonical tag on the tag pages to link back to the original post so that I still keep my rankings? Would future tags be ignored if I did this?
Technical SEO | | howlusa0 -
301 issue in IE9
My development team recently discovered an issue with 301 redirects caching in IE9. They did some research and found the situation was very complicated so their solution was to use 302s and no longer use 301s. As a temporary solution to a few URLs I was okay with this, but we have a site redesign launching in a few months and I am quite worried if we have to do all of our redirects as 302s. Has anyone else had this issue with IE9 and 301s. I could use any advice on how to overcome this issue. Thanks!
Technical SEO | | SEI0 -
MSNbot Issues
We found msnbot is doing lots of request at same time to one URL, even considering we have caching, it triggers many requests at same time so caching does not help at the moment: For sure we can use mutex to make sure URL waits for cache to generate, but we are looking for solution for MSN boot. 123.253.27.53 [11/Dec/2012:14:15:10 -0600] "GET //Fun-Stuff HTTP/1.1" 200 0 "-" "msnbot/2.0b (+http://search.msn.com/msnbot.htm)" 1.253.27.53 [11/Dec/2012:14:15:10 -0600] "GET //Type-of-Resource/Fun-Stuff HTTP/1.1" 200 0 "-" "msnbot/2.0b (+http://search.msn.com/msnbot.htm)" 1.253.27.53 [11/Dec/2012:14:15:10 -0600] "GET /Browse//Fun-Stuff HTTP/1.1" 200 6708 "-" "msnbot/2.0b (+http://search.msn.com/msnbot.htm)" We found the following solution: http://www.bing.com/community/site_blogs/b/webmaster/archive/2009/08/10/crawl-delay-and-the-bing-crawler-msnbot.aspx Bing offers webmasters the ability to slow down the crawl rate to accommodate web server load issues. User-Agent: * Crawl-Delay: 10 Need to know if it’s safe to apply that. OR any other advices. PS: MSNBot gets so bad at times that it could trigger a DOS attack – alone! (http://www.semwisdom.com/blog/msnbot-stupid-plain-evil#axzz2EqmJM3er).
Technical SEO | | tpt.com0 -
Wordpress question
I was curious when i run an OSE report on certain websites and their name.wordpress.com shows up with a PA of whatever and a DA of 100. But when I created my wordpress site and post on it, it only has a PA and DA of 1. is this because SEOmoz has not indexed it yet? It is a month old. http://shiftinsurance.wordpress.com/ Can anyone help pls?
Technical SEO | | greasy0 -
Home page canonical issues
I think I’ve got a canonical issue with a client’s site that I’m having problems with I’ve noticed in their analytics that they receive traffic from themselves. I’ve used ‘ rel canonical’ throughout the site to avoid any dup issues and I have 301’ed every other variation of the home page I can think of. I don’t have full access to the back end of the host to control any of the iis as it’s an asp site. They seem to be getting traffic from their site under the URL of, example.com I’ve 301 redirected www.example.com/home.asp www.example.com/default.asp www.example.com/index.asp to www.example.com And 'rel canonical' the home page to www.example.com but still seem to be having the same problem any ideas? Thanks
Technical SEO | | FarkyRafiq0 -
Paginated Home Page Duplicates on Wordpress Sites
A number of my websites created on WP are displaying duplicate home pages with these types of urls. http://www.example.com/page/10/ http://www.example.com/page/11/ http://www.example.com/page/12/ I found these duplicates using the site:search command. Basically, put in any number and the Home Page opens. With the above mentioned url structure. Any idea on why they are created, how they can be stopped and what kind of an impact they would have in terms of SEO and the penalty that comes with duplicate content.
Technical SEO | | AsadMemon1