Can Sitemap Be Used to Manage Canonical URLs?
-
We have a duplicate content challenge that likely has contributed to us loosing SERPs especially for generic keywords such as "audiobook," "audiobooks," "audio book," and "audio books."
Our duplicate content is on two levels.
1. The first level is at our web store, www.audiobooksonline.com.
Audiobooks are sometimes published in abridged, unabridged, on compact discs, on MP3 CD by the same publisher. In this case we use the publisher description of the story for each "flavor" = duplicate content.
Can we use our sitemap to identify only one "flavor" so that a spider doesn't index the others?
2. The second level is that most online merchants of the same publisher's audio book use the same description of the story = lots of duplicate content on the Web.
In that we have 11,000+ audio book titles offered at our Web store, I expect Google sees us as having lots of duplicated (on the Web) content and devalues our site.
Some of our competitors who rank very high for our generic keywords use the same publisher's description.
Any suggestions on how we could make our individual audio book title pages unique will be greatly appreciated.
-
Your sitemap.xml can't be used to solve this issue, Larry. The sitemap is only used to tell the search engines which pages exist on the site, not what to do if many of those pages share similar content.
In your case, likely the best approach is to use the rel=canonical tag to inform the search engines that you aware that the different formats of the audiobooks share similar descriptions, and to pick one format to be the primary page. Once you've determined the primary page, the other formats' pages would use the canonical tag in their headers to point to the primary page.
This essentially tells the search engines "these other pages are useful to the user, so I don't want to hide them, but they are really variations of the primary page, so assign all their value to the primary page, please".
This process is only a suggestion to the search engines, but it is usually heeded. The only real alternative would be to combine all the different format pages into one page with a description of the book, then listing the other formats and their prices. Kinda doubt your eCommerce system would allow this "out of the box". (You would then 301-redirect all the other format pages to the new main page.)
As for the fact that the book descriptions are the same as the publisher's and all the other sites - the only way around this is to write your own custom descriptions. There are many reasons the other sites could be ranking well even with those duplicate descriptions, ranging from better overall site authority, to having been online longer, to having better, more powerful incoming links.
It's a tough spot to be in, but you could start by rewriting the descriptions for, say, the top 25 books (according to your Analytics and your own instincts for which ones are the most valuable sales) and see if that results in an improvement to rankings and conversions.
One other way to beat the duplicate content in this case would be to get customers to leave reviews which are included on each page. These reviews would be different from other sites, making the overall content look different to the search engines. But this is also a lot of work to get to scale up as your customers must be encouraged to come back to your site at a later date to leave the review.
Hope that helps;
Paul
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Homepage canonical url with splash or not with splash? All other links are without but logo links with splash
Hello, There is so much contradicting information about the homepage canonical URL. Many websites have all the links without the trailing splash but their homepage URL still contains the splash. Now Moz is an example with this. Their urls don't have the splash, and their canonical does not have the splash. Why is it so and why so much different ways people have it?
On-Page Optimization | | advertisingcloud0 -
How can i get a higher ranking
Hey All, As a rookie on SEO, I optimised this page http://www.nhad.de/fernstudium/sprachkurse/arabisch/sprachkurse-arabisch/arabisch.aspx for the keywords Arabisch lernen. I saw a huge improvement from position 100+ to position 45. but now it's stuck around this position. Are there any things I missed on this page? Are there still things to increase the ranking? I hope to learn from you. thanks in advance!
On-Page Optimization | | NHA_DistanceLearning0 -
Which is Best Practice for creating URLs for subdomain?
My website is related to education. We have created sub domains for all major colleges, universities & Entrance exams like Gre, Toefl ETC. for eg: amityuniversity.abc.com (Amity is Name of University ) Now if have to mention city name in URL as well (college is located in multiple locations) amityuniversity-delhi.abc.com
On-Page Optimization | | rohanarora536
amityuniversitydelhi.abc.com Now my Q is can we use hyphens in sub domains if we have to add city name or shall we create without using any hyphens. In Directory structure we can always separate words with hyphens, can we follow same practice in subdomain as well Which is a best URL for subdomain amity-university-delhi.abc.com
amityuniversity-delhi.abc.com
or amityuniversitydelhi.abc.com0 -
Can you expound why i have to avoid using meta keywords?
I'm using the on page report card and it tells me that i have to avoid using meta keywords.I'm a little bit confused. I thought that it's important to use it all the time so search engine can better index the site. if I use SEO Quake it will tell me in the diagnostic test that I need to input keywords.
On-Page Optimization | | jsevilla0 -
"Canonical URL Tag Usage" recommendation in SEOmoz "On-Page Optimization" Tool
Here comes another one related to SEOmoz "On-Page Optimization" Tool. The tool says the following about one of our pages: Canonical URL Tag Usage Explanation: Although the canonical URL tag is generally thought of as a way to solve duplicate content problems, it can be extremely wise to
On-Page Optimization | | gerardoH
use it on every (unique) page of a site to help prevent any query strings, session IDs, scraped versions, licensing deals or future
developments to potentially create a secondary version and pull link juice or other metrics away from the original. We believe
the canonical URL tag is a best practice to help prevent future problems, even if nothing is specifically duplicate/problematic
today. Recommendation: Add a canonical URL tag referencing this URL to the header of the page. Let's say our page is http://www.example.com/brands/abc-brand and on its header we'll place the following tag: Is this correct? I thought the canonical tag was meant for duplicates of the original page, for example: http://www.example.com/brands/print/abc-brand href="http://www.example.com/brands/abc-brand**?SESSID=123** Thanks in advance.0 -
Does google treat all urls equal?
Sorry for the lame title, i couldn't think of a better one. I want to know if google treats this: http://www.domain.com/products/some-product-name the same as it would treat: http://www.domain.com/?products=some-product-name if not, could you tell me the differences?
On-Page Optimization | | adriandg0 -
Keywords in URL:
what kind of URL should we use? www.keyword.net/keyword-city or www.keyword.net/city which URL you would prefer?
On-Page Optimization | | alibeef0 -
Do we need to use the canonical tag on non-indexed pages?
Hi there I have been working in / learning SEO for just over a year, coming from a non dev background, so there are still plenty of the finer points on-page points I am working on. Slowly building up confidence and knowledge with the great SEOMoz as a reference! We are working on this site http://www.preciseuk.co.uk (we are still tweaking the tags and content by the way- not finished yet!) Because a lot of the information is within accordians, a page is generated for each tab of the accordian expanded, for example: http://www.preciseuk.co.uk/facilities-management.php is the main page but then you also have: http://www.preciseuk.co.uk/facilities-management.php?tab=0 http://www.preciseuk.co.uk/facilities-management.php?tab=1 http://www.preciseuk.co.uk/facilities-management.php?tab=2 http://www.preciseuk.co.uk/facilities-management.php?tab=3 http://www.preciseuk.co.uk/facilities-management.php?tab=4 http://www.preciseuk.co.uk/facilities-management.php?tab=5 All of which are in the same file. According to the crawl test, these pages are not indexed. Because it is all in one file, should we add the canonical tag to it, so that this is replicated in all the tab pages that are generated? eg. Thanks in advance for your help! Liz OneResult
On-Page Optimization | | oneresult
[email protected]2