How to prevent duplicate content at a calendar page
-
Hi,
I've a calender page which changes every day.
The main url is
/calendarFor every day, there is another url:
/calendar/2012/09/12
/calendar/2012/09/13
/calendar/2012/09/14So, if the 13th september arrives, the content of the page
/calendar/2012/09/13
will be shown at
/calendarSo, it's duplicate content.
What to do in this situation?
a) Redirect from /calendar to /calendar/2012/09/13 with 301? (but the redirect changes the day after to /calendar/2012/09/14)
b) Redirect from /calendar to /calendar/2012/09/13 with 302 (but I will loose the link juice of /calendar?)
c) Add a canonical tag at /calendar (which leads to /calendar/2012/09/13) - but I will loose the power of /calendar (?) - and it will change every day...
Any ideas or other suggestions?
Best wishes,
Georg.
-
Ah... yeah, that's tricky. There's no magic solution, I'm afraid. You've really got three options:
(1) Leave it alone
(2) Re-organize your site architecture to push individual date pages down a level or two, so that they get less internal link-juice.
(3) Re-organize such that you focus search engines on chunks of time or maybe date/aspect combinations, but then de-index the individual date combos. This would take a much better understanding of your site structure than I currently have. The goal would be to focus your index on some smaller combination of pages that still covers 80% of your search traffic.
The big problem is just that this is a lot potential dilution, and I suspect that many of these pages look very similar to Google. I'm also certain that not all pages have the same value, either for SEO or users, so there's some hybrid approach where you could prune back but not lose everything. Long-term, I think that's worth the time and trouble to sort out, but it's not an emergency or something I'd rush into.
-
Hi Peter,
thanks for your answer!
Well, it's even more complicated!
It's an astrology calendar with planet aspect data for each day starting from 1900-01-01 to 2099-12-31, so there are around 73,000 pages, it's a big database.
People are searching for a date and the planet aspects. So I need the "old pages" and the future pages in the index.
People are also searching their birthday and want to know their zodiac. My calendar is providing this info.
This is an example:
http://www.schicksal.com/horoskop/tageshoroskop/1951/09/10The best thing is to do nothing at the moment I think. The alternativ is to cut the content of the current day from the main page and let the user click a button which redirects to the current day page. But this is not user friendly and I will do nothing at them moment.
Any other idea would be great
Best wishes,
Georg.
-
Sadly, the short answer is that you can't have it all. Either you index the separate calendar pages, get more pages/content out there and risk some "thinning" of your index, or you focus on one page, maximize the SEO value, but then lose the individual pages.
I would not 301 or 302 to the individual calendar URLs - that kind of daily URL shifting is going to look suspicious, Google will not re-cache consistently, and you're going to end up with a long-term mess, I strongly suspect.
I actually tend to agree with Muhammed and Paragon that a viable option would be to let the individual days have their own content, but then canonical to the main calendar page to focus the search results. That way, users can still cycle through each individual day, but Google will focus on the core content. In a way, that's how a blog home-page works - the content changes daily, but you're still keeping the bots focused on one URL.
Think of it in terms of usability, too. How valuable is old/outdated content to search users? They might find something relevant on an old page, but they still probably want to see the main calendar and view recent content.
Where are the links to the individual days, if "/calendar" always has today's content? I'm wondering if there's a hybrid approach, like letting the most recent 30 days all have their own URLs, but then redirecting or using rel-canonical to point to the main page after 30 days.
-
What about adding to all of the other pages i.e not to /calendar/ the links will be followed but not indexed by Google.
-
Hi Georg,
Setting up a redirect or canonicalization for the the calendar page in the ways you describe might make it harder to build up any kind of authority for your calendar.
You could consider adding canonicalization for all the individual day pages that points to the main calendar page. ie. Each page /calendar/YYYY/MM/DD would have rel canonlical=/calendar/. Not sure this is the best idea though.
I don't know how your calendar is setup but you could also look at differentiating the pages by doing just a listing of events on the main page and including summaries or detail on the current day page. Or maybe including some additional information about your calendar on the main page like what type of events are included and how to submit events and not including that information on the individual day pages.
I've always taken the approach of minimizing duplicate content as much as possible but not getting excessive with it. I think in a case like this you could do more harm than good. The calendar page is an ever changing page, it's not like you have the exact same static content on two pages.
Hope this helps!
Zach
-
Hi Muhammed,
because the content is different. This would devaluate all calendar pages.
Best wishes,
Georg. -
Hi Georg What about adding canonical tag(s) from each days (/calendar/2012/09/13) calender pages to the main page (/calendar)
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Page content not being recognised?
I moved my website from Wix to Wordpress in May 2018. Since then, it's disappeared from Google searches. The site and pages are indexed, but no longer ranking. I've just started a Moz campaign, and most pages are being flagged as having "thin content" (50 words or less), when I know that there are 300+ words on most of the pages. Looking at the page source I find this bit of code: page contents Does this mean that Google is finding this and thinks that I have only two words (page contents) on the page? Or is this code to grab the page contents from somewhere else in the code? I'm completely lost with this and would appreciate any insight.
Technical SEO | | Photowife1 -
Duplicate content - working with CMS constraints
Hi, We use an industry-specific CMS and I'm struggling to figure out how we can fix duplicate content issues. Thankfully, the vendor has agreed to work on 301 vs 302 redirects. However, they aren't currently able to give us the ability to add rel=canonical tags to page headers (we've put it in their "suggestion box" which tends to take a long time, if ever, to materialize). My understanding is that the tag will not be recognized if it's in the body code, correct? (aka the part of the page we can edit from the CMS) Is there anything else I can do?
Technical SEO | | combska0 -
Duplicate content on report
Hi, I just had my Moz Campaign scan 10K pages out of which 2K were duplicate content and URL's are http://www.Somesite.com/modal/register?destination=question%2F37201 http://www.Somesite.com/modal/register?destination=question%2F37490 And the title for all 2K is "Register" How can i deal with this as all my pages have the register link and login and when done it comes back to the same page where we left and that it actually not duplicate but we need to deal with it propely thanks
Technical SEO | | mtthompsons0 -
How different does content need to be to avoid a duplicate content penalty?
I'm implementing landing pages that are optimized for specific keywords. Some of them are substantially the same as another page (perhaps 10-15 words different). Are the landing pages likely to be identified by search engines as duplicate content? How different do two pages need to be to avoid the duplicate penalty?
Technical SEO | | WayneBlankenbeckler0 -
Does turning website content into PDFs for document sharing sites cause duplicate content?
Website content is 9 tutorials published to unique urls with a contents page linking to each lesson. If I make a PDF version for distribution of document sharing websites, will it create a duplicate content issue? The objective is to get a half decent link, traffic to supplementary opt-in downloads.
Technical SEO | | designquotes0 -
How can i see the pages that cause duplicate content?
SEOmoz PRO is giving me back duplicate content errors. However, i don't see how i can get a list of pages that are duplicate to the one shown. If i don't know which pages/urls cause the issue i can't really fix it. The only way would be placing canonical tags but that's not always the best solution. Is there a way to see the actual duplicate pages?
Technical SEO | | 5MMedia0 -
How do i deal with duplicate content on the same domain?
I'm trying to find out if there's a way we can combat similar content on different pages on the same site, without having to re write the whole lot? Any ideas?
Technical SEO | | indurain0 -
Seomoz is showing duplicate page content for my wordpress blog
Hi Everyone, My seomoz crawl diagnostics is indicating that I have duplicate content issues in the wordpress blog section of my site located at: http://www.cleversplash.com/blog/ What is the best strategy to deal with this? Is there a plugin that can resolve this? I really appreciate your help guys. Martin
Technical SEO | | RogersSEO0