How to prevent duplicate content at a calendar page
-
Hi,
I've a calender page which changes every day.
The main url is
/calendarFor every day, there is another url:
/calendar/2012/09/12
/calendar/2012/09/13
/calendar/2012/09/14So, if the 13th september arrives, the content of the page
/calendar/2012/09/13
will be shown at
/calendarSo, it's duplicate content.
What to do in this situation?
a) Redirect from /calendar to /calendar/2012/09/13 with 301? (but the redirect changes the day after to /calendar/2012/09/14)
b) Redirect from /calendar to /calendar/2012/09/13 with 302 (but I will loose the link juice of /calendar?)
c) Add a canonical tag at /calendar (which leads to /calendar/2012/09/13) - but I will loose the power of /calendar (?) - and it will change every day...
Any ideas or other suggestions?
Best wishes,
Georg.
-
Ah... yeah, that's tricky. There's no magic solution, I'm afraid. You've really got three options:
(1) Leave it alone
(2) Re-organize your site architecture to push individual date pages down a level or two, so that they get less internal link-juice.
(3) Re-organize such that you focus search engines on chunks of time or maybe date/aspect combinations, but then de-index the individual date combos. This would take a much better understanding of your site structure than I currently have. The goal would be to focus your index on some smaller combination of pages that still covers 80% of your search traffic.
The big problem is just that this is a lot potential dilution, and I suspect that many of these pages look very similar to Google. I'm also certain that not all pages have the same value, either for SEO or users, so there's some hybrid approach where you could prune back but not lose everything. Long-term, I think that's worth the time and trouble to sort out, but it's not an emergency or something I'd rush into.
-
Hi Peter,
thanks for your answer!
Well, it's even more complicated!
It's an astrology calendar with planet aspect data for each day starting from 1900-01-01 to 2099-12-31, so there are around 73,000 pages, it's a big database.
People are searching for a date and the planet aspects. So I need the "old pages" and the future pages in the index.
People are also searching their birthday and want to know their zodiac. My calendar is providing this info.
This is an example:
http://www.schicksal.com/horoskop/tageshoroskop/1951/09/10The best thing is to do nothing at the moment I think. The alternativ is to cut the content of the current day from the main page and let the user click a button which redirects to the current day page. But this is not user friendly and I will do nothing at them moment.
Any other idea would be great
Best wishes,
Georg.
-
Sadly, the short answer is that you can't have it all. Either you index the separate calendar pages, get more pages/content out there and risk some "thinning" of your index, or you focus on one page, maximize the SEO value, but then lose the individual pages.
I would not 301 or 302 to the individual calendar URLs - that kind of daily URL shifting is going to look suspicious, Google will not re-cache consistently, and you're going to end up with a long-term mess, I strongly suspect.
I actually tend to agree with Muhammed and Paragon that a viable option would be to let the individual days have their own content, but then canonical to the main calendar page to focus the search results. That way, users can still cycle through each individual day, but Google will focus on the core content. In a way, that's how a blog home-page works - the content changes daily, but you're still keeping the bots focused on one URL.
Think of it in terms of usability, too. How valuable is old/outdated content to search users? They might find something relevant on an old page, but they still probably want to see the main calendar and view recent content.
Where are the links to the individual days, if "/calendar" always has today's content? I'm wondering if there's a hybrid approach, like letting the most recent 30 days all have their own URLs, but then redirecting or using rel-canonical to point to the main page after 30 days.
-
What about adding to all of the other pages i.e not to /calendar/ the links will be followed but not indexed by Google.
-
Hi Georg,
Setting up a redirect or canonicalization for the the calendar page in the ways you describe might make it harder to build up any kind of authority for your calendar.
You could consider adding canonicalization for all the individual day pages that points to the main calendar page. ie. Each page /calendar/YYYY/MM/DD would have rel canonlical=/calendar/. Not sure this is the best idea though.
I don't know how your calendar is setup but you could also look at differentiating the pages by doing just a listing of events on the main page and including summaries or detail on the current day page. Or maybe including some additional information about your calendar on the main page like what type of events are included and how to submit events and not including that information on the individual day pages.
I've always taken the approach of minimizing duplicate content as much as possible but not getting excessive with it. I think in a case like this you could do more harm than good. The calendar page is an ever changing page, it's not like you have the exact same static content on two pages.
Hope this helps!
Zach
-
Hi Muhammed,
because the content is different. This would devaluate all calendar pages.
Best wishes,
Georg. -
Hi Georg What about adding canonical tag(s) from each days (/calendar/2012/09/13) calender pages to the main page (/calendar)
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Content incorrectly being duplicated on microsite
So bear with me here as this is probably a technical issue and i am not that technical. We have a microsite for one of our partner organisations and recently we have detected that content from our main site appearing in the URLs for the microsite - both in search results and then when you click through to the SERP. However, this content does not exist on the actual website at all. Anyone have a possible explanation for this? I have tried searching the web but nothing. I assume there is something in the set up of the microsite that is associating it with the content on the main site.
Technical SEO | | Discovery_SA0 -
Fullsite=true coming up as duplicate content?
Hello, I am new to the fullsite=true method of mobile site to desktop site, and have recently found that about 50 of the instances in which I added fullsite=true to links from our blog show as a duplicate to the page that it is pointing to? Could someone tell me why this would be? Do I need to add some sort of rel=canonical to the main page (non-fullsite=true) or how should I approach this? Thanks in advance for your help! L
Technical SEO | | lfrazer0 -
Affiliate urls and duplicate content
Hi, What is the best way to get around having an affiliate program, and the affiliate links on your site showing as duplicate content?
Technical SEO | | Memoz0 -
Duplicate Content - Mobile Site
We think that a mobile version of our site is causing a duplicate content issue; what's the best way to stop the mobile version being indexed. Basically the site forwards mobile users to "/mobile" which is just a mobile optimised version of the original site. Is it best to block the /mobile folder from being crawled?
Technical SEO | | nsmith7870 -
How do I get rid of duplicate content
I have a site that is new but I managed to get it to page one. Now when I scan it on SEO Moz I see that I have duplicate content. Ex: www.mysite.com, www.mysite.com/index and www.mysite.com/ How do I fix this without jeopardizing my SERPS ranking? Any tips?
Technical SEO | | bronxpad0 -
Where to put content on the page? - technical
The new algo update says any images at the top of the page negatively affect user experience if they are adverts? how does google know if its an advert or relevant banner? When trying to put text as far up as possible on the page, is it ok to make it appear higher in the code but appear further down using css? Or does Google not go from the code top to bottom when working this out, more how it renders? Any advice much appreciated.
Technical SEO | | pauledwards0 -
Duplicate content, how to solve?
I have about 400 errors about duplicate content on my seomoz dashboard. However I have no idea how to solve this, I have 2 main scenarios of duplication in my site: Scenario 1: http://www.theprinterdepo.com/catalogsearch/advanced/result/?name=64MB+SDRAM+DIMM+MEMORY+MODULE&sku=&price%5Bfrom%5D=&price%5Bto%5D=&category= 3 products with the same title, but different product models, as you can note is has the same price as well. Some printers use a different memory product module. So I just cant delete 2 products. Scenario 2: toners http://www.theprinterdepo.com/brother-high-capacity-black-toner-cartridge-compatible-73 http://www.theprinterdepo.com/brother-high-capacity-black-toner-cartridge-compatible-75 In this scenario, products have a different title but the same price. Again, in this scenario the 2 products are different. Thank you
Technical SEO | | levalencia10 -
Thin/Duplicate Content
Hi Guys, So here's the deal, my team and I just acquired a new site using some questionable tactics. Only about 5% of the entire site is actually written by humans the rest of the 40k + (and is increasing by 1-2k auto gen pages a day)pages are all autogen + thin content. I'm trying to convince the powers that be that we cannot continue to do this. Now i'm aware of the issue but my question is what is the best way to deal with this. Should I noindex these pages at the directory level? Should I 301 them to the most relevant section where actual valuable content exists. So far it doesn't seem like Google has caught on to this yet and I want to fix the issue while not raising any more red flags in the process. Thanks!
Technical SEO | | DPASeo0