Press Releases and Duplicate Content on an Event-Related Site
-
I have a site that lists events. I ask those submitting events to submit original content if possible, but frequently they submit press releases which are already published elsewhere. I rewrite some of the press releases, but do not have time to rewrite every press release that comes my way. I want my users to get a comprehensive list of events, but I don't want to get a penalty for duplicate content. What is the best solution?
-
The syndication-source tag can be used if you are not worried about your pages appearing in Google News.
You may also use "noindex" or "canonical" if you want to play it very safe. But as you know, your pages then won't be indexed (noindex) or will usually give way to the original page in search results (canonical).
If you mention the content source in prominent places, you will be O.K.
Example:
In the title you could say "....event by ........."
In the body you could say "...thanks to .........."
As long as the overall percentage of duplicate content on your site stays at a reasonable level and you link back to the content source, you are unlikely to run into any problems.
Google has clarified that duplicate content alone will not lead to a negative impact:
http://googlewebmastercentral.blogspot.com/2008/06/duplicate-content-due-to-scrapers.html
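
To make the three tag options above concrete, here is a minimal sketch of how a listing template might choose the head markup for each event page. The function name `head_tags` and the strategy labels are hypothetical, invented for this illustration; only the tag formats themselves (robots noindex, rel=canonical, and the syndication-source meta tag) are the real ones discussed in this thread. Support for syndication-source may have changed since this was written, so check the current Google News guidance before relying on that branch.

```python
from html import escape


def head_tags(strategy: str, source_url: str = "") -> str:
    """Return the extra <head> markup for one event page.

    `strategy` is a hypothetical label used only in this sketch:
      "original"    - content written for this site, no extra tag
      "noindex"     - keep the page out of the index, still follow links
      "canonical"   - point search engines at the original press release
      "syndication" - the Google News syndication-source meta tag
    """
    if strategy == "original":
        return ""
    if strategy == "noindex":
        return '<meta name="robots" content="noindex, follow">'
    if not source_url:
        raise ValueError("source_url is required for " + strategy)
    # Escape the URL so characters such as & and " cannot break the attribute.
    url = escape(source_url, quote=True)
    if strategy == "canonical":
        return f'<link rel="canonical" href="{url}">'
    if strategy == "syndication":
        return f'<meta name="syndication-source" content="{url}">'
    raise ValueError("unknown strategy: " + strategy)
```

A rewritten press release would use "original", while a verbatim one would use whichever of the other three fits the trade-off described above.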
-
Thanks Devin,
That seems pretty reasonable and in tune with what I was thinking myself. Unfortunately, I think that if I want to provide users with a complete list of events I will probably have to include some press release material. A couple of further related questions might be:
- Should I be trying to prevent search bots from crawling pages if the content is not original (by using a noindex tag and/or a robots.txt file)?
- I wonder if the syndication-source tag might be more relevant than the canonical tag?
http://support.google.com/news/publisher/bin/answer.py?hl=en&answer=191283
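
On the robots.txt option in the first question, a small sketch may help show what such a rule would actually block. It assumes a hypothetical URL layout in which verbatim press releases live under /press/ and rewritten listings under /events/; the rules and the helper `is_crawlable` are made up for illustration, and the check uses Python's standard `urllib.robotparser`. One caveat worth keeping in mind: a crawler that is blocked by robots.txt never fetches the page, so it cannot see a noindex tag on it. The two are alternatives for a given URL rather than something to stack.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: block crawling of verbatim press releases only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /press/
"""


def is_crawlable(path: str, agent: str = "Googlebot") -> bool:
    """Return True if the rules above let `agent` fetch `path`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)
```

With these rules, `is_crawlable("/events/jazz-night")` is allowed and `is_crawlable("/press/acme-launch")` is not.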
-
Hi Andy,
A non-technical solution would be to have an email template and send it to whoever submits a press release. Something like...
"Dear so and so,
Thank you very much for your submission to my site. I would be delighted to include your event in my listings. Unfortunately I am very wary of featuring a press release that has been published verbatim on other websites. I would really appreciate it if you could alter your submission to include [whatever specifications you need].
Once I receive this I will publish your event immediately."
Alternatively, you could use the rel=canonical tag pointing at the original page the press release was published on. This would keep you from being penalised for duplicate content. However, your page will be less likely to appear in searches for that event. I'd imagine you'd prefer your own pages to appear in the SERPs, so this probably isn't ideal. I've never had to deal with this kind of problem myself, so I'd be very interested to hear what others have to say.
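
If you do go the rel=canonical route, it is worth confirming that each published page really points at the original release. The sketch below is one way to check that with Python's standard `html.parser`; the class and function names are hypothetical and it only reads the first canonical link it finds, so treat it as a starting point rather than a full audit tool.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Remember the href of the first <link rel="canonical"> in a page."""

    def __init__(self) -> None:
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        # rel may hold several space-separated tokens, so split before testing.
        rel = (attr.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel and self.canonical is None:
            self.canonical = attr.get("href")


def canonical_of(html: str):
    """Return the canonical URL declared in `html`, or None if there is none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical
```

Comparing `canonical_of(page_html)` with the URL of the original press release for each syndicated listing would flag pages where the tag is missing or points somewhere unexpected.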