How to measure the penalty of duplicate content if we populate our provider bios on WebMD?
-
I work for a large healthcare system and we have an initiative to populate 2,500 of our our provider bios on WebMD. The proposed method for providing content is to supply it via API, in exactly the same way provider bio content appears on our site.
When my colleague and I pointed out this would be an anti-practice as it would be disseminating duplicate content, we were asked to weigh:
- The penalty of the duplication
- The time and resources necessary to provide an alternative method (i.e., is there a programmatic way to supply unique content to WebMD)
A few other questions we are investigating is if we can include links to each provider bio from WebMD to our main site. If this is the case, we can include a very short intro and direct users to our site if they want to learn more. The benefit of being included on WebMD is showing up for searches pertaining to expertise/specialties, as this will open our system to new users who likely won't search our providers by name.
Any advice on how to measure the potential effect of displaying duplicate content on WebMD, considering their impressive domain authority?
-
Thanks, all. I'll present these findings to our organization and we'll go from there.
-
No worries
-
Thanks Andy for sharing that post!
-
No problem at all John - please reply back if you have any other questions.
-Andy
-
Andy, glad I read that post - a great one. Thanks.
-
Hi,
I just want to address a point that has been missed here because duplicate content across domains is one of the Panda signals to Google and can end up resulting in an algorithm hit. Remember that how Google treats your own internal duplicate content and that on an external site are very different.
A good rule of thumb is do NOT expect to rank high in Google with content found on other, more trusted sites, and don’t expect to rank at all if all you are using is automatically generated pages with no ‘value add’.
Have a read of this article as it runs through lots of information regarding duplicate content. Here is another excerpt to be mindful of...
…in some cases, content is deliberately duplicated across domains in an attempt to manipulate search engine rankings or win more traffic. Deceptive practices like this can result in a poor user experience, when a visitor sees substantially the same content repeated within a set of search results. Google tries hard to index and show pages with distinct information. This filtering means, for instance, that if your site has a “regular” and “printer” version of each article, and neither of these is blocked with a noindex meta tag, we’ll choose one of them to list. In the rare cases in which Google perceives that duplicate content may be shown with intent to manipulate our rankings and deceive our users, we’ll also make appropriate adjustments in the indexing and ranking of the sites involved. As a result, the ranking of the site may suffer, or the site might be removed entirely from the Google index, in which case it will no longer appear in search results. GOOGLE.
I would never advise duplicating content to be used across different domains - this is a very bad practice and one that should be avoided at all costs.
CleverPhD has advised the best way to handle this and re-write the content for the Bios.
-Andy
-
Just to follow-up on Russ' point, if you want to estimate cost. Contract out a couple part-time writers to go and do some web research on the providers and rewrite the bios/profiles. You will need someone from your internal team to supervise the part-timers who is familiar with the healthcare industry and writing to look through and make sure that what the writers put down is correct. This should take you 4-6 months. Your costs will be the 60-70% salary for the full-time person (as they will not just be doing this project), plus plan to pay about 20 bucks an hour for 20 hours a week from each part timer. You can adjust and get another (third) part-timer if you like for a bit more cost but faster results.
We did this for about 2,000 locations for a site I work on. We found that you would not want to have anyone doing this full-time as they would probably go insane and quality suffers. Find a way to break up the tasks so that persons spend part of the time researching, part time proofing the other's work and part time writing. Helps with a better output. Sure, you could use software to "spin" the bios, but they would come out looking like crapola. That was why we used people and were happy with the results.
We did see a significant jump in our organic traffic, so for us it was worth it. You may take a look and decide not to, but wanted to put this option out there.
-
You aren't going to suffer a penalty from this. There really is no such thing as a "duplicate content penalty", just the chance that you will be out-ranked. If you want to quantify the potential risk, just look at all the organic traffic to those bio pages on your site and determine what would happen were you to lose rankings for some percentage of them. My guess is that you won't lose rankings dramatically and, when you do, it will just be the 1 position supplanted by a now-even-better ranking from WebMD.
That being said, duplicate content is best if you can avoid it. If it is possible, find a way to modify the bios on-the-fly as they are syndicated. Make sure you include brand mentions in the opening paragraph (you could use a boilerplate sentence or two to start off each bio).
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Domain Transition: Leaving low quality content behind
We're in the initial stages of planning a domain transition / rebrand. We're considering 301'ing our low and high(er) quality content split to two different domains. One for the low quality, one for our high. Best practices normally tell you to not split your content between between multiple domains. However, what if the majority of pages on your site are thin/outdated, and attract low volume/long tail? Does it make sense to bring that low quality/volume content over the new domain, when you know you'll never have the resources (nor would it make sense to) mass improve the quality of these pages? I'm concerned the quality of these pages are affecting our overall domain authority. Some background on our site/business: Current site has 15,000+ pages. 98% of our site is a product directory of professional/enterprise business management software. While a small handful of our product pages have quality original long form content (maybe 50-100), most of the product pages are a combination of: thin, outdated, overly sales-y content provided directly from product developers, and/or catch only very low-volume/long tail organic traffic. 95% of our pages attract fewer than 20 visits/mo, 90% of our pages attract fewer than 10 visits/mo. We have a small business of about 10 employees. Most of which don't maintain our site. It's unrealistic for us to genuinely improve the quality of that many pages. Nor does it make sense to improve most of these pages, as they'll attract only very low volume keywords. Individually these low quality pages don't bring in many customers, but on aggregate they do. 70% of our organic conversions come from pages with less than 20 visits/mo. A few questions: Is this content negatively affecting our domain authority in any way? While I don't believe we've been hit with a penalty, Google knows that on average our pages aren't very helpful to many users, and I'm concerned that affects our ability to rank with pages that matter. None of the content was mass produced in any form of scraping efforts or anything nefarious like that. Would there be any negative/positive affect to offloading these low quality/volume pages to a different domain during the rebrand?
Branding | | dsbud0 -
How to Measure Impact and Potential Strategies for Competitors with Similar Brand Name in the Same Industry
Hello Everyone, So we have a site (brand1.com) but one of our competitors has a very similar brand name and domain (brand2.org). They’re similar enough that could be confused by users and search engines and target the same topics. When you do a manual brand search they would come up and both have about the same Domain Authority. Assuming we can’t have them take their site down do any of you have any thoughts on how we could potentially measure potential impact they might be having or ideas on how best to approach this? Our thought was to track what they are and are not doing so we could do it at a higher level or fill in what’s missing. We would also emphasize differences with an emphasis on local optimization (they’re in a different area). We would love to be able to have some concrete data on whether they’re having an impact so thought we would find out if any of you have any experience or insight? Any help would be very much appreciated. Please let us know if there’s any further details we could provide that might help. Looking forward to hearing from all of you! Thanks in advance. Best,
Branding | | Ben-R0 -
Content Advice for SEO Newbies
Hi all, I've been asked to put together a presentation as part of an internal series for marketers within the company that don't know much about SEO, but want to learn the basics and contribute. My topic for this one is on-page SEO/content marketing's role in SEO. I have lots of ideas for this already, but I thought I'd turn to the Moz forum to get some feedback and help me prioritize the points I hit. So, if you could give SEO newbies working on content for a company site, blog, etc. just one piece of advice, what would it be? Looking forward to seeing your responses. Thanks, Andrew
Branding | | SafeNet_Interactive_Marketing0 -
Linkedin: Inshares - Can I see who inshared my content?
Hi All, Just wondering... since the demise of Linkedins' Signal tool, is there a way to actually see who and where my content is being shared on Linkedin? Blog posts being published at the minute are getting inshares almost as soon as they're live and I want to know who's doing it. Any advice would be appreciated.
Branding | | SanjidaKazi1 -
Social Media Content - Duplicate Content?
Hi All, What's your opinion on sharing the same content across your social media outlets. We are targeting only slightly different markets across each social media outlet. I find it hard to develop content for each outlet 3-5 times a week. There really is so much to share. At the same time, I wouldn't want to get canned for any duplicate content or anything like that. Along those lines, can anyone provide some advice on which social media outlets are "followed" vs. "not-followed," both in terms of links and overall indexing? Thanks!
Branding | | CSawatzky0 -
Duplicate Content Question
I have a question about duplicate content. We have our mission statement on our home page, a few paragraphs. When I searched Copyscape the only pages that came back were sites like Google Plus, Manta, Linkedin, AngieLists ect. All of them have the same exact copy. Would this be something that is hurting us for duplicate content??
Branding | | chuck-layton
It is our mission statement so we kind of want to be the same across those sites. Any input would be great. Thanks, Scott0 -
Content Marketing for E-Commerce Sites
Let's have a real discussion about content marketing for B2B and B2C e-commerce sites. As an SEO/inbound marketer (these days, I'm not sure what to call myself other than my first name), it's part of my job to keep a pulse on what's going on in the online marketing community. My daily routine starts with checking several sites for news/discussion (Moz, Inbound.org, SearchEngineLand, etc). Anyone actively involved in the community knows the word "content" appears in more articles than any other word (ok, maybe there a few others). Want to increase brand awareness? Generate content. Want to drive more traffic to your site? Generate content. Want to build quality links? Generate content. Want to discover the Higgs particle before the physicists? Generate content (and distribute to the right audience, so not to the chemists - ok maybe to the chemists, they're a related audience). Content, content, content, we're told! Yes I did see the Rand's WBF from a couple months back about content-less marketing, but frankly his suggestions fall under the traditional model of advertising and word-of-mouth. We're online marketers baby, we're expanding and changing the traditional model - with content! Enough of content marketing about content marketing. Let's see some content marketing for the small B2C, mom n' pop client who sells gardening tools. Let's see the amazing infographic you made for your local pizzeria client that drove traffic to their site. Let's see the Q+A discussion thread you identified and contributed to as means to display 'market leadership' in your niche of home air purifiers. Look, I love the idea of content marketing to increase brand awareness and drive traffic. Displaying market leadership by answering questions and offering something beneficial to your target audience should be the way to grow business (along with having a good product/service, I guess). But it's much easier said than done. And to be clear, I never expected otherwise. The motivation for this post was to start a discussion about real-world, applied content marketing, not content marketing about content marketing. Let the conversation begin.
Branding | | b40040400