Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Content Spinner Tool??? Worth? Recommendations?
Hi all !! We have different websites in the english language (UK, IE, US, etc...), but we focus our content strategy (landing pages) in just one of them. We were thinking about using a Content spin tool to use this content in the other websites and give them an extra push in terms of SEO (content is the king :)). Did you have any idea if they work fine? Did you have any experience with them? Can you recommend any? Are they tools really worth or they are not working fine and after the spin a full review (and probably some re-write) is necessary? Of course I´m talking about the paid ones.... Not even thinking about the free tools 😉 Thanks for your help in advance !!!
Content Development | | AutoEurope0 -
Why is content getting longer?
I find it odd that with the way life is today -- the gotta-have-it-now, instant gratification, can't hold someone's attention span for longer than 3 seconds -- why Google is wanting content to be REALLY long?? I've read articles saying content should be as long as 2,000 words per page. This just seems nuts to me. No one wants to read anymore. Look at how short Twitter posts are and how videos are so prevalent now. Any thoughts?
Content Development | | SEOhughesm0 -
The etiquette of reproducing someone else's content
Hello - Here is a scenario, representative of something that I just saw play out. Site A is a new blog about travel (as an example topic) Site B is an older, established blog about travel Site C is a new blog launched and owned by Site B that focuses on a particular travel niche (luxury travel, for example) Here is what happens next Site A writes an original piece of content Site C then republishes Site A's content, paraphrasing all of the text, but giving Site A credit with a link Site B (the established site) publishes a blurb about the article, directing readers with a link to "read more" on Site C. It credits Site A as the original author, but does not link to it. If you were able to follow that, here is what I would like to know. Did Site C do anything wrong by republishing a paraphrased version of Site A's content, even though it gave credit with a link? Did Site B do anything wrong by linking to Site C (which is for all intents and purposes the same website), but not linking to Site A (the original source)? My sense is that the established blog (Site B) is trying to get it's new publication (Site C) to outrank the original author (Site A) using its own content. In general though, I am curious to get some thoughts on this situation because it raises a few ethical questions that I am not sure about, namely: Is there anything wrong with publishing "spun" content, if it is done well and links back to the source? Is there anything wrong with linking to a republished version of an article on a sister website, rather than linking to the original article. Thanks
Content Development | | timsegraves1 -
Thin/spun content or No content at all?
So I'm working on a site right now that has been plagued with spun/rewritten content..we are talking in the thousands. Now would it be a smarter idea to just remove all the content or leave the content? This is an ecommerce site and I'm tasking a few people to rewrite fresh authentic content. So all of the spun/thin content will be removed eventually. The main question is to just delete it all right now or just replace as we go. I'd like to know what you guys think.
Content Development | | William.Lau0 -
Is it possible to aggregate news content?
Currently all of our content is produced internally, but I would like to expirement with aggregating 50% of our content from external publishers. The content wouldn't be full articles, but instead the excepts with a link back to the source. Is there a way to aggregate content without being penalized? We are using wordpress.
Content Development | | ejovi0 -
Filling Up Content For A New News Publishing Site
Hello, SEO Gurus. I have a client whom I've been working with for a few months now, and part of our service offering is to publish and promote fresh, daily content on his site's blog. This strategy has been a huge success thus far, he is very happy with the content, etc. Now, he is getting ready to launch a second site, which will be a news publishing site for his industry niche, and we will once again be providing the content on a daily basis: we're going to be producing 10 to 15 articles a day. It's a big operation for us. The client, however, is concerned that he doesn't want the site to appear "thin" on content in the early going, and asked if it would be possible to populate the new site with the articles we wrote on the other site's blog. My gut reaction to this is that it would be an exceedingly bad idea to do this. While we are the ones who authored the original content (and we've used author tags and publishing markup), the best bet is to simply start fresh. Besides that, seeing as we'll be pumping out tons of content on a daily basis, it won't take long to fill up the content coffers. That being said, I just wanted to run this past you all and see if anyone had any alternative ideas on how to use the old content without it being duplicate content. I was thinking that maybe designating all of the old articles with noindex, nofollow could be an option? Many thanks in advance for your time and attention. Sincerely, Mike
Content Development | | RCNOnlineMarketing0 -
If we use content copied from another site ( assuming we have not plagerized), does it hurt our Google Rankings?
We have permission from another company to copy their content and use it on our site. This happens when we are describing a manufacturer's product and we copy pages from their site and add these pages to our site in order to describe the product we are selling. Is this considered duplicate content? Can this practice hurt us?
Content Development | | huskers0 -
Why is this store getting hurt in SERPs when they removed duplicate content?
I work with an e-commerce client who got hit hard by Panda. They are very cautious, and want small-scale tests to prove each hypothesis before committing to larger changes. Recently, we reworked content on 30 product detail pages. Before, these product pages featured some original content mixed with some manufacturer content. The change we made was to remove the manufacturer content completely from the product page, leaving about 300 words of high-quality, original content--all of which was written by subject matter experts. I assumed that Google viewed this manufacturer text as duplicate content. However, when these 30 modified pages were compared to the control, they performed significantly worse. Question 1: Does any have any idea why these pages would perform worse than the control?
Content Development | | merch_zzounds
Question 2: Do you have any tips for convincing this client to try another test or get the buy-in to make the larger changes that--in theory--need to happen? FWIW, this client has about 10,000 product detail pages--the vast majority of which contain just manufacturer content. I appreciate your thoughts.0