Remove Scraped Content?
-
I work for a site that is not the top result in Google when you search for a snippet of text from its own content. I believe what happened is that they wrote blog posts and articles and added them to their site and to article directories at the same time, and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link back to your site wherever possible. If not, the page should be dated by creation and last update, which Google can see. I would not leave anything to guesswork, though: make sure you have links, and I would even show the posting date on the post itself on your site, the way news articles do. It is just another indicator.
I would not remove the content if it did, in fact, originate from you.
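If you want to check whether the directory copies actually link back to you, a short script can do it. This is only a rough sketch, not a definitive tool: the function name `links_back_to`, the sample HTML and the `example.com` domain are all made up for illustration, and it only inspects HTML you have already saved or fetched yourself.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def links_back_to(html, own_domain):
    """Return the links in `html` whose host is `own_domain` or a subdomain of it."""
    parser = LinkCollector()
    parser.feed(html)
    matches = []
    for href in parser.hrefs:
        # Relative links have no hostname, so they never match.
        host = (urlparse(href).hostname or "").lower()
        if host == own_domain or host.endswith("." + own_domain):
            matches.append(href)
    return matches


# Hypothetical directory copy of an article, with an author-bio link.
sample = """
<article>
  <p>Some article text.</p>
  <p>By the team at <a href="https://www.example.com/blog/post">Example</a>.</p>
  <a href="https://directory.example.net/more">More articles</a>
</article>
"""

print(links_back_to(sample, "example.com"))
```

An empty list for a directory copy means that copy gives no link back to your site, so it is one you may want to ask the directory to fix.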
-
Yes, it was intentionally distributed. I would like to know whether Google sees the duplicate content on our site as copied, unoriginal or scraped, as if we pulled it from another source because we're so lazy we can't come up with any material of our own.
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
"If we're not coming up first for our article, that means we are not believed to be the original author, correct?"
Search engines cannot identify original authors (unless you use the rel="author" attribute, and then they are merely taking your word for it). They only know which page with the content was discovered first. The content could have been on other pages first, or it could have been published offline first. Search engines don't have divine powers.
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. It has nothing to do with authorship.
"Should I remove all content from our site where this is happening, even though we actually did create these articles?"
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The takeaway from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing ones.