Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Cornerstone Content?
Yoast keeps pestering me about Cornerstone Content. Is it really a ranking factor? Ryan
Content Development | | drdougweiss0 -
Safest Way to remove a blog?
I have a Magento site that is around 4 years old. It has 2 different wordpress blogs on the same domain. domain/blog domain/nicheblog I would like to completely remove the 2 blogs as the information on them is of low quality and its outdated information. What is the safest way for me to remove this content with out having negative effects on my rankings? thanks
Content Development | | Shop-Sq0 -
Any recommended affordable content writer or services?
Any recommended affordable high quality content writer for Ayurveda health products website?
Content Development | | JordanBrown0 -
Where to add content
Hello, In looking at GA for a client, his top 100 landing pages are all category pages with only a slight amount of articles and product pages. We haven't added content to the product pages, we just rewrote descriptions for unique content. They are about 100-200 words per product. Does that mean we should focus on adding content to category pages first? We're thinking of totaling 500 words or so (though less sometimes) of quality content to category pages. Your recommendations?
Content Development | | BobGW0 -
Duplicate Legal Content
Oftentimes lawyer websites will publish laws (codes, statutes, regulations, case law, etc). They add no value to the text, it's just copy pasted. Therefore, the same text/content may be on potentially hundreds of websites. Does google interpret this as duplicate content, or does it recognize government content as special? I want to have the laws on my website as well, however I am debating whether to add no follow tags or not. Or I'm thinking about adding value to the content by breaking down the specific law. However, even then at least 50% of the content on the page will still be the law, and I'm not sure if that is enough to be considered duplicate content.
Content Development | | irnikij0 -
Will our two retail sites get hit with duplicate content?
Our retail site just rolled out a second online store. The URL is new and it is showing some of the same products from the same vendors (probably about 40% of the fist store is in the second store). Down the road, we will remove the products from the first site, however, we are keeping it for now. The products show up on both sites, with the same images, and the same descriptions and almost the same URL query string. Are we going to get hit with any penalties due to duplicate content?
Content Development | | klmarketing0 -
How to organize content for ecommerce site
Hello, We've decided to create 24 articles of content for our ecommerce site, everything from an FAQ to history of the products to 10 articles on the top 10 products. Really useful to the user. How do you suggest that we make our content visible to the users? We could put a nice button on our right banner that says "Extensive Help Session" or we could put a banner on our home page or we could make it a tab at the top of the screen. We could additionally make a well organized footer with links to the articles. Or we could do all of those but that might be overkill. What do you suggest?
Content Development | | BobGW0 -
Is it considered as duplicate content ?
Hello, I see a lot of errors on my webmaster tools because of this ajax code on my questions pages of the site (screen) : www.dismoicomment.fr The code : | / ADD ANSWER FORM |
Content Development | | elitepronostic
| | $("#answer-add-button").click(function () { |
| | $.ajax({ |
| | type: 'POST', |
| | url: '/answers/quelle-assurance-choisir-pour-un-scooter/', |
| | data: $("form#answer-add").serialize(), |
| | dataType: 'html', |
| | success: function(data) { |
| | |
| | if(data=="answer") { |
| | $('.answer-add-message').show().empty(); |
| | $(document).ready(function() { |
| | $(' Vous avez déjà répondu à cette question. ').appendTo('.answer-add-message'); |
| | }); | I have add a line on my robots.txt : http://www.dismoicomment.fr/robots.txt for remove all urls with /answers/. These urls with /answers/ aren't indexed in google. Do you think that it is dangerous and that can be considered as duplicate content ? 1129546035.png0