Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should cornerstone content have 3,500 words? Does Google discern words from the main text and from the references?
Is it true that cornerstone content should have at least 3,500 words? I've done some research and found that the recommended amount is between 2K-10k. Also, the content that we create/publish has a lot of references/citations at the end of each article. Does Google discern words from the main text and from the references? Meaning should I count references as part of the word count? Thanks for the help!
Content Development | | kvillalobos0 -
Duplicate Content for Non-SEO Purposes
Duplicate Content for Non-SEO Purposes There are a few layers to this question, but at the most basic level the question is... -Will having the same article (in the form of archived e-newsletter issues) on multiple different websites' newsletter archives HURT those sites? I'm fairly sure it won't HELP any of them in terms of SEO, but will having these back issues of their e-newsletters archived on their websites get them penalized? For the purpose of this question, these are not clients we are doing SEO for, just hosting and their e-newsletters. So it's fine if the archives provide no SEO benefit, we just don't want to leave them up if they will become LIABILITIES for the websites. -If having the same article in archived issues of e-newsletters on multiple different websites WOULD be harmful, would moving these archives to a sub-domain change anything or would it be best to simply take the archives down altogether? -Alternately, would spinning these articles make any difference in whether or not these sites get penalized? -Lastly, would spinning make the articles usable for archived e-newsletters for clients that ARE signed on for SEO services? I have a hunch about this, but I'd love to hear your expert opinions. Thanks!
Content Development | | BrianAlpert780 -
Reposting content.
I have some good articles I wrote for article directories a couple of years ago. I took them down 6 months ago. I am hoping to repost them somewhere better if the content isn't listed on google and passes Copyscape. Would this be safe?
Content Development | | T0BY1 -
Duplicate content problem
Hi, i have a serious problem. I work in joomla and sometimes it can be annoying. When you set up a category, you need to give it a name and maybe this is a huge error on my part as i did not really think about the names beforehand. The situation i have now is, all my sections are in front page mode, but because you have to name the categories in order to write articles, i am now left with a load of blog sections such as http://www.in2town.co.uk/benidorm/benidorm-news Now i have a main section called Benidorm news so i have duplicate sections, i want to know if i can redirect the http://www.in2town.co.uk/benidorm/benidorm-news to go to the main benidorm section or if there is a better way of doing it. i have left this blod layout the way it is to show you, but the others i just have it where it shows the title and then goes to the article. I work in k2 and would be grateful if anyone can let me know the solution to this as semoz is showing that i have many duplicate titles and content many thanks
Content Development | | ClaireH-1848860 -
Thumbs up or thumbs down to content rotators
Hi there - Our team is in the process of a website redesign. We're currently using a content rotator and are wondering if any folks have data to support whether this is actually a good practice despite it's popularity? Overall, I'm not impressed by the click throughs as a percentage of site traffic and most of our visitors are not repeat visitors so this may not really be necessary. Thoughts and experiences appreciated!
Content Development | | pasware0 -
Removing and resubmitting an article to another blog
I received a guest post earlier from an SEO company. I have published it, and it's been indexed in Google. Now the person wants me to remove the guest post, probably because they already have a guest post with a link pointing to the same client on my blog. I don't mind removing it, but are their any negative aspects in republishing that same article on another blog? Would it be a mistake to remove that article from my blog?
Content Development | | Briardale0 -
What is the best way to get around duplicate content when you are advertising exactly the same content on two different sites?
I am currently trying to improve exposure for an online degrees website but the content for the degree program pages is exactly the same as the company's main website. What would you suggest for getting around the duplicate content issue as a lot of the curriculum content will obviously be the same for each module, etc? Thanks
Content Development | | BeattieGroup0 -
Keeping web site fresh re content
We currently use a wp blog but we don not host on the web domain. What are the advanatges to moving the blog to the domain The only 1 I can think of is every time we update the blog this should help keep the web site fresh with new content .
Content Development | | NotThatFast0