Spam posts indexed, what to do now?
-
Hi,
So we had a staff problem last week and we let some spam posts (cheap nike jerseys etc.) that also got indexed by Google. (We just checked and there are lik 105 already indexed)
Of course we have now removed all these spam posts but what is the best practice at this point? Are we supposed to do something else to remove these from Google's index? (maybe through google webmaster tools?) We have already edited robots.txt to disallow those pages as a quick remedy.
And finally, could this have done any harm? We were quite slow noticing these posts to remove them. They were there for about 12 days.
thanks
-
Good to know
-
Hi,
Thanks for the comprehensive answer. We don't have any vulnerabilities. It was all my fault as I completely forgot that I had given administrative access to one of our former content managers who had temporarily allowed anonymous users to post on this certain section of the site. And once he left, we forgot to update that permission and never really noticed those posts, until today.
-
haha I just say you said "all those links had auto-nofollow on them"
NO PROBLEM MAN! rest easy! You cannot get penalized for nofollow links!
-
Thanks for the quick response. We're just requesting URL removal for all those URL's. I hope this makes it all good. No sign of ranking drop at the moment. We're lucky those pages were automatically filtered out by our sitemap.xml and all those links had auto-nofollow on them. Time to consider buying a service like Mollom I guess.
-
Do you know how the spam posts were published on your site? Just make sure the vulnerability is fixed so it doesn't happen again. Once the spam posts you found have been deleted from your site, you shouldn't have to do anything more since they will fall out of Google's index. Keep an eye on Google Webmaster Tools though to see if you notice any more spam pages pop up on Google's radar and then manually remove them.
Here is Google's official answer - http://support.google.com/webmasters/bin/answer.py?hl=en&answer=164734
When a page is updated or removed, it will automatically fall out of our search results. You don’t need to do anything to make this happen.
However, if you urgently need to remove content from Google's search results (for example, if you’ve already removed, updated, or blocked a page accidentally displaying confidential information like credit card numbers), you can request expedited removal of those URLs.
Our removal tools are intended for pages that urgently need to be removed—for example, if they contain confidential data that was accidentally exposed. Using the tools for other purposes may cause problems for your site.
Another Google resource if your site was actually hacked or compromised - http://support.google.com/webmasters/bin/answer.py?hl=en&answer=1269119
To take your site "offline" after being hacked. If your site was hacked and you want to get rid of bad URLs that got indexed, use the URL removal tool to remove any new URLs that the hacker created—for example, http://www.example.com/buy-cheap-cialis-skq3w598.html. But we don't recommend removing your entire site, or removing URLs that you'll eventually want indexed. Instead, clean up the hacking and let us recrawl your site.
-
So someone was posting articles on your site that linked to other sites like paid links?
If you removed the posts no need to block them in robots.txt because they no longer exist so will not get crawled anymore. Yes definitely request removal in WMT URL removal tool and get those pages out of Google's index ASAP.
You're probably OK. Just keep your fingers crossed and an eye on rankings and run a tight ship so that doesn't happen again, definitely something you can get penalized for. Good thing you caught it quickly.
EDIT: if you meant that you let spam comments get posted live/approved by the admin then all you can do is remove the spammy posts and make sure your comment settings are set to need admin approval before getting posed live. No need to block in robots.txt or remove URLs in that case but it doesn't hurt. If the links are off of your site you should be fine.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
My WP website got attack by malware & now my website site:www.example.ca shows about 43000 indexed page in google.
Hi All My wordpress website got attack by malware last week. It affected my index page in google badly. my typical site:example.ca shows about 130 indexed pages on google. Now it shows about 43000 indexed pages. I had my server company tech support scan my site and clean the malware yesterday. But it still shows the same number of indexed page on google. Does anybody had ever experience such situation and how did you fixed it. Looking for help. Thanks FILE HIT LIST:
Technical SEO | | Chophel
{YARA}Spam_PHP_WPVCD_ContentInjection : /home/example/public_html/wp-includes/wp-tmp.php
{YARA}Backdoor_PHP_WPVCD_Deployer : /home/example/public_html/wp-includes/wp-vcd.php
{YARA}Backdoor_PHP_WPVCD_Deployer : /home/example/public_html/wp-content/themes/oceanwp.zip
{YARA}webshell_webshell_cnseay02_1 : /home/example2/public_html/content.php
{YARA}eval_post : /home/example2/public_html/wp-includes/63292236.php
{YARA}webshell_webshell_cnseay02_1 : /home/example3/public_html/content.php
{YARA}eval_post : /home/example4/public_html/wp-admin/28855846.php
{HEX}php.generic.malware.442 : /home/example5/public_html/wp-22.php
{HEX}php.generic.cav7.421 : /home/example5/public_html/SEUN.php
{HEX}php.generic.malware.442 : /home/example5/public_html/Webhook.php0 -
Home page not indexed by any search engines
We are currently having an issue with our homepage not being indexed by any search engines. We recently transferred our domain to Godaddy and there was an issue with the DNS. When we typed our url into Google like this "https://www.mysite.com" nothing from the site came up in the search results, only our social media profiles. When we typed our url into Google like this "mysite.com" we were sent to a GoDaddy parked page. We've been able to fix the issue over at Godaddy and the url "mysite.com" is not being redirected to "https://mysite.com" but, Google and the other search engines have yet to respond. I would say our fix has been in place for at least 72 hours. Do I need to give this more time? I would think that at lease one search engine would have picked up on the change by now and would start indexing the site properly.
Technical SEO | | bcglf1 -
Quality link not indexed after two months
Hi! Bit of an odd one, but I thought I's ask. Recently I wrote an article for Smashing Mag. It's was a great success and not really an SEO exercise at all, but after several weeks my author page hasn't been indexed (http://www.smashingmagazine.com/author/sam-wright/?rel=author). I just assumed give the quality of the site that it wouldn't take that long. I know it's just a case of leaving it, by any thoughts on why its not been picked up?
Technical SEO | | Blink-SEO0 -
Noindex Pages indexed
I'm having problem that gogole is index my search results pages even though i have added the "noindex" metatag. Is the best thing to block the robot from crawling that file using robots.txt?
Technical SEO | | Tedred0 -
Removing some of the indexed pages from my website
I am planning to remove some of the webpages from my website and these webpages are already indexed with search engine. Is there any way by which I need to inform search engine that these pages are no more available.
Technical SEO | | ArtiKalra0 -
Development site accidentally got indexed and now appears in SERPs. How to fix?
I work at a design firm, and we just redesigned a website for a client. When it came time for the coding, we initially built a development site to work out all the kinks before going live. Then we relaunched the actual site about a week ago. Here's the problem: Somehow, the developer who coded the site for us (a freelancer) allowed the development site to be indexed by Google. Now, when you enter the client's name into Google, the development site appears higher in the results pages than the real site! In fact, the real site isn't even in the top 50 search results. The client is understandably angry about this for multiple reasons. We quickly added a robots.txt file to the development site and a 301 redirect to the real site. However, that did seemed to have no effect on the problem. Any ideas on how to fix this mess? Thank you in advance!
Technical SEO | | matt-145670 -
Search Result Page, Index or Not?
I believe Google doesn't want to index and show other search result pages in there SERP.
Technical SEO | | DigitalJungle
So instead of adding "noindex, follow" tag i have changed the url in my search result page like this: Original
http://www.mysite.com/kb-search.aspx?=travelguide&type=wiki&s=3 To
http://www.mysite.com/travelguide/attraction-guide.html And the search result page contains the title of the articles, a short descriptions (300 chars.) and a link to the articles. Does it help? Or should i add noindex, follow tag? Helps Please?0 -
Google crawl index issue with our website...
Hey there. We've run into a mystifying issue with Google's crawl index of one of our sites. When we do a "site:www.burlingtonmortgage.biz" search in Google, we're seeing lots of 404 Errors on pages that don't exist on our site or seemingly on the remote server. In the search results, Google is showing nonsensical folders off the root domain and then the actual page is within that non-existent folder. An example: Google shows this in its index of the site (as a 404 Error page): www.burlingtonmortgage.biz/MQnjO/idaho-mortgage-rates.asp The actual page on the site is: www.burlingtonmortgage.biz/idaho-mortgage-rates.asp Google is showing the folder MQnjO that doesn't exist anywhere on the remote. Other pages they are showing have different folder names that are just as wacky. We called our hosting company who said the problem isn't coming from them... Has anyone had something like this happen to them? Thanks so much for your insight!
Technical SEO | | ILM_Marketing
Megan0