Spam posts indexed, what to do now?
-
Hi,
So we had a staff problem last week and we let some spam posts (cheap nike jerseys etc.) that also got indexed by Google. (We just checked and there are lik 105 already indexed)
Of course we have now removed all these spam posts but what is the best practice at this point? Are we supposed to do something else to remove these from Google's index? (maybe through google webmaster tools?) We have already edited robots.txt to disallow those pages as a quick remedy.
And finally, could this have done any harm? We were quite slow noticing these posts to remove them. They were there for about 12 days.
thanks
-
Good to know
-
Hi,
Thanks for the comprehensive answer. We don't have any vulnerabilities. It was all my fault as I completely forgot that I had given administrative access to one of our former content managers who had temporarily allowed anonymous users to post on this certain section of the site. And once he left, we forgot to update that permission and never really noticed those posts, until today.
-
haha I just say you said "all those links had auto-nofollow on them"
NO PROBLEM MAN! rest easy! You cannot get penalized for nofollow links!
-
Thanks for the quick response. We're just requesting URL removal for all those URL's. I hope this makes it all good. No sign of ranking drop at the moment. We're lucky those pages were automatically filtered out by our sitemap.xml and all those links had auto-nofollow on them. Time to consider buying a service like Mollom I guess.
-
Do you know how the spam posts were published on your site? Just make sure the vulnerability is fixed so it doesn't happen again. Once the spam posts you found have been deleted from your site, you shouldn't have to do anything more since they will fall out of Google's index. Keep an eye on Google Webmaster Tools though to see if you notice any more spam pages pop up on Google's radar and then manually remove them.
Here is Google's official answer - http://support.google.com/webmasters/bin/answer.py?hl=en&answer=164734
When a page is updated or removed, it will automatically fall out of our search results. You don’t need to do anything to make this happen.
However, if you urgently need to remove content from Google's search results (for example, if you’ve already removed, updated, or blocked a page accidentally displaying confidential information like credit card numbers), you can request expedited removal of those URLs.
Our removal tools are intended for pages that urgently need to be removed—for example, if they contain confidential data that was accidentally exposed. Using the tools for other purposes may cause problems for your site.
Another Google resource if your site was actually hacked or compromised - http://support.google.com/webmasters/bin/answer.py?hl=en&answer=1269119
To take your site "offline" after being hacked. If your site was hacked and you want to get rid of bad URLs that got indexed, use the URL removal tool to remove any new URLs that the hacker created—for example, http://www.example.com/buy-cheap-cialis-skq3w598.html. But we don't recommend removing your entire site, or removing URLs that you'll eventually want indexed. Instead, clean up the hacking and let us recrawl your site.
-
So someone was posting articles on your site that linked to other sites like paid links?
If you removed the posts no need to block them in robots.txt because they no longer exist so will not get crawled anymore. Yes definitely request removal in WMT URL removal tool and get those pages out of Google's index ASAP.
You're probably OK. Just keep your fingers crossed and an eye on rankings and run a tight ship so that doesn't happen again, definitely something you can get penalized for. Good thing you caught it quickly.
EDIT: if you meant that you let spam comments get posted live/approved by the admin then all you can do is remove the spammy posts and make sure your comment settings are set to need admin approval before getting posed live. No need to block in robots.txt or remove URLs in that case but it doesn't hurt. If the links are off of your site you should be fine.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Not all images indexed in Google
Hi all, Recently, got an unusual issue with images in Google index. We have more than 1,500 images in our sitemap, but according to Search Console only 273 of those are indexed. If I check Google image search directly, I find more images in index, but still not all of them. For example this post has 28 images and only 17 are indexed in Google image. This is happening to other posts as well. Checked all possible reasons (missing alt, image as background, file size, fetch and render in Search Console), but none of these are relevant in our case. So, everything looks fine, but not all images are in index. Any ideas on this issue? Your feedback is much appreciated, thanks
Technical SEO | | flo_seo1 -
Historic issue with incomplete indexing
Hi there We run quite a big site in the UK in the commercial real-estate space. Historically we have always had a challenge getting our "primary" landing pages indexed, which are location based property result pages. e.g. https://realla.co/to-rent/commercial-property/oxford For example, for the "towns" category we have 8,549 submitted in our xml sitemap, with only 3,171 indexed. This is a general issue across all our sitemaps. 120k submitted, 80k indexed. Our pages are linked through breadcrumbs, and nearby links. In the new search console these pages are reported as "crawled - currently not indexed" These all sit under the folder: site:https://realla.co/to-rent/commercial-property/* site:https://realla.co/to-rent/office/* We have done extensive work to optimise performance, including AMP pages. Each location page has many details pages for individual properties e.g. https://realla.co/to-rent/details/0ffbbd0a1a1147edb8847c5ce6179509 One action we have remaining is to nest the details under the locations pages, which may help. These details pages are indexed fully. Any feedback much appreciated
Technical SEO | | ianparryuk0 -
Any idea why pages are not being indexed?
Hi Everyone, One section on our website is not being indexed. The product pages are, but not some of the subcategories. These are very old pages, so thought it was strange. Here is an example one one: https://www.moregems.com/loose-cut-gemstones/prasiolite-loose-gemstones.html If you take a chunk of text, it is not found in Google. No issues in Bing/Yahoo, only Google. You think it takes a submission to Search Console? Jeff
Technical SEO | | vetofunk1 -
IP Redirect causing Indexing Issue
Hi, I am trying to redirect any IP from outside India that comes to Store site (https://store.nirogam.com/) to Global Store site (https://global.nirogam.com/) using this methodThis is causing various indexing issues for Store site as Googlebot from US also gets redirected!- Very few pages for "store.nirogam.com/products/" are being indexed. Even after submission of sitemap it indexed ~50 pages and then went back to 1 page etc. Only ~20 pages indexed for now.- After this I tried manually indexing via "Crawl -> Fetch as Google" - but then it showed me a redirect to global.nirogam.com. All have their "status -> Redirected" - This is why bots are not able to index the site.What are possible solutions for this? How can we tell bots to index these pages and not get redirected?Will a popup method where we ask user if they are outside India help in solving this issue?All approaches/suggestions will be highly appreciated.
Technical SEO | | pks3330 -
Pages Not Getting Indexed
Hey there I have a website with pretty much 3-4 pages. All of them had a canonical pointing to one page and the same content ( which happened by mistake ) I removed the canonical URL and added one pointing to its page. Also, I added the original content that was supposed to be there to begin with. It's been weeks but those pages are not getting indexed on the SERPS while the one that they use to point with the canonical does.
Technical SEO | | AngelosS0 -
Rel=canonical + no index
We have been doing an a/b test of our hp and although we placed a rel=canonical tag on the testing page it is still being indexed. In fact at one point google even had it showing as a sitelink . We have this problem through out our website. My question is: What is the best practice for duplicate pages? 1. put only a rel= canonical pointing to the "wanted original page" 2. put a rel= canonical (pointing to the wanted original page) and a no index on the duplicate version Has anyone seen any detrimental effect doing # 2? Thanks
Technical SEO | | Morris770 -
Blog post summary pages
I'm wondering post-panda if its wise to block access to blog post summary pages like this one: http://www.howtotradestocks.org/blog/page/15/ Any thoughts?
Technical SEO | | PeterM220 -
Importance of an optimized home page (index)
I'm helping a client redesign their website and they want to have a home page that's primarily graphics and/or flash (or jquery). If they are able to optimize all of their key sub-pages, what is the harm in terms of SEO?
Technical SEO | | EricVallee340