Remove internal site SERPS from Google Index?
-
1. Internal Serp pages did not have a robots meta tag
2. As a result, client site has thousands (~4,400) of internal site SERP pages in the Google index.
3. We added the NoIndex, Follow attribute to all internal SERPS
4. We Disallowed: domain.com/internal-search-operator in Robots.txt
5. No new SERP pages are being indexed, but the other 4000 something that were already there are still in the index weeks later.
6. The pages are dynamically created and still work, so I can't use the Remove Content tool from google, because the pages don't 404.
Is there any way to get these pages out of the index besides just waiting and hoping google eventuall drops them?
Thanks
-
You can still submit a url removal request from GWT, because it checks for 1 of 3 things:
- 404 header response code
- NOINDEX meta tag
- Robots.txt disallow rule
So even if its not 404 Google will still do the removal.
-
You can create a formal request to Google using Webmaster Tools and tell them the URLs or list of URLs that you'd like removed from the index. Whether or not they actually remove them is a completely different story.
-
I should have explained it what I meant by SERPS better.
These pages are generated by doing a text search on the site. (Magento) So yes, they are product listings, but obviously most queries are different, so the dynamically created pages are all unique but useless.
Thanks for the idea about rel=canonical them back to the search page - I will look into that.
-
Note: By SERPs I'm assuming you're referring to Search Results within the site (e.g. a product listing) and not actual Google SERPs.
If so, it sounds like it could be a case for canonical. If the pages are all site.com/search.htm?searchterm=xxxx&page=y&rows=100 kind of thing you could canonical them all back down to search.htm.
If you're not familiar with canonical here's a YouMoz post that explains it pretty well:
http://www.seomoz.org/blog/complete-guide-to-rel-canonical-how-to-and-why-not
Based on my experience in the past the canonicalized pages will eventually 'disappear' from the index (not really, but Google doesn't display them anymore) in time. They would also eventually fall out already with what you've done in regards to noindex no follow etc., but I've found it takes longer.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Site Wide Links
Howdy Moz! So our agency has been around for long enough to have a few sites we've built that have our credit in their footer resulting in a site wide link. Mostly just our name. We've heard that Google does not particularly like site wide links, should we go through and remove some of these old links?
On-Page Optimization | | wearehappymedia0 -
Google is indexing urls with parameters despite canonical
Hello Moz, Google is indexing lots of urls despite the canonical in my site. Those urls are linked all over the site with parameters like ?, and looks like Google is indexing them despite de canonical. Is Google deciding to index those urls because they are linked all over the site? The canonical tag is well implemented.
On-Page Optimization | | Red_educativa0 -
Removing entire site sections categories from an ecommerce store, best practices ?
Hi Whats best practice when removing entire site sections from an e-commerce store due to ending those lines and not having similar alternative pages to redirect them too ? Should you remove pages in GWT etc ? Any other measures to prevent probs such as 404 spikes etc ?? Cheers Dan
On-Page Optimization | | Dan-Lawrence0 -
Google Xml Sitemaps
Which plugin is good to use to create and submit my sitemap: sitemap from yoast or google xml sitemap plugin?
On-Page Optimization | | Sebastyan22
Which one is better? I already saw this video but I get an error when I submited it to webmaster tools and I don't know why:http://www.quicksprout.com/university/how-to-set-up-and-optimize-a-sitemap/_''Your Sitemap appears to be an HTML page. Please use a supported sitemap format instead.''_Thank you !0 -
Duplicate content on partner site
I have a trade partner who will be using some of our content on their site. What's the best way to prevent any duplicate content issues? Their plan is to attribute the content to us using rel=author tagging. Would this be sufficient or should I request that they do something else too? Thanks
On-Page Optimization | | ShearingsGroup0 -
How Google differentiates web sites like directories?
Hi, I want to ask how google differentiates web sites like directories or company listing websites? How it understands that is a normal thing to have many links in a directory site? Are there some guides links about what to do and avoid and how to make SEO optimization for a directory web site.
On-Page Optimization | | vladokan0 -
Removing OLD pages
Dear all, I was removing tons of old pages from my directory (about 400 pages), I was setingup a 404 custom page, all is fine, so when I go to an existing page I get a 404 and redirected to my 404 page. The problem is Google Webmaster tools list all these pages as 404, and never clean my list (1 year til now), so I assume something is wrong. Question what is the best way or natural to remove old pages from one directory? Note: previously I tryed add on these pages the NOINDEX/NOFOLLOW meta tag and I got from google Soft-404. Thank you
On-Page Optimization | | SharewarePros0 -
0 urls indexed in GWT, many found with site: command
Hi, This is happening with a brand new site. We have created sitemaps and submitted them to Google Webmaster Tools. GWT says sitemaps are ok, "x" number of urls submitted, but no urls indexed. When I check in Google with site:domain.com I see that many of the urls are already indexed. Why this discrepancy between GWT and reality? Thanks for your time!
On-Page Optimization | | gerardoH0