How long does Google take to reduce the index size?
-
A few months ago, we added our custom search to our website, www.ergodotisi.com. We hadn't been paying much attention to our webmaster analytics, and found out a few months later that the Google index had grown from 2K-3K pages to one million because Google was crawling every combination of search filters. We have since followed the instructions to add noindex meta tags and blocked most search result pages in robots.txt. We allow indexing of some main categories through a new SEO-friendly URL structure. A few weeks have passed and the index has only shrunk to 700K. How long does it take before Google removes most of the duplicated search result pages from the index? Is it still crawling those pages but has not yet decided to remove most of them? How bad is this for SEO?
-
How long does it take before it removes most of the duplicated search result pages from the index?
Every site is different, but I have seen it take 6 to 9 months for pages to drop out.
Is it still crawling those pages but has not fully decided to remove most of them?
It's possible. As Gaston has already pointed out, search engines will need to access those pages again to see that you want them noindexed.
How bad is this for SEO?
It temporarily dilutes the amount of SEO equity available to flow to pages you DO want indexed.
-
Hello there,
Did you leave some time, without blocking those pages, for Googlebot to recrawl them?
If you implemented the noindex tag and the disallow in robots.txt at the same time, you are not letting Google know that those pages should be deindexed.
Remember that blocking pages in robots.txt prevents them from being crawled again, so Googlebot never sees the new robots meta tag. My advice is to let Googlebot recrawl all those pages and wait a few days, maybe 2-3 weeks. The number of indexed pages will slowly decrease.
Hope I've helped.
Best of luck.
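To make the conflict concrete, here is a minimal Python sketch of the check described above: a noindex tag only helps if the crawler is still allowed to fetch the page. The robots.txt contents, the page HTML, the example.com URL, and the helper names are all hypothetical. It relies on the standard library's `urllib.robotparser`, which handles plain prefix rules but not the wildcard patterns Google supports, so treat it as an approximation rather than how Googlebot actually evaluates rules.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives of any <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {k.lower(): (v or "") for k, v in attrs}
        if attr.get("name", "").lower() == "robots":
            for part in attr.get("content", "").split(","):
                self.directives.add(part.strip().lower())


def has_noindex(html):
    """True if the page HTML contains a robots meta tag with noindex."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


def noindex_is_visible(robots_txt, url, html, user_agent="Googlebot"):
    """True only if the page carries noindex AND the crawler may fetch it."""
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return has_noindex(html) and rules.can_fetch(user_agent, url)


# Hypothetical example data, not taken from the site in question.
ROBOTS_BLOCKING = "User-agent: *\nDisallow: /search\n"
ROBOTS_OPEN = "User-agent: *\nDisallow:\n"
PAGE = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
URL = "https://www.example.com/search?color=red"

# Blocked in robots.txt: the crawler cannot fetch the page, so it cannot see the tag.
print(noindex_is_visible(ROBOTS_BLOCKING, URL, PAGE))
# Not blocked: the crawler can fetch the page and read the noindex tag.
print(noindex_is_visible(ROBOTS_OPEN, URL, PAGE))
```

Running a check like this over a sample of the filtered search URLs would show which ones are both noindexed and disallowed, which are the pages that cannot drop out until the disallow rule is lifted.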
GR.