Google has indexed a lot of test pages/junk from the development days.
-
With hind site I understand that this could have been avoided if robots.txt was configured properly.
My website is www.clearvisas.com, and is indexed with both the www subdomain and with out.
When I run site:clearvisas.com in Google I get 1,330 - All junk from the development days.
But when I run site:www.clearvisas.com in Google I get 66 - these results all post development and more in line with what I wanted to be indexed.
Will 1,330 junk pages hurt my seo?
Is it possible to de-index them and should I?
If the answer is yes to any of the questions how should I proceed?
Kind regards,
Fuad
-
Thanks Ryan.
-
It's impossible to say conclusively without examining your site and the content; however, since you refer to them as "junk" pages, it is likely they should best be removed to protect your other pages.
-
Thanks Ryan.
Are the un-wanted/irrelevant pages likely to affect my organic seo?
-
Thanks for your view David, its much appreciated. Thanks, Fuad
-
I would suggest following option 3 from David's recommendations.
Simply add the "noindex" tag to the pages you want removed from Google. The pages will then be removed the next time they are crawled.
You are correct the issue could have been avoided by blocking the site during development, which is a recommended practice; however, it is recommended to minimize entries in the robots.txt file of a live site. You can add the pages in robots.txt and Google can still index them.
The above applies if you feel the need to keep the pages around. If you no longer need those pages, removing them and providing a 410 error (GONE) would be the best approach.
-
Go to Google Webmaster Tools => Optimization => Remove URLS
In order for Google to remove the URL, you will need to do 1 of the following:
1. Block it with robots.txt, but it sounds like it's too late for that.
2. If you removed the old development content, make sure that the old content's URL produces a 404 or 410 status code.
3. Block the content with a Meta noncontent tag.
In my opinion, option 2 is the easiest since you should have a 404 page anyway.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Big ranking drop after 12/Jan/2017
Hello, moz users. Has anyone confirmed that keyword ranking dropped since last few days? I've tracked several keyword ranking by Moz and SEMrush and the ranking dropped big time on both tools today. one of the keywords dropped its ranking from 10 to out of SERP. Mozcast has shown high temperature so I assume there is or has been algo update but I would like to know someone encountered the same issue. Thanks.
Algorithm Updates | | Yuki-hero0 -
Anyone notice a 25% + drop in Google Traffic since the 23/24 August 2013?
Hi Guys, My site has seen a 25% drop in Google Traffic since Saturday the 24th August 2013. I see it being mentioned here: http://www.seroundtable.com/google-update-17268.html but not anywhere else really. I want to find out if there is anyone else that has been hit and what it is we have been hit by. Thanks
Algorithm Updates | | joblife0 -
Am I the only one experiencing this Google SERP problem?
I perform Google searches every single day, sometimes several times in a day. These searches have nothing to do with being a marketer--they're simply as a consumer, researcher, person who needs a question answered, or in other words: a typical person. For about the past month or so, I have been unsuccessful at finding what I'm looking for on the first try EVERY SINGLE TIME. Yes, I mean it--every single time. I'm left either going all the way to the third page, clicking dozens of results and retuning to the SERPs, or having to start over with a differently worded query. This is far too often to be a coincidence. Has this been happening to anymore else? I know there was a recent significant algorithm update, right? I always look at algorithm updates through the eyes of an SEO, but I'm currently looking at it through the eyes for an average searcher, and I'm frustrated! It's been like trying to find something on Bing!
Algorithm Updates | | UnderRugSwept0 -
Has anyone experienced a dramatic decrease in Google rankings followed by a dramatic increase in the past few days?
I don't want to be one of those whiny people always asking about rankings, but for the first time in a while, I've seen some crazy fluctuations in Google rankings. I was wondering if anyone had any similar experiences lately.
Algorithm Updates | | innovationsimple0 -
In the body of index page i want to be able to add text that can be picked up by crawlers but I do not want these text to be visible? How can I code this?
in the body of index page i want to be able to add text that can be picked up by crawlers but I do not want these text to be visible? How can I code this?
Algorithm Updates | | FinindDesign0 -
Stop google indexing CDN pages
Just when I thought I'd seen it all, google hits me with another nasty surprise! I have a CDN to deliver images, js and css to visitors around the world. I have no links to static HTML pages on the site, as far as I can tell, but someone else may have - perhaps a scraper site? Google has decided the static pages they were able to access through the CDN have more value than my real pages, and they seem to be slowly replacing my pages in the index with the static pages. Anyone got an idea on how to stop that? Obviously, I have no access to the static area, because it is in the CDN, so there is no way I know of that I can have a robots file there. It could be that I have to trash the CDN and change it to only allow the image directory, and maybe set up a separate CDN subdomain for content that only contains the JS and CSS? Have you seen this problem and beat it? (Of course the next thing is Roger might look at google results and start crawling them too, LOL) P.S. The reason I am not asking this question in the google forums is that others have asked this question many times and nobody at google has bothered to answer, over the past 5 months, and nobody who did try, gave an answer that was remotely useful. So I'm not really hopeful of anyone here having a solution either, but I expect this is my best bet because you guys are always willing to try.
Algorithm Updates | | loopyal0 -
How do you get photo galleries indexed on Google News?
I work for a news site and some of our photo galleries get indexed by Google News while others never do. I'm trying to determine why some are more successful than others even though they all follow the same guidelines regarding keyword-rich headlines & copy, h1s, etc. When comparing what's been indexed in the past with current galleries, there doesn't appear to be an obvious pattern. Can anyone share some insight into this?
Algorithm Updates | | BostonWright0 -
How Google Determines Sitelinks
Does anyone have authoritative information on how Google determines which links to use as sitelinks? I thought I saw that Top Landing Pages was a metric Google used (in part).
Algorithm Updates | | joshfialkoff-778630