404 page not found after site migration
-
Hi,
A question from our developer.
We have an issue in Google Webmaster Tools.
A few months ago we killed off one of our e-commerce sites and set up another to replace it. The new site uses different software on a different domain. I set up a mass 301 redirect that would redirect any URLs to the new domain, so domain-one.com/product would redirect to domain-two.com/product. As it turns out, the new site doesn’t use the same URLs for products as the old one did, so I deleted the mass 301 redirect.
We’re getting a lot of URLs showing up as 404 not found in Webmaster Tools. These URLs used to exist on the old site and were linked to from the old sitemap. Even URLs that only recently started showing up as 404 are reported as being linked from the old sitemap. The old sitemap no longer exists and has been returning a 404 error for some time now. Normally I would set up a 301 redirect for each one and mark it as fixed, but there are almost a quarter of a million URLs returning 404 errors, and the number is rising.
I’m sure there are some genuine problems that need sorting out in that list, but I just can’t see them under the mass of errors for pages that have been redirected from the old site. Because of this, I’m reluctant to set up a robots file that disallows all of the 404 URLs.
The old site is no longer in the index. Searching Google for site:domain-one.com returns no results.
Ideally, I’d like anything that was linked from the old sitemap to be removed from Webmaster Tools and for Google to stop attempting to crawl those pages.
Thanks in advance.
-
I agree that the 301 redirect would be your best option, as it sends not only users but also the bots to the right page. You may need to get a developer in to write some regular expressions that parse the incoming request and automatically find the correct new URL. I have worked on sites with a large number of pages, and some sort of automation is the only way to go.
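To give a rough idea of what that regular-expression mapping might look like, here is a minimal Python sketch. Everything specific in it is an assumption for illustration: the old and new path patterns, the `http://` scheme, and the names `REDIRECT_RULES` and `map_old_url` are hypothetical, since the real URL schemes of the two sites are not given in this thread.

```python
import re
from typing import Optional

# Hypothetical rules: each pairs a regex for an OLD path with a template
# for the NEW URL. The real patterns depend on how the two sites build
# their URLs, which is not known here.
REDIRECT_RULES = [
    (re.compile(r"^/product/(?P<slug>[a-z0-9-]+)/?$"),
     "http://domain-two.com/shop/{slug}"),
    (re.compile(r"^/category/(?P<name>[a-z0-9-]+)/?$"),
     "http://domain-two.com/c/{name}"),
]


def map_old_url(path: str) -> Optional[str]:
    """Return the new URL for an old path, or None if no rule matches."""
    for pattern, template in REDIRECT_RULES:
        match = pattern.match(path)
        if match:
            # Fill the template with the named groups captured from the old path.
            return template.format(**match.groupdict())
    return None
```

A request whose path maps to a URL would get a 301 to that URL; anything that returns `None` could keep returning a 404 or 410, which also leaves you with a much shorter list of unmatched URLs to review by hand.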
That said, if you simply want to kill the old URLs, you can return 404s or 410s. As you mention, you then end up with a bunch of 404 errors in GWT. I have been there too; it's damned if you do, damned if you don't. We had some tracking URLs from an old site, and more than a year after we started returning 410s for them, they still show up in GWT as errors.
We are trying a new solution for removing these URLs from the index without getting 404 errors. We return a 200 and serve a minimal HTML page with the meta robots noindex tag.
http://support.google.com/webmasters/bin/answer.py?hl=en&answer=93710
"When we see the noindex meta tag on a page, Google will completely drop the page from our search results, even if other pages link to it. "
So, we allow Google to find the page, get a 200 (so no 404 errors), but then use the meta noindex tag to tell Google to remove it from the index and stop crawling the page.
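As an illustration only, the 200-plus-noindex response described above could be served like this using Python's standard library. This is a sketch, not what we actually run: the page wording and the names `NOINDEX_PAGE`, `NoindexHandler`, and `serve` are assumptions, and in practice you would only route the retired URLs to a handler like this.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

# Minimal page whose only job is to carry the robots noindex meta tag.
NOINDEX_PAGE = (
    "<!DOCTYPE html>\n"
    "<html><head>\n"
    '<meta name="robots" content="noindex">\n'
    "<title>Page removed</title>\n"
    "</head><body><p>This page is no longer available.</p></body></html>\n"
)


class NoindexHandler(BaseHTTPRequestHandler):
    """Answer every GET with a 200 and the noindex page."""

    def do_GET(self):
        body = NOINDEX_PAGE.encode("utf-8")
        self.send_response(200)  # 200, so no 404 is reported for the URL
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port: int = 8000) -> None:
    """Run the handler until interrupted (hypothetical entry point)."""
    HTTPServer(("", port), NoindexHandler).serve_forever()
```

Calling `serve()` starts a local server on port 8000 so you can check the status code and the tag in the response before wiring the same idea into your real web server.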
Remember, this is the "nuclear" option. You only want to do this to remove the pages from the Google index. Someone mentioned using GWT to remove URLs, but if I remember correctly, you only have so many pages you can do this with at a time.
If you list the files in robots.txt, Google will not spider them, but if you later remove a page from the robots.txt file, Google will start trying to spider it again. I have seen Google come back a year later for URLs after I took them out of robots.txt. This is what happened to us, so we tried just returning the 410/404, but Google still kept crawling. We recently moved to the 200/noindex meta option and it seems to be working.
Good luck!
-
You can, but the 404s should stop being crawled on their own. There's also a Webmaster Tools feature you can use to make that happen faster:
http://support.google.com/webmasters/bin/answer.py?hl=en&answer=64033
-
Yeah, it's a 404: http://www.tester.co.uk/17th-edition-equipment/multifunction-testers/fluke-1651b-multifunction-installation-tester
With over 200,000 404s, it's a lot to go through and 301. For some reason, when the site got migrated they just pointed each old URL to a new one by replacing the root domain name, without creating matching URLs. Doh.
I was thinking about listing them all in robots.txt?
-
A 404 should cause Google to de-index the content. Go to one of the bad URLs and view the headers to make sure that your webserver is returning a status 404 and not just a 404 "page".
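If you want to check the headers from a script rather than a browser, a sketch along these lines would do it in Python. The function names and the labels returned by `describe_status` are my own, chosen for illustration. It sends a HEAD request without following redirects; note that a few servers answer HEAD differently from GET, so treat the result as a first check.

```python
import http.client
from urllib.parse import urlsplit


def fetch_status(url: str) -> int:
    """Return the raw HTTP status for a URL without following redirects."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=10)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=10)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("HEAD", path)
        return conn.getresponse().status
    finally:
        conn.close()


def describe_status(status: int) -> str:
    """Label a status code for the purpose of this check."""
    if status in (404, 410):
        return "not-found"          # the server itself says the page is gone
    if status == 200:
        return "possible-soft-404"  # a 'not found' page served with a 200
    if 300 <= status < 400:
        return "redirect"
    return "other"
```

For one of the bad URLs, `describe_status(fetch_status(url))` should come back as `"not-found"`. If it comes back as `"possible-soft-404"`, the server is showing a 404 "page" while reporting success.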
As hard and time-consuming as it might be, I would still pursue a 301 option. It's the cleanest way to resolve the issue. Just start nibbling at it and you can make a dent. Doing nothing just lets the problem grow.