Replacing a site map
-
We are in the process of changing our folder/url structure. Currently we have about 5 sitemaps submitted to Google.
How is it best to deal with these site maps in terms of either (a) replacing the old URLs with the new ones in the site map and (b) what affect should we have if we removed the site map submission from the Google Webmaster Tools console.
Basically we have in the region of 20,000 urls to redirect to the new format, and to update in the site map.
-
Another thought might be to place a noindex on the new pages to start with and as we migrate and 301 redirect the old to the new remove the noindex on the new pages ?
That can work but it's not an approach i would use. It seems like a lot of extra work, you run the risk of forgetting to remove the noindex tag on some pages, and also you may wind up not having pages properly indexed for a month.
If you publish a page today, Google may crawl the new page and see the noindex tag. You can then remove the noindex tag but Google may not recrawl the page for some time leaving your site without an indexed page.
As part of the process of publishing the page, I would 301 the old URL to the new URL immediately.
-
Another thought might be to place a noindex on the new pages to start with and as we migrate and 301 redirect the old to the new remove the noindex on the new pages ?
Thoughts ??
-
since the site has over 10,000 pages we need to make sure all redirects etc are set-up before we go live with the new URLs ?
Whether your site has 10 pages or a million pages you should ensure all internal links work without the need for redirection. Any old external links should be redirected to the correct page on your site if one exists. Otherwise you can allow the URL to 404 if there is not a current equivalent page.
Set up your site's 404 page so users are offered a basic "page not found" message along with your site's navigation and a search function. You should set up a log to track which URLs are generating 404 errors.
Prior to launching the site run a crawl diagnostic to help ensure nothing has been missed.
-
Perfect thanks. Just one final question, since the site has over 10,000 pages we need to make sure all redirects etc are set-up before we go live with the new URLs ?
What is the best way to go forward with regards launching the site ?
Should we launch the new pages and then go through the URLs redirecting them ?
Thoughts please ??
-
We are changing our site structure for two main reasons
-
Ease of functionality, and having the ability to target friendly URLs suitable for SEO
-
Plus we've a new CMS, that allows this custom written URLs
The current structure has too many folders that are too deep, and is becoming too un-manageable. The new CMS gives us totally control from one control panel.
I understand that we will loose some PR, but believe it will be for the better of the site and user experience.
-
-
We are changing our site structure for two main reasons
-
Ease of functionality, and having the ability to target friendly URLs suitable for SEO
-
Plus we've a new CMS, that allows this custom written URLs
The current structure has too many folders that are too deep, and is becoming too un-manageable. The new CMS gives us totally control from one control panel.
I understand that we will loose some PR, but below it will be for the better of the site and user experience.
-
-
Big question: are you changing folder/url structure for aesthetics or functionality? Often times it's not worth making such a large change in hopes of getting some SEO-friendly URL's, as the weight on SEO-friendly URL's isn't what it once was. And the headache involved, as well as the inevitable loss in traffic, is quite often not worth it at all.
With that said, refresh your entire sitemap with the new URL's once they are made. Remove all old urls.
IMPORTANT: setup 301 redirects, either using .htaccess or PHP (or whatever language your site uses), to redirect all old urls to the respective new urls. You will lose a fair chunk of PR during this change, but if you feel your site will benefit greatly from a structure change, then you will be willing to take the hit.
Don't leave any redirect un-turned. Then, you'll just have to wait it out while Google re-indexes your entire site trying to figure out your new url structure. Could take a week, could take months. All depends on what Google has valued your site as. For example, if CNN changed their entire URL structure, they probably would miss a beat. Smaller websites tend to take much larger hits in the SERP's.
So, just be sure it's a necessary action, trust me. And don't ever remove those 301's from your .htaccess as you never know what Google still has in their index for your site.
-
A sitemap should be a link representation of your site. It should contain a link to every page you wish to be included in Google's index.
How is it best to deal with these site maps in terms of either (a) replacing the old URLs with the new ones in the site map
Just make the switch. If a page no longer exists on your site, remove the link. If you create a new page on your site, add the link.
what affect should we have if we removed the site map submission from the Google Webmaster Tools console
For the most part, none. During the next crawl Google would look for your sitemap at your root address: www.mydomain.com/sitemap.xml. Google will also check your robots.txt file for a path to your sitemap file. If a sitemap is not located, it will crawl your site normally.
The primary purpose of a sitemap is to allow Google to become aware about new pages on your site it otherwise might not find. If your site offers solid navigation, a site map is not necessary at all.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why Can't Googlebot Fetch Its Own Map on Our Site?
I created a custom map using google maps creator and I embedded it on our site. However, when I ran the fetch and render through Search Console, it said it was blocked by our robots.txt file. I read in the Search Console Help section that: 'For resources blocked by robots.txt files that you don't own, reach out to the resource site owners and ask them to unblock those resources to Googlebot." I did not setup our robtos.txt file. However, I can't imagine it would be setup to block google from crawling a map. i will look into that, but before I go messing with it (since I'm not familiar with it) does google automatically block their maps from their own googlebot? Has anyone encountered this before? Here is what the robot.txt file says in Search Console: User-agent: * Allow: /maps/api/js? Allow: /maps/api/js/DirectionsService.Route Allow: /maps/api/js/DistanceMatrixService.GetDistanceMatrix Allow: /maps/api/js/ElevationService.GetElevationForLine Allow: /maps/api/js/GeocodeService.Search Allow: /maps/api/js/KmlOverlayService.GetFeature Allow: /maps/api/js/KmlOverlayService.GetOverlays Allow: /maps/api/js/LayersService.GetFeature Disallow: / Any assistance would be greatly appreciated. Thanks, Ruben
Technical SEO | | KempRugeLawGroup1 -
HTML Site for Speed
I have a few small sites and landing pages on Wordpress that I want to load a lot quicker than they do. It occurred to me that if there is not a lot of content management necessary, I should simply make the static web pages straight html instead of trying all the modifications necessary to get some Wordpress sites and themes to load quicker. I have noticed the html sites I have load lighting fast on slow hosting service. Is this a good idea, can anyone think of drawbacks to it? Security? Responsiveness? SEO? And what about taking some company's sites with blog straight html so the home page loads quick, and then using Wordpress for the blog?
Technical SEO | | phogan0 -
Site Redesign: 302 Query
Hi there, We'll be redesigning our website www.example.com and as such want to 302 users from www.example.com and all other pages to a new URL www.example.com/landingpage while we go through the redesign. The new landing page will have copy and a sign up form on it and once the redesign is completed, we plan on removing the 302 and sending all traffic back to the original url www.example.com. I'd just like to check that a 302 is the most relevant option here? Obviously, once redesign is completed we'll 301 any old URLs to their new locations once completed.
Technical SEO | | Hemblem0 -
Do you need an on page site map as well as an XML Sitemap?
Do on page site maps help with SEO or are they more for user experience? We submit and update our XML Sitemaps for the search engines but wondering if /sitemap for users is necessary?
Technical SEO | | bonnierSEO0 -
Quoting From Another Part of Your Site
Suppose that you wrote an awesome piece of content and want to feature short segments of that content on your product pages. Is there any risk (such as duplicate content problems) to quoting a paragraph from another page of your site?
Technical SEO | | Charlessipe0 -
News Archive within a site - Detrimental?
I have a clients site, which we publish fresh news content daily and have done for some time. this has built up a huge archive of news over the past few years. All the news resides in a /news sub folder. The most interesting news articles are syndicated through facebook and twitter. Note: All the news is original content. The archive is based around a chronological filing system so /news/2010/December would retrieve news articles published in December 2010. My first question is this, in the Post Panda seo world, since the news in the archives receives little to no traffic, will these be classed as low quality pages, even though they are generally informative and timely news articles? My Second Question, It's my understanding that a site can be penalised for harbouring low quality pages, thus would I be better to do away with my news archive of around 600+ articles and impose a rule that news is removed from the site after a set period (say 6 -12 months)?
Technical SEO | | Entrusteddev0 -
Can I noindex most of my site?
A large number of the pages on my site are pages that contain things like photos and maps that are useful to my visitors, but would make poor landing pages and have very little written content. My site is huge. Would it be benificial to noindex all of these?
Technical SEO | | mascotmike0