Sitemap.xml problem in Google Webmaster Tools
-
Hi,
My sitemap.xml is not being submitted correctly in Google Webmaster Tools.
There are 697 URLs submitted, but only 56 are in the Google index.
At the top of Webmaster Tools, this is what it says:
http://www.example.com/sitemap.xml has been resubmitted.
But when I click the status button, a red X appears.
Any suggestions about this? Thanks...
-
Cheers for your reply and answer.
Yes, most of your assumptions were correct: I am using sitemap generation software. The issue is fixed. There was a problem with the sitemap when it was created, but it's all sorted now and submitted correctly in WMT.
Thanks...
-
For the 8 invalid pages, you need to fix the URLs. Based on your questions I assume you are using some form of sitemap generation software. Apparently it is not configured correctly. You will need to take a look at these pages to determine why the URLs are invalid and/or contact the sitemap software vendor.
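One way to find such entries before resubmitting is to check every `<loc>` value against the host the site is actually served from. The following is a minimal sketch, not a definitive tool: the helper name `find_invalid_locs` and the sample sitemap are made up for illustration, and it assumes a standard sitemaps.org `urlset` file whose URLs should all live on `www.example.com`.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Namespace used by standard sitemap files (sitemaps.org protocol).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def find_invalid_locs(sitemap_xml, expected_host):
    """Return every <loc> URL whose scheme or host does not match the site.

    Hypothetical helper for illustration; it only checks the scheme and
    host name, not whether the page actually exists.
    """
    root = ET.fromstring(sitemap_xml)
    bad = []
    for loc in root.findall("sm:url/sm:loc", SITEMAP_NS):
        url = (loc.text or "").strip()
        parts = urlparse(url)
        if parts.scheme not in ("http", "https") or parts.hostname != expected_host:
            bad.append(url)
    return bad


# Illustrative sitemap: one good entry and two entries where a path
# segment ended up in the host position.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.example.com/index.html</loc></url>
  <url><loc>http://exhibitions/info_22.html</loc></url>
  <url><loc>http://irish-myths-and-legends/info_12.html</loc></url>
</urlset>"""

if __name__ == "__main__":
    for url in find_invalid_locs(SAMPLE, "www.example.com"):
        print("invalid:", url)
```

Entries flagged this way usually point to a base URL setting in the generator that is missing or wrong, which is worth checking before editing the file by hand.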
With respect to indexing, submitting a sitemap is no guarantee that the pages will be indexed. You can submit a 1,000-page site and have every page indexed, or you can have only a couple of hundred pages indexed. There are a variety of factors involved.
Some factors which can affect indexing:
-
Is your robots.txt file blocking any of these pages?
-
Are any of these pages duplicate content?
-
Are any of the pages invalid URLs?
-
Are any of these pages canonicalized to other pages?
-
Are any of these pages 301'd to other pages?
-
How well is your site's navigation working? Sitemaps help Google find island pages and such, but your site will be crawled much better with proper navigation along with both internal and external links.
-
How popular are your site and these pages? Pages with good PA (Page Authority) are crawled regularly, and sites with high DA (Domain Authority) are crawled more frequently and more deeply than other sites.
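The first question in the list above, whether robots.txt blocks any sitemap URLs, can be checked offline with Python's standard library. This is a rough sketch under stated assumptions: the function name `blocked_urls`, the robots.txt rules, and the URLs are all invented for illustration. Note that `urllib.robotparser` does plain prefix matching and does not implement Google's wildcard extensions, so treat the result as a first pass rather than a statement of what Googlebot will do.

```python
from urllib.robotparser import RobotFileParser


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt rules disallow.

    Hypothetical helper: parses robots.txt text that has already been
    downloaded, so no network access is needed.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [u for u in urls if not parser.can_fetch(user_agent, u)]


# Illustrative rules and URLs, not taken from any real site.
ROBOTS = """User-agent: *
Disallow: /private/
"""

URLS = [
    "http://www.example.com/index.html",
    "http://www.example.com/private/info_22.html",
]

if __name__ == "__main__":
    for url in blocked_urls(ROBOTS, URLS):
        print("blocked by robots.txt:", url)
```

Any URL reported here should either be removed from the sitemap or unblocked in robots.txt, since listing a blocked page in a sitemap sends conflicting signals.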
-
-
I'm just wondering how to go about fixing these? I see that they are not valid. Also, once they are fixed, do you think this will solve the sitemap issue? (That is, are these 8 invalid pages causing 600+ pages not to be indexed?) Thanks.
-
The links in your reply are not valid. Try clicking on one of them. They point to your secure Google WMT page and have an extra http:// prefix.
-
Errors look like this:
1916
Invalid URL
This is not a valid URL. Please correct it and resubmit.
URL: http://exhibitions/info_22.html
Parent tag: url
Tag: loc
Problem detected on: Aug 4, 2011
1919
Invalid URL
This is not a valid URL. Please correct it and resubmit.
URL: http://irish-myths-and-legends/info_12.html
Parent tag: url
Tag: loc
Problem detected on: Aug 4, 2011
There are about 10 errors like the above. Any suggestions?
-
You need to click on the sitemap in Google WMT and it will inform you of the issue. There are many possible causes ranging from the sitemap link not being accessible to the file not being formatted correctly.