Google Webmaster tools Sitemap submitted vs indexed vs Index Status
-
I'm having an odd error I'm trying to diagnose. Our Index Status is growing and is now up to 1,115. However when I look at Sitemaps we have 763 submitted but only 134 indexed. The submitted and indexed were virtually the same around 750 until 15 days ago when the indexed dipped dramatically.
Additionally when I look under HTML improvements I only find 3 duplicate pages, and I ran screaming frog on the site and got similar results, low duplicates.
Our actual content should be around 950 pages counting all the category pages. What's going on here?
-
Bingo! My theory was correct. It was the extra // on the product pages in the site map. Once they fixed that, it went to indexing the sitemap again.
-
www, and parameters should not be an issue, robots file is ok (although waiting on the developer to change my_account and view_cart to my-account and view-cart)
On dev changes. This is a new site, and we have been struggling with some duplicate content generated by the ecommerce platform. We implemented a number of things to fix duplication issues around the same time this all started in google webmaster tools. Next and prev canonicals to the category pages and clean off session variables/refferal text, and canonicals on the product pages to clean off the session variables/referral text. Additionally the developer had a noindex tag on the product pages that we had them remove at the same time. Finally, we changed the content on the category pages from list with a grid view option to list view only and no followed the the secure account setting links like shopping cart, login etc.
I also have a number of fixes submitted to the developer for the site map, although to my knowledge it has not changed since day one. Changefreq is all messed up, it's randomly assigning this, no logic behind it, and 611 urls have // in between parameters instead of / could this be causing it? Follow my logic here, sitemap has all these pages with duplicate // in them, google hits the page, the canonicals we implemented says hey that's not it, it's / so then google ignores those pages in the sitemap. Is this it, or am I barking up the wrong tree? Any other thoughts?
-
I assume you have checked your robots.txt file and every other no index no follow robots X possibility that there is out there?
it appears like you are having issues with your web site architecture
https://www.distilled.net/blog/seo/indexation-problems-diagnosis-using-google-webmaster-tools/
I hope that is of help to you,
Thomas
-
Are there parameters being indexed? Is www and non-www getting indexed at the same time? Categories and tags being indexed? Any dev changes to the site that you know of?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to submit Google xml sitemap properly in 2016?
Hello everyone!
Technical SEO | | SEObd
I'm new in the field of SEO. I'm looking for submitting XML web site guideline or tutorial. But there is no proper guideline. All of the tutorials are about the wordpress website. What should I do for my PHP website? Can I submit XML site map without help of developer? Please help me.0 -
Did anyone else noticed Google index bug?
Noticed page indexation drop in Search Console for most of my sites. Guys from Search Engine Land seem to know about that: http://selnd.com/1YqiOoQ Did anyone else noticed something weird?
Technical SEO | | solvid1 -
Where are the crawled URLS in webmaster tools coming from?
When looking at the crawl errors in Webmaster Tools/Search Console, where is Google pulling these URLs from? Sitemap?
Technical SEO | | SEOhughesm0 -
How To Cleanup the Google Index After a Website Has Been HACKED
We have a client whose website was hacked, and some troll created thousands of viagra pages, which were all indexed by Google. See the screenshot for an example. The site has been cleaned up completely, but I wanted to know if anyone can weigh in on how we can cleanup the Google index. Are there extra steps we should take? So far we have gone into webmaster tools and submitted a new site map. ^802D799E5372F02797BE19290D8987F3E248DCA6656F8D9BF6^pimgpsh_fullsize_distr.png
Technical SEO | | yoursearchteam0 -
Should I Edit Sitemap Before Submitting to GWMT?
I use the XML sitemap generator at http://www.auditmypc.com/xml-sitemap.asp and use the filter that forces the tool to respect robots.txt exclusions. This generator allows me to review the entire sitemap before downloading it. Depending on the site, I often see all kinds of non-content files still listed on the sitemap. My question is, should I be editing the sitemap to remove every file listed except ones I really want spidered, or just ignore them and let the Google spiderbot figure it all out after I upload-submit the XML?
Technical SEO | | DonB0 -
Bing Webmaster tool
Hi Fellas, I wanted to know once you verify the BIng Webmaster tool (via xml file) for a dev site, do you have to do the verification process again for the final site? I thought I needed to do the verification again but once I added the final website (which have almost a similar URL) into the webmaster tool account, it seemed that I didn't have to verify it. I am a bit confused. Thank you for clarifying
Technical SEO | | Ideas-Money-Art0 -
Sitemaps for Google
In Google Webmaster Central, if a URL is reported in your site map as 404 (Not found), I'm assuming Google will automatically clean it up and that the next time we generate a sitemap, it won't include the 404 URL. Is this true? Do we need to comb through our sitemap files and remove the 404 pages Google finds, our will it "automagically" be cleaned up by Google's next crawl of our site?
Technical SEO | | Prospector-Plastics0 -
Root vs. Index.html
Should I redirect index.html to "/" or vice versa? Which is better for duplicate content issues?
Technical SEO | | DavetheExterminator0