CMS program that have indexing problems
-
I am working on optimizing a website that was built with an CMS program. I searched for what pages are indexed on Google using "site:url" command. For some reason none of the pages are indexed on Google. What is the best way to index these web pages?
-
Thanks Mat!
I will install Google webmaster once i get the log in information from web designer. I used the SEOmoz campaign and validator.w3.org to figure out if there are any crawl errors. The only crawl errors I found was duplicated page title and content. On the validator i found 24 HTML errors. I did not see any any no follow or blocked spiders or no index.
The URL structure has a "shop.domainname.com" instead of a "www.domainname.com". I could switch hosting services and fix this issue could this be the problem?
I have seen similar scenarios before I think this is the issue. Thanks again!
-
If NO pages at all have been indexed then the first thing to check is that you are not blocking google. the easiest way to check all the most likely issues there is to get your site registered on google webmaster tools and then look for any crawl errors.
However if you want to check manually start with the robots.txt file - make sure that there are not any disallows in there. If there are then make sure that they are not blocking the mail content (if you are not sure then post the contents back here - happy to look). Also look in the source code of the page for any "no index" instructions.
If neither of those flag up problems then you need to ask does Google know about the site? Has it been linked to from external places, or submitted? How long since it went live.
If you post (or message me) the URL I'd be happy to take a quick look for you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Meta Robots index & noindex Both Implemented on Website
I don't want few of the pages of website to get indexed by Google, thus I have implemented meta robots noindex code on those specific pages. Due to some complications I am not able to remove meta robots index from header of every page Now, on specific pages I have both codes 'index & noindex' implemented. Question is: Will Google crawl/index pages which have noindex code along with index code? Thanks!
On-Page Optimization | | Exa0 -
Strange google indexing behaviour
Hi all Looking for a second opinion on a strange issue with has occurred on my site. The site is a magento store and because I am using all the default merchant descriptions at the moment I have noindexed the product pages (there are 300k products, the plan is to rewrite the content as we go, starting with most popular sellers). The Gbot is blocked from the pages and all the products have header tag. We forgot to noindex the popular search terms page on the site and as a result google has indexed some search result pages - we may keep this open, not sure yet, We are seeing a very strange thing in the serps. Google has indexed the search result pages, as mentioned above, however, the description and title tag being used do not belong to that page, they belong to the product page the search result links to. If i do a search in google for the indexed pages i get the categories and lots of, what appears to be, product pages. https://www.google.co.uk/search?q=site:arropa.co.uk/store&espv=2&biw=1536&bih=772&ei=LE5xVd3qA4HlUNnggKgH&start=250&sa=N One would assume that a page listed with the title of Ladies 1 Pair Young Trasparenze Mumbai Animal Print . and the description of Come on, program a little of your crazy side! Part of the edgy, sassy Young Trasparenze Medley, these soft touch, nontransparent stockings function a crazy, (along with the price) would be an entry for that individual product. However, clicking on that product opens up a search results page (very slowly as the site is processing an update still - it is not for public use thus far) which can be seen here http://arropa.co.uk/store/catalogsearch/result/?q=+ladies+1+pair+young+trasparenze+mumbai+animal+print+tights+75+off+military+l+ yes, the search result page is for that particular item but nowhere on the page is the title, description and price, nor has it ever been. Am a little puzzled about this and what it would do re duplicate content as im using the manufacturer data at present. Ideally i would like to keep the search results pages open. Any thoughts would be most welcome. Couple of things to note. Im aware the site is too slow for general public use. It will be fully cached once running, as i say, it has 300k+ products so isn't small. Also, am aware that there are no images. They exist but we are moving the images around, hence being down. Always a fun task when there are 25gb of the things!! Many thanks Carl
On-Page Optimization | | WonkyDog0 -
Title Tags for Index Pages
What tactics do you use to change the title tags of your index page so they're not all the same? For example, if you have an index page that has 100 pages, each with the same title, what tactics do you use to give each page a unique title and how important is it?
On-Page Optimization | | felt0 -
Page Not Indexed
Hi Guys I wrote and published an article last night on my site but it is yet to be indexed. This is strange as articles are usually indexed pretty quickly. Could you have a quick look and see what the problem is? http://www.rankmytri.com/tomtom-running-and-triathlon-watch/ Also all my Blog posts (in the blog section of the site) are not indexed as well (and I dont think they have been for a while) yet I dont have any messages from Google in my webmaster tools. Thoughts? Thanks in advance Ross
On-Page Optimization | | ross88guy0 -
Google Index/Cashe questions
I have 15k+ pages. I have 4.5k pages indexed. What relation is the google cashe to indexing pages? My site gets cashed every two days. The competition in my SERP goes 2-3weeks to get cashed. What does this indicate? Is your cashe date your last google crawl? How can I get google to crawl my site? Is there a way I can get google to crawl my site starting from an internal page. This way I could set up a better linking structure that would benefit from doing activities that get that page indexed to help get my site indexed more thoroughly...
On-Page Optimization | | JML11790 -
Problem with Occurrences of Keyword
At "On-Page Report " i have noticed that the only important problem my site has is the "Occurrences of Keyword " it says that i have ONLY 14.156 keyword repeat. My page ofcourse does not have so many repeats of same keyword. In fact this keyword is shown 10 times as i saw at source code of this page i tested. This report is for one page or for all pages of domain ? My keyword was two words keyword if that matters but there is no way that keywords to be repeated so many times.
On-Page Optimization | | Web-Builders0 -
Can you find the "problem" metric or metrics?
I just dropped from #2 for my main keyword to #5 and am not sure why. My companies Ranking Metrics compared to top 5 Page Authority:#2, MozRank:#1, MozTrust:#1, MT/MR:#1, Total Links: #1, Internal Links:#1, External Links: #1, Followed Links:#1, No Follow Links: #1, Linking Root Domains:#1, OnPageAnalysis Grade: "A", Broad Keyword usage: Yes, Broad keyword in document:
On-Page Optimization | | Boodreaux0 -
New CMS system - 100,000 old urls - use robots.txt to block?
Hello. My website has recently switched to a new CMS system. Over the last 10 years or so, we've used 3 different CMS systems on our current domain. As expected, this has resulted in lots of urls. Up until this most recent iteration, we were unable to 301 redirect or use any page-level indexation techniques like rel 'canonical' Using SEOmoz's tools and GWMT, I've been able to locate and redirect all pertinent, page-rank bearing, "older" urls to their new counterparts..however, according to Google Webmaster tools 'Not Found' report, there are literally over 100,000 additional urls out there it's trying to find. My question is, is there an advantage to using robots.txt to stop search engines from looking for some of these older directories? Currently, we allow everything - only using page level robots tags to disallow where necessary. Thanks!
On-Page Optimization | | Blenny0