SEOMOZ crawl all my pages
-
SEOMOZ crawl all my pages including ".do" (all web pages after sign up )
. Coz of this it finishes all my 10.000 crawl page quota and be exposed to dublicate pages.
Google is not crawling pages that user reach after sign up. Because these are private pages for customers I guess
The main question is how we can limit SEOMOZ crawl bot. If the bot can stay out of ".do" java extensions it'll perfect to starting SEO analysis.
Do you know think about it?
Cheers
Example;
.do java extension (after sign up page) (Google can't crawl)
Normal Page (Google can crawl)
http://magaza.turkcell.com.tr/telefon/Apple-iPhone-3GS-8GB/1001694/.html
-
Hi There,
Thanks for writing in and sorry for the confusion.
It actually isn't possible for the SEOmoz crawler to access pages that require a user login. I went to those URLs and I was able to access the pages with out be logged in as a user, so they don't require user sign up to access them. Since these pages are linked to by other pages on your site and our crawler is not being blocked from these pages and the pages don't actually require a user to be signed in to access them, we will crawl them. I can't say why Google wouldn't be crawling those pages, but there is definitely nothing in place that would stop our crawler from accessing them.
If you would like to stop our crawler from accessing those pages in the future, you may consider adding a disallow directive in your robots.txt file using the user-agent rogerbot.
I hope this helps. Let me know if you have any other questions.
Chiaryn
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Page not ranking because of React.js ?
Hey guys, I'm struggling with this part of my website which uses react.js . My developers used this saying it's much better and much quicker (which I think so too) but we have really low traffic coming from google compared to the other parts of the website (not using react.js). Moz gives me a score of 85% for the page but we get less than 100 visits / day and we were targeting 10.000 visits/day giving the traffic of this section in our competitors website (our whole website has 60.000 visits / day). (Section is online since 3 months now) Can you help me see what is wrong there ? I'm in Belgium so we have the website in 3 languages (FR/NL/EN) but the most important ones are FR & NL. FR : https://gocar.be/fr/prix-voitures-neuves/Audi/A3/A3-Sportback/1-0-TFSI_39CER NL : https://gocar.be/nl/prijzen-nieuwe-wagens/Audi/A3/A3-Sportback/1-0-TFSI_39CER EN : https://gocar.be/en/price-new-cars/Audi/A3/A3-Sportback/1-0-TFSI_39CER Main competitors having a better ranking than us (exemple in FR) : https://www.moniteurautomobile.be/modele--audi--a3/prix.html https://www.vroom.be/fr/prix/audi-a3/citadine-2012/197 Cheers ! Jean-Philippe
Intermediate & Advanced SEO | | Gocar_be0 -
Duplicate Pages #!
Hi guys, Currently have duplicate pages accross a website e.g. https://archierose.com.au/shop/cart**#!** https://archierose.com.au/shop/cart The only difference is the URL 1 has a hashtag and exclamation tag. Everything else is the same. We were thinking of adding rel canonical tags on the #! versions of the page to the correct URLs. But Google doens't seem to be indexing the #! versions anyway. Does anyone know why this is the case? If Google is not indexing them, is there any point adding rel canonical tags? Cheers, Chris https://archierose.com.au/shop/cart#!
Intermediate & Advanced SEO | | jayoliverwright0 -
NoIndex Purchase Page
We ran a ScreamingFrog report of one of our websites and found that there are thousands of instances of a single page with a different URL parameter, for example: purchase.cfm?id=1234
Intermediate & Advanced SEO | | ErnieB
purchase.cfm?id=1235
purchase.cfm?id=1236
purchase.cfm?id=1237 and we do not need purchase.cfm to be indexed for any reason as there is practically no content on that page to begin with, but it's just part of the purchase steps in our website. What is the best way to deal with this for Google & SEO? Should we do a Meta NoIndex of this purchase.cfm page? Thank you.0 -
Incorrect cached page indexing in Google while correct page indexes intermittently
Hi, we are a South African insurance company. We have a page http://www.miway.co.za/midrivestyle which has a 301 redirect to http://www.miway.co.za/car-insurance. Problem is that the former page is ranking in the index rather than the latter. The latter page does index occasionally in the same position, but rarely. This is primarily for search phrases like "car insurance" and "car insurance quotes". The ranking was knocked down the index with Penquin 2.0. It was not ranking at all but we have managed to recover to 12/13. This abnormally has only been occurring since the recovery. The correct page does index for other search terms like "insurance for car". Your help would be appreciated, thanks!
Intermediate & Advanced SEO | | miway0 -
Cleaning bad pages
We have 10,000 of bad pages, which panda could track and penalize us for that. If we delete them we will get 404 error, and after that we could again get penality from G algo. How can i delete them to follow google rules and avoid penalities? If we make redirect of 10k pages with 301 to index, can 10k old pages be treated as duplicate?
Intermediate & Advanced SEO | | bele0 -
NOINDEX listing pages: Page 2, Page 3... etc?
Would it be beneficial to NOINDEX category listing pages except for the first page. For example on this site: http://flyawaysimulation.com/downloads/101/fsx-missions/ Has lots of pages such as Page 2, Page 3, Page 4... etc: http://www.google.com/search?q=site%3Aflyawaysimulation.com+fsx+missions Would there be any SEO benefit of NOINDEX on these pages? Of course, FOLLOW is default, so links would still be followed and juice applied. Your thoughts and suggestions are much appreciated.
Intermediate & Advanced SEO | | Peter2640 -
There's a website I'm working with that has a .php extension. All the pages do. What's the best practice to remove the .php extension across all pages?
Client wishes to drop the .php extension on all their pages (they've got around 2k pages). I assured them that wasn't necessary. However, in the event that I do end up doing this what's the best practices way (and easiest way) to do this? This is also a WordPress site. Thanks.
Intermediate & Advanced SEO | | digisavvy0 -
1 of the sites i work on keeps having its home page "de-indexed" by google every few months, I then apply for a review and they put it back up. But i have no idea why this keeps happening and its only the home page
1 of the sites i work on (www.eva-alexander.com) keeps having its home page "de-indexed" by google every few months, I then apply for a review and they put it back up. But i have no idea why this keeps happening and its only the home page I have no idea why and have never experienced this before
Intermediate & Advanced SEO | | GMD10