Rogerbot's crawl behaviour vs google spiders and other crawlers - disparate results have me confused.
-
I'm curious as to how accurately rogerbot replicates google's searchbot
I've currently got a site which is reporting over 200 pages of duplicate/titles content in moz tools. The pages in question are all session IDs and have been blocked in the robot.txt (about 3 weeks ago), however the errors are still appearing.
I've also crawled the page using screaming frog SEO spider. According to Screaming Frog, the offending pages have been blocked and are not being crawled. Webmaster tools is also reporting no crawl errors.
Is there something I'm missing here? Why would I receive such different results. Which one's should I trust? Does rogerbot ignore robot.txt? Any suggestions would be appreciated.
-
Thanks for your response. I was beginning to think this question had been left to rot.
I'm not getting any errors in WMT. What is concerning is that Roger is returning almost 300 errors of dupe content, which is obviously a problem. Screaming frog is no longer finding the pages (they've been blocked in the robot.txt) I guess what I'm trying to ask here is how can I be sure that my dupe content has been effectively blocked from google's spider.
Is there anyway to check?
Thanks for your help.
-
I've see similar concerns from others, it seems "rogerbot" does ignore certain things that other bots consider.
Don't worry about it, if it's not being flagged in WMT it shouldn't be an issue.
Take Roger as a guide rather than an iron fist bot like googlebot.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What's the best tool to use to compare competirors
A client of ours has asked us to compare their search rankings to competitors. What's the best tool to use in SEOMoz to do this?
Moz Pro | | BillyBobGriffin0 -
Is SEOmoz spider ignoring my redirect?
My client was previously serving their website from both a .co.uk and a .com domain. The DNS for each of these domains was pointing to the same place, rather than redirecting. I saw this as a potential duplicate content problem so I set the .co.uk to 301 redirect on to the .com. As a user, the 301 seems to be working correctly. However, now that I have done this, SEOmoz is picking up thousands of "inbound" links from the .co.uk domain. Essentially, every single link on the internal site, is being duplicated in my stats as an inbound link as well. It appears that the spider is ignoring the redirect. I'm not sure if it's a legitimate issue that will upset Google too, or if it's just a bug with SEOMoz's spider.
Moz Pro | | MadisonSolutions0 -
Change the labels' name
Hello, I defined various labels and linked them to groups of keywords. Nevertheless, I'd like to change the name of one label, but it seems to be impossible. How could I do ? Thanks,
Moz Pro | | soliste690 -
Google Webmaster Tools and Open Site explorer's links not matching up
My question is why do my GWT's links not match up to the ones on Open Site Explorer. I watched John Mueller's video, and he said that they had problems with the link counts recently. After checking my links today, I can see that the problem is fixed, but my link count differs from Open Site Explorer. On Web Master Tools I have 307 links, but on Open Site Explore I have 42. Has anyone dealt with this problem? thanks. Peter
Moz Pro | | PeterRota0 -
Crawl Diagnostics - unexpected results
I received my first Crawl Diagnostics report last night on my dynamic ecommerce site. It showed errors on generated URLs which simply are not produced anywhere when running on my live site. Only when running on my local development server. It appears that the Crawler doesn't think that it's running on the live site. For example http://www.nordichouse.co.uk/candlestick-centrepiece-p-1140.html will go to a Product Not Found page, and therefore Duplicate Content errors are produced. Running http://www.nhlocal.co.uk/candlestick-centrepiece-p-1140.html produces the correct product page and not a Product Not Found page Any thoughts?
Moz Pro | | nordichouse0 -
What's Happened To OSE's External Links For My Site?
Hi, I'm just taking my first steps with Open Site Explorer. I've hit a problem that I'd be really grateful for some help with. I'm running a website (that I didn't create) and want to get a clear picture of the inbound links. When I enter the URL into the OSE search bar, there are no external domains listed under the 'Inbound Links' and 'Linking Domains' tabs. Only internal site links are registered. Setting the filters, "only external" + "pages on this root domain", under the Inbound Links tab, does bring up a list of external sites. But I haven't had to enter filters just to see a list of inbound links with any other sites. Also, the breakdown under the Linking Domains tab remains the same - only links within my site are shown. Does this ring a bell? Any ideas what I might be doing wrong, or what might be wrong with my site to cause this problem? Cheers for helping, Josh
Moz Pro | | JoshAustin460 -
Crawl Diagnostics Summary
Sorry if I am not asking in the right place. On Crawl Diagnostics Summary it says this right..?? : "To get you started quickly Roger is crawling up to 250 pages on your site. You should see these results within two hours. The full crawl will complete within 7 days.". so it's passed a day and it still doesn't show nothing. It says "Processing Crawl Data for 358 pages" How much should i wait??
Moz Pro | | Dussk0 -
What's name of SEOmoz and Open Site Explorer robots?!
I would like to exclude in robots.txt SEOmoz and Open Site Explorer bots to don't let them index my sites… what's their names?
Moz Pro | | cezarylech0