Error 406 with crawler test

ArchieDonnithorne

hi to all. I have a big problem with the crawler of moz on this website: www.edilflagiello.it.

On july with the old version i have no problem and the crawler give me a csv report with all the url but after we changed the new magento theme and restyled the old version, each time i use the crawler, i receive a csv file with this error:

"error 406"

Can you help me to understan wich is the problem? I already have disabled .htacces and robots.txt but nothing.

Website is working well, i have used also screaming frog as well.

ArchieDonnithorne

thank you very much Dirk. this sunday i try to fix all the error and next i will try again. Thanks for your assistance.

DirkC

I noticed that you have a Vary: User-Agent in the header - so I tried visiting your site with js disabled & switched the user agent to Rogerbot. Result: the site did not load (turned endlessly) and checking the console showed quite a number of elements that generated 404's. In the end - there was a timeout.

Try screaming frog - set user agent to Custom and change the values to

Name:Rogerbot

Agent: Mozilla/5.0 (compatible; rogerBot/1.0; UrlCrawler; http://www.seomoz.org/dp/rogerbot)

It will be unable to crawl your site. Check your server configuration - there are issues in how you deal with the Mozbot useragent.

Check the attached images.

Dirk

2emkZWc qWVIqVo jb5FpQK

ArchieDonnithorne

nothing. After i fix all the 40x error the crawler is always empty. Any other ideas?

ArchieDonnithorne

thanks, i'm wait another day

axel.loesken

I know the Crawl Test reports are cached for about 48 hours so there is a chance that the CSV will look identical to the previous one for that reason.

With that in mind, I'd recommend waiting another day or two before requesting a new Crawl Test or just waiting until your next weekly campaign update, if that is sooner

ArchieDonnithorne

i have fixed all error but csv is always empty and says:

http://www.edilflagiello.it,2015-10-21T13:52:42Z,406 : Received 406 (Not Acceptable) error response for page.,Error attempting to request page

here the printscreen: http://www.webpagetest.org/result/151020_QW_JMP/1/details/

Any ideas? Thanks for your help.

ArchieDonnithorne

thanks a lot guy! I'm going to check this errors before next crawling.

axel.loesken

Great answer Dirk! Thanks for helping out!

Something else I noticed is that the site is coming back with quite a few errors when I ran it through a 3rd party tool, W3C Markup Validation Service and it also was checking the page as XHTML 1.0 Strict which looks to be common in other cases of 406 I've seen.

DirkC

If you check your page with external tools you'll see that the general status of the page is 200- however there are different elements which generate a 4xx error (your logo generates a 408 error - same for the shopping cart) - for more details you could check this http://www.webpagetest.org/result/151019_29_14E6/1/details/.

Remember that Moz bot is quite sensitive for errors -while browsers, Googlebot & Screaming Frog will accept errors on page, Moz bot stops in case of doubt.

You might want to check the 4xx errors & correct them - normally Moz bot should be able to crawl your site once these errors are corrected. More info on 406 errors can be found here. If you have access to your log files you could check in detail which elements are causing the problems when Mozbot is visiting your site.

Dirk

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Error 406 with crawler test

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Crawl tests stuck in queue

How do can the crawler not access my robots.txt file but have 0 crawler issues?

Error: 804 : HTTPS (SSL) error encountered when requesting page

Crawl Test Takes Long Time

Ww.domain.com coming up with error

Getting 'Sorry, but that URL is inaccessible' error msg when trying to run On-Page Grader

When attempting to crawl my site, I'm getting the error: Oops! That URL doesn’t resolve, which means your report will be blank. Please fix the issue or change the URL. What's going on here?

Crwal errors : duplicate content even with canonical links