Googlebot soon to be executing javascript - Should I change my robots.txt?

danatanseo

This question came to mind as I was pursuing an unrelated issue and reviewing a site's robots/txt file.

Currently this is a line item in the file:

Disallow: https://*

According to a recent post in the Google Webmasters Central Blog: [http://googlewebmastercentral.blogspot.com/2014/05/understanding-web-pages-better.html](http://googlewebmastercentral.blogspot.com/2014/05/understanding-web-pages-better.html "Understanding Web Pages Better") Googlebot is getting much closer to being able to properly render javascript. Pardon some ignorance on my part because I am not a developer, but wouldn't this require Googlebot be able to execute javascript? If so, I am concerned that disallowing Googlebot from the https:// versions of our pages could interfere with crawling and indexation because as soon as an end-user clicks the "checkout" button on our view cart page, everything on the site flips to https:// - If this were disallowed then would Googlebot stop crawling at that point and simply leave because all pages were now https://  ???  Or am I just waaayyyy over thinking it?...wouldn't be the first time!  Thanks all! [](http://googlewebmastercentral.blogspot.com/2014/05/understanding-web-pages-better.html "Understanding Web Pages Better")

danatanseo

Excellent answer. Thanks so much Doug. I really appreciate it! Adding a "nofollow" attribute to the Checkout button is a good suggestion and should be fairly easy to implement. I realize that internal nofollows are not normally recommended, but in this instance, may not be a bad idea.

DougRoberts

Hi Dana,

When you click on the checkout button - what's the mechanism for taking people to the https:// site. Is it just that the checkout link uses https:// in it's link? Is there some javascript wizardry you're particularly concerned about?

Even though googlebot follows this one link to the https version of the cart, it will still have all the other links on the previous page queued up to follow (non-https) so I don't think this will stop the crawl at that point. It would be a nightmare if googlebot stopped crawling hte entire site everytime it went down a rabbit hole!

That's not to say that you wouldn't want to consider no-following your checkout button. I'm sure neither you, nor google want to the innards of the cart pages to be indexed? There's probably other pages you'd rather Googlebot spent it's time finding right?

My take on the Google blog about understanding Javascript is that the aim is to try and do a better job discovering content that might be hidden by Javascript/Ajax. It's a problem for google when the raw html that they're crawling doesn't accurately reflect the content that is displayed in front of a real visitor.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Googlebot soon to be executing javascript - Should I change my robots.txt?

Browse Questions

Explore more categories

Related Questions

Have you ever seen or experienced a page indexed which is actually from a website which is blocked by robots.txt?

Homepage title tag: "Keywords for robots" vs "Phrases for users"

404s in Google Search Console and javascript

Big change to title tags in SERPs for me, anyone else?

Adding the link masking directory to robots.txt?

Changes in Google "Site:" Search Algorithm Over Time?

Javascript hidden divs, links to anchor content

New Algorithm changes