Blocking AJAX Content from being crawled

AU-SEO

Our website has some pages with content shared from a third-party provider, and we load that content via AJAX. We don't want Google to crawl the third party's content, but we do want it to crawl and index the rest of the page. However, in light of Google's recent announcement about more effectively indexing AJAX content, I'm concerned that the third-party content is at risk of being indexed.

I have thought about X-Robots-Tag, but I'm wary of implementing it on these pages because of the risk that Google stops indexing the whole page. These pages get significant traffic for the website, and I can't risk losing that.

Thanks,

Phil

BryceHoward

Hey Phil. I think I understand your situation, but just to be clear, I'm presuming you have URLs exposing third-party JSON/XML content that you don't want Google to index. Probably the most foolproof method for this case is the "X-Robots-Tag" HTTP header convention (http://code.google.com/web/controlcrawlindex/docs/robots_meta_tag.html). I would recommend going with "X-Robots-Tag: none", which should do the trick (I really don't think "noarchive" or other options are required if they're not indexing it at all). Because the header is sent only on the responses for those AJAX URLs, the pages that load them are not affected and can still be indexed as normal. You'll need to modify your server-side scripts to do this. I'm assuming it won't be much pain for you (or the third party?) to make that change. Hope this helps! ~bryce
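To make that concrete, here is a minimal sketch of the idea in Python, using only the standard library. It is an illustration and not your actual setup: the path prefix `/ajax/third-party/`, the helper `robots_headers`, and the `make_server` function are all hypothetical names I made up for the example. The point is simply that the header is attached to the AJAX responses and nothing else.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical prefix for the AJAX endpoints that return third-party content.
THIRD_PARTY_PREFIX = "/ajax/third-party/"


def robots_headers(path):
    """Return extra response headers for a request path.

    Only the third-party AJAX endpoints get X-Robots-Tag; every other
    URL (including the host pages) gets no extra header.
    """
    if path.startswith(THIRD_PARTY_PREFIX):
        # "none" is shorthand for "noindex, nofollow".
        return {"X-Robots-Tag": "none"}
    return {}


class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Placeholder payload; a real endpoint would return the provider's data.
        body = json.dumps({"path": self.path}).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        for name, value in robots_headers(self.path).items():
            self.send_header(name, value)
        self.end_headers()
        self.wfile.write(body)


def make_server(port=8000):
    """Build (but do not start) a local demo server."""
    return HTTPServer(("127.0.0.1", port), Handler)
```

If you want to try it, call `make_server().serve_forever()` and compare the response headers for a URL under `/ajax/third-party/` with any other URL (for example with `curl -i`). One caveat: Google has to be able to fetch the AJAX URL to see the header at all, so those URLs should not also be disallowed in robots.txt.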
