Robots.txt advice

eLab_London

Hey Guys,

Have you ever seen coding like this in a robots.txt, I have never seen a noindex rule in a robots.txt file before - have you?

user-agent: AhrefsBot

User-agent: trovitBot
User-agent: Nutch
User-agent: Baiduspider
Disallow: /

User-agent: *
Disallow: /WebServices/
Disallow: /*?notfound=
Disallow: /?list=
Noindex: /?*list=
Noindex: /local/
Disallow: /local/
Noindex: /handle/
Disallow: /handle/
Noindex: /Handle/
Disallow: /Handle/
Noindex: /localsites/
Disallow: /localsites/
Noindex: /search/
Disallow: /search/
Noindex: /Search/
Disallow: /Search/
Disallow: ?

I have never seen a noindex rule in a robots.txt file before - have you?
Any pointers?

Martijn_Scheijbeler

Never seen this, doubt it's any useful as this isn't part of any search engines recommended statements to use. I don't think this would have any impact on what search engine robots would look at as it's not a statement in the robots.txt documentation.

Tylerj

Best I could find was-

Unlike disallowed pages, noindexed pages don’t end up in the index and therefore won’t show in search results. Combine both in robots.txt to optimise your crawl efficiency: the noindex will stop the page showing in search results, and the disallow will stop it being crawled

From-https://www.deepcrawl.com/blog/best-practice/robots-txt-noindex-the-best-kept-secret-in-seo/

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Robots.txt advice

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Very Old Pages Creeping Up - Advice

Large robots.txt file

Robots.txt - blocking JavaScript and CSS, best practice for Magento

Screaming frog Advice

SEO advice with having a blog on sub domain.

Advice needed on how to handle alleged duplicate content and titles

Any advice for setting up a Job Board?

Blocking Dynamic URLs with Robots.txt