Robots.txt advice

eLab_London

Hey Guys,

Have you ever seen coding like this in a robots.txt, I have never seen a noindex rule in a robots.txt file before - have you?

user-agent: AhrefsBot

User-agent: trovitBot
User-agent: Nutch
User-agent: Baiduspider
Disallow: /

User-agent: *
Disallow: /WebServices/
Disallow: /*?notfound=
Disallow: /?list=
Noindex: /?*list=
Noindex: /local/
Disallow: /local/
Noindex: /handle/
Disallow: /handle/
Noindex: /Handle/
Disallow: /Handle/
Noindex: /localsites/
Disallow: /localsites/
Noindex: /search/
Disallow: /search/
Noindex: /Search/
Disallow: /Search/
Disallow: ?

I have never seen a noindex rule in a robots.txt file before - have you?
Any pointers?

Martijn_Scheijbeler

Never seen this, doubt it's any useful as this isn't part of any search engines recommended statements to use. I don't think this would have any impact on what search engine robots would look at as it's not a statement in the robots.txt documentation.

Tylerj

Best I could find was-

Unlike disallowed pages, noindexed pages don’t end up in the index and therefore won’t show in search results. Combine both in robots.txt to optimise your crawl efficiency: the noindex will stop the page showing in search results, and the disallow will stop it being crawled

From-https://www.deepcrawl.com/blog/best-practice/robots-txt-noindex-the-best-kept-secret-in-seo/

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Robots.txt advice

Browse Questions

Explore more categories

Related Questions

What happens to crawled URLs subsequently blocked by robots.txt?

Search Results Pages Blocked in Robots.txt?

Robots.txt Blocking - Best Practices

Huge increase in server errors and robots.txt

Massive URL blockage by robots.txt

Will disallowing in robots.txt noindex a page?

Need advice for indexing a multilingual website

Blocking Pages Via Robots, Can Images On Those Pages Be Included In Image Search