Robots.txt advice

eLab_London

Hey Guys,

Have you ever seen coding like this in a robots.txt, I have never seen a noindex rule in a robots.txt file before - have you?

user-agent: AhrefsBot

User-agent: trovitBot
User-agent: Nutch
User-agent: Baiduspider
Disallow: /

User-agent: *
Disallow: /WebServices/
Disallow: /*?notfound=
Disallow: /?list=
Noindex: /?*list=
Noindex: /local/
Disallow: /local/
Noindex: /handle/
Disallow: /handle/
Noindex: /Handle/
Disallow: /Handle/
Noindex: /localsites/
Disallow: /localsites/
Noindex: /search/
Disallow: /search/
Noindex: /Search/
Disallow: /Search/
Disallow: ?

I have never seen a noindex rule in a robots.txt file before - have you?
Any pointers?

Martijn_Scheijbeler

Never seen this, doubt it's any useful as this isn't part of any search engines recommended statements to use. I don't think this would have any impact on what search engine robots would look at as it's not a statement in the robots.txt documentation.

Tylerj

Best I could find was-

Unlike disallowed pages, noindexed pages don’t end up in the index and therefore won’t show in search results. Combine both in robots.txt to optimise your crawl efficiency: the noindex will stop the page showing in search results, and the disallow will stop it being crawled

From-https://www.deepcrawl.com/blog/best-practice/robots-txt-noindex-the-best-kept-secret-in-seo/

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Robots.txt advice

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

SEO'ing a sports advice website

Scary bug in search console: All our pages reported as being blocked by robots.txt after https migration

Please Help me! I need advice for my website

Screaming frog Advice

.htaccess question/opinion/advice needed

Do I need to disallow the dynamic pages in robots.txt?

Robots.txt 404 problem

Should I robots block this directory?