What is the sense of robots.txt?

jallenyang

Using robots.txt to prevent search engines from indexing a page is not a good idea, so what is the point of robots.txt? Is it just for attracting robots to crawl the sitemap?

RyanKent

While your robots.txt file is not the best means to control search engines, it does have a purpose. To respond to your questions:

The file does not "attract" any robots, but robots that do visit can learn a bit about your site and understand what content you don't wish to be crawled.
You can block parts of your site that you feel have no value for indexing, such as the "print" versions of pages that Keri mentioned, overlay pages, login pages, etc.

The idea is that you own the website, and you can have a measure of control over it. You can disallow specific crawlers, for example, although it's up to each crawler whether it actually respects your wishes.
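
To make the two points above concrete, here is a minimal sketch of how a well-behaved crawler might read a robots.txt file, using Python's standard `urllib.robotparser` module. The file contents, the paths, and the "BadBot" and "ExampleBot" names are made-up assumptions for illustration, not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one named crawler entirely, and keep
# every other crawler out of the print and login pages.
ROBOTS_TXT = """\
User-agent: BadBot
Disallow: /

User-agent: *
Disallow: /print/
Disallow: /login
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)


def allowed(agent: str, path: str) -> bool:
    """Return True if the rules permit `agent` to crawl `path`."""
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for agent, path in [
        ("ExampleBot", "/article-1"),
        ("ExampleBot", "/print/article-1"),
        ("ExampleBot", "/login"),
        ("BadBot", "/article-1"),
    ]:
        print(agent, path, allowed(agent, path))
```

Note that this check runs on the crawler's side. Nothing on the server enforces it, which is why a crawler that chooses to ignore the file can still fetch the disallowed pages.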

More details can be read at: http://www.robotstxt.org/

KeriMorgret

There are often pages you don't want indexed, and that's what robots.txt is there for. These are just some things you may not want indexed:

Premium content for subscription-only members
Your admin directory
Printable versions of pages
Development servers

You keep the things you don't want indexed out of the index, and you also don't waste the search engines' crawl budget on content you don't want in the engines in the first place.
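
As a rough illustration of the list above, the sketch below generates robots.txt text from a list of paths and then checks the result with Python's standard `urllib.robotparser`. The helper names `render_robots_txt` and `is_crawlable`, the directory names, and the "ExampleBot" user agent are all hypothetical; your own site layout will differ.

```python
from urllib.robotparser import RobotFileParser


def render_robots_txt(disallowed_paths, user_agent="*"):
    """Build robots.txt text with one Disallow line per path."""
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Disallow: {path}" for path in disallowed_paths]
    return "\n".join(lines) + "\n"


def is_crawlable(robots_txt, path, agent="ExampleBot"):
    """Check a path against robots.txt text, as a compliant crawler would."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


# Assumed layout for a production site: premium, admin, and printable pages.
MAIN_SITE = render_robots_txt(["/premium/", "/admin/", "/print/"])

# A development server would typically disallow everything.
DEV_SERVER = render_robots_txt(["/"])

if __name__ == "__main__":
    print(MAIN_SITE)
    print(is_crawlable(MAIN_SITE, "/blog/some-post"))
    print(is_crawlable(MAIN_SITE, "/admin/users"))
    print(is_crawlable(DEV_SERVER, "/blog/some-post"))
```

Keep in mind that these rules only ask crawlers to stay away. The admin directory and premium content still need real access control, since the listed paths remain reachable by anyone who requests them directly.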
