Robots.txt file

seoug_2005

Does it make any difference if we omit the robots.txt file? If the spider has to read all the pages anyway, why would we add a robots.txt file?

Greenman

As Ryan said, the robots.txt file is very useful when you want to block (disallow) some pages. Indeed, if you don't want a spider to crawl a page, you must use robots.txt (a noindex tag still lets the bot crawl the page, it just won't index it). I have a small website, but I still dropped a robots.txt in my folder. Writing just Allow: / may be useless, but you can say: "I respect protocols"

RyanKent

A good source to learn about the robots.txt file is here: http://www.robotstxt.org/

The robots.txt file is completely optional. I don't use the file at all on small sites.

The file offers a means to block crawlers from all or part of a site, but only those crawlers which choose to honor the file's instructions. It can also provide the location of a sitemap.
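As a rough illustration of how a well-behaved crawler reads the file, here is a small sketch using Python's standard urllib.robotparser module. The robots.txt content, the example.com URLs, and the "ExampleBot" user agent are all made up for the example; they are assumptions, not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one directory for every crawler
# and advertise a sitemap. Paths and URLs are invented for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/

Sitemap: https://www.example.com/sitemap.xml
"""


def load_rules(text):
    """Parse robots.txt text the way a compliant crawler would."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

# A crawler that honors the file asks before fetching each URL.
blocked = rules.can_fetch("ExampleBot", "https://www.example.com/private/page.html")
allowed = rules.can_fetch("ExampleBot", "https://www.example.com/index.html")
sitemaps = rules.site_maps()  # available in Python 3.8+

print(blocked)   # False: under /private/
print(allowed)   # True: not covered by any Disallow rule
print(sitemaps)  # ['https://www.example.com/sitemap.xml']
```

Note that nothing here enforces anything: the check only matters for crawlers that bother to run it.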

On that point, sitemaps are completely unnecessary for SEO, assuming your site has proper navigation. Even if you choose to use a sitemap, you can submit its location via Webmaster Tools (WMT) rather than the robots.txt file.

With respect to blocking areas of your site, the primary use would be for CMS, forum, ecommerce or other sites where the software is limited and does not allow the site owner to use noindex on all pages.

As a rule, robots.txt should simply never be used except as a means of last resort. In my experience the file is overused by site owners and SEOs. One exception where I use a robots.txt is during a site's development when I do not wish the site to be crawled at all.
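For the development case, a block-everything file is just two lines. The sketch below checks such a file with Python's standard urllib.robotparser; the dev.example.com host, the sample paths, and the "ExampleBot" user agent are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for a site under development:
# every compliant crawler is asked to stay out of the whole site.
DEV_ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""

dev_rules = RobotFileParser()
dev_rules.parse(DEV_ROBOTS_TXT.splitlines())

# Sample paths invented for illustration; each one should be refused.
sample_paths = ["/", "/products/widget.html", "/blog/"]
results = {
    path: dev_rules.can_fetch("ExampleBot", "https://dev.example.com" + path)
    for path in sample_paths
}

for path, ok in results.items():
    print(path, ok)  # every path prints False
```

This only keeps out crawlers that honor the file, and the Disallow: / line has to be removed at launch or the live site will stay blocked too.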
