Robots.txt file

seoug_2005

Does it make any difference if we omit the robots.txt file? If a spider has to read all the pages anyway, why do we add a robots.txt file at all?

Greenman

As Ryan said, the robots.txt file is very useful when you want to block (disallow) some pages. Indeed, if you don't want a spider to crawl your page, you must use robots.txt (a noindex tag still lets the bot crawl your page, it just won't index it). I have a small website, but I dropped a robots.txt in my root folder anyway. Writing just Allow: / is probably useless, but at least you can say: "I respect protocols".
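To make the difference between a blocking rule and a bare Allow: / concrete, here is a minimal sketch using Python's standard urllib.robotparser module. The file contents, the ExampleBot name, the example.com URL, and the load_rules helper are all made up for illustration; the sketch only shows how a crawler that honors the file would read it.

```python
from urllib.robotparser import RobotFileParser


def load_rules(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt content from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


# Hypothetical file that blocks one directory for every crawler.
blocking = load_rules("User-agent: *\nDisallow: /private/\n")

# Hypothetical "I respect protocols" file that allows everything.
allow_all = load_rules("User-agent: *\nAllow: /\n")

# No rules at all, comparable to a site without a robots.txt file.
no_rules = load_rules("")

# Hypothetical crawler name and URL, used only for this illustration.
page = "https://www.example.com/private/report.html"
for name, rules in [("blocking", blocking), ("allow_all", allow_all), ("no_rules", no_rules)]:
    print(name, rules.can_fetch("ExampleBot", page))
```

Under these assumptions only the first file stops the crawler from fetching the page; the Allow: / file and the empty rule set behave the same way, which is why a lone Allow: / adds nothing in practice.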

RyanKent

A good source to learn about the robots.txt file is here: http://www.robotstxt.org/

The robots.txt file is completely optional. I don't use the file at all on small sites.

The file offers a means to block crawlers from crawling all or part of a site, provided those crawlers choose to honor the file's instructions. It can also provide the location of a sitemap.
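As a rough illustration of both uses, the sketch below parses a made-up robots.txt with Python's standard urllib.robotparser module (site_maps() needs Python 3.8 or later). The BadBot and GoodBot names, the paths, and the sitemap URL are assumptions for the example. Note that the check runs on the crawler's side: nothing in the file forces a crawler to perform it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one crawler is shut out entirely, all others
# are kept out of /admin/, and a sitemap location is advertised.
ROBOTS_TXT = """\
User-agent: BadBot
Disallow: /

User-agent: *
Disallow: /admin/

Sitemap: https://www.example.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# A well-behaved crawler asks before fetching; the answer is advisory only.
print(parser.can_fetch("BadBot", "https://www.example.com/blog/"))
print(parser.can_fetch("GoodBot", "https://www.example.com/blog/"))
print(parser.can_fetch("GoodBot", "https://www.example.com/admin/login"))

# Sitemap locations listed in the file (None if the file lists no sitemap).
print(parser.site_maps())
```

Here BadBot is refused everywhere, GoodBot is refused only under /admin/, and the sitemap URL is returned as a one-element list.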

That said, sitemaps are completely unnecessary for SEO, assuming your site has proper navigation. Even if you choose to use a sitemap, you can submit its location via Webmaster Tools (WMT) rather than the robots.txt file.

With respect to blocking areas of your site, the primary use would be for CMS, forum, ecommerce or other sites where the software is limited and does not allow the site owner to use noindex on all pages.

As a rule, robots.txt should simply never be used except as a means of last resort. In my experience the file is overused by site owners and SEOs. One exception where I use a robots.txt is during a site's development when I do not wish the site to be crawled at all.
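The development-time exception could be handled along these lines. This is only a sketch under assumptions: build_robots_txt and its in_development flag are hypothetical names, and how the returned text gets served at /robots.txt depends on your own setup.

```python
from urllib.robotparser import RobotFileParser


def build_robots_txt(in_development: bool) -> str:
    """Return robots.txt content for the current stage of the site.

    Hypothetical helper: block everything while the site is being built,
    and block nothing once it is live.
    """
    if in_development:
        # "Disallow: /" asks every compliant crawler to stay away entirely.
        return "User-agent: *\nDisallow: /\n"
    # An empty Disallow value means nothing is blocked.
    return "User-agent: *\nDisallow:\n"


def is_crawlable(robots_txt: str, url: str) -> bool:
    """Check how a compliant crawler would interpret the generated file."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch("ExampleBot", url)  # ExampleBot is a made-up name


home = "https://www.example.com/"
print(is_crawlable(build_robots_txt(in_development=True), home))
print(is_crawlable(build_robots_txt(in_development=False), home))
```

Keeping the switch in one place makes it harder to launch with the blocking version still in effect.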
