Disallow my store in robots.txt?

Intermediate & Advanced SEO

raywhite last edited by
Should I disallow my store directory in robots.txt?

Here is the URL: https://www.stdtime.com/store/

Here are my reasons for suggesting this:
1. SEOMOZ finds crawl "errors" in there that I don't care about
2. I don't think I care if the search engines index those pages
3. I only have one product, and it is not an impulse buy
4. My product has a 60-day sales cycle, so price is less important than features
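
For reference, here is a minimal sketch of what disallowing the store directory would mean in practice. The robots.txt content below is an assumption (the thread does not show the site's actual file), and `is_crawlable` is a hypothetical helper name. It uses Python's standard `urllib.robotparser` to check which URLs a compliant crawler would be allowed to fetch.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt content that would block the /store/ directory
# for every crawler. This is not the site's real file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /store/
"""


def is_crawlable(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if a compliant crawler may fetch `url` under these rules."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The store URL from the question, then the site root for comparison.
    print(is_crawlable(ROBOTS_TXT, "https://www.stdtime.com/store/"))
    print(is_crawlable(ROBOTS_TXT, "https://www.stdtime.com/"))
```

Note that a Disallow rule only stops crawling. It does not by itself guarantee that a blocked URL stays out of the index.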
AlanMosley last edited by

You seem to have evaluated things well. I will give you one reason why you should not: links pointing to the blocked pages will be pouring their link juice away, because a crawler that cannot fetch a page cannot follow the links on it. You are better off using a noindex,follow meta tag; at least then the link juice will flow back out through the links when they are followed.

Robots.txt is a nasty tool to use; you need a more surgical approach.
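
As a rough illustration of the noindex,follow approach, the sketch below shows the standard robots meta tag in a made-up page and a small check that it is present. The page markup, `RobotsMetaParser`, and `robots_directives` are hypothetical names introduced for this example; only Python's standard `html.parser` module is used.

```python
from html.parser import HTMLParser

# Hypothetical store page using the robots meta tag instead of a
# robots.txt Disallow rule. The markup is an assumption for illustration.
PAGE_HTML = """\
<html><head>
<title>Store</title>
<meta name="robots" content="noindex, follow">
</head><body><a href="/features/">Features</a></body></html>
"""


class RobotsMetaParser(HTMLParser):
    """Collects the directives found in <meta name="robots"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None for bare attributes, so default to "".
        attr_map = {name.lower(): (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() != "robots":
            return
        for token in attr_map.get("content", "").split(","):
            token = token.strip().lower()
            if token:
                self.directives.add(token)


def robots_directives(html: str) -> set:
    """Return the set of robots meta directives declared in the page."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


if __name__ == "__main__":
    print(sorted(robots_directives(PAGE_HTML)))
```

For the tag to have any effect, the page must remain crawlable: a crawler that is disallowed in robots.txt never fetches the page and so never sees the meta tag.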

Related Questions

What happens to crawled URLs subsequently blocked by robots.txt?

We have a very large store with 278,146 individual product pages. Since these are all various sizes and packaging quantities of fewer than 200 product categories, my feeling is that Google would be better off making sure our category pages are indexed. I would like to block all product pages via robots.txt until we are sure all category pages are indexed, then unblock them. Our product pages rarely change and have no ratings or product reviews, so there is little reason for a search engine to revisit a product page. The sales team is afraid that blocking a previously indexed product page will result in it being removed from the Google index, and would prefer to submit the categories by hand, 10 per day, via requested crawling. Which is the better practice?
Intermediate & Advanced SEO | AspenFasteners

Discontinued products on ecommerce store

Hi, I have a high number of product pages with very low or zero traffic and zero backlinks that have been discontinued (and won't come back). For these pages we automatically remove them from our website indexes and have also removed internal links. We essentially kept the product pages and their URLs intact but added a note saying "no longer available, how about these..." with alternative similar product options. This seems to be the general consensus online for discontinued product pages that have little value. The question is: do I either 404 or noindex these now-discontinued pages? What are the pros and cons? Thanks
Intermediate & Advanced SEO | coma99

Use Canonical or Robots.txt for Map View URL without Backlink Potential

I have a Page X with lots of unique content. This page has a "Map view" option, which displays some of the info from Page X, but a lot is omitted. Questions: Should I add a canonical even though the Map View URL does not display a lot of the info from Page X, or should I add it to robots.txt, or use noindex, follow? I don't see any backlinks coming to the Map View URL. Should the Map View page have a unique H1, title tag, and meta description?
Intermediate & Advanced SEO | khi5

The "webmaster" disallowed all ROBOTS to fight spam! Help!!

One of the companies I do work for has a Magento site. I am simply the SEO guy, and they work on the website through some developers who hold access to their systems VERY tightly. Using Google Webmaster Tools I saw that the robots.txt file was blocking ALL robots. I immediately e-mailed them and received a long reply about foreign robots and scrapers slowing down the website. They told me I would have to provide a list of only the good robots to allow in robots.txt. Please correct me if I'm wrong, but isn't robots.txt optional? Won't a bad scraper or bot still bog down the site? Shouldn't that be handled in .htaccess or something different? I'm not new to SEO, but I'm sure some of you who have been around longer have run into something like this and could provide some suggestions or resources I could use to plead my case! If I'm wrong, please help me understand how we can meet both needs: allowing bots to visit the site while preventing the 'bad' ones. Their claim is that the site is bombarded by tons and tons of bots that have slowed down performance. Thanks in advance for your help!
Intermediate & Advanced SEO | JoshuaLindley

Using Meta Header vs Robots.txt

Hey Mozzers, I am working on a site that has search-friendly parameters for their faceted navigation, however this makes it difficult to identify the parameters in a robots.txt file. I know that using the robots.txt file is highly recommended and powerful, but I am not sure how to do this when facets are using common words such as sizes. For example, a filtered url may look like www.website.com/category/brand/small.html Brand and size are both facets. Brand is a great filter, and size is very relevant for shoppers, but many products include "small" in the url, so it is tough to isolate that filter in the robots.txt. (I hope that makes sense). I am able to identify problematic pages and edit the Meta Head so I can add on any page that is causing these duplicate issues. My question is, is this a good idea? I want bots to crawl the facets, but indexing all of the facets causes duplicate issues. Thoughts?
Intermediate & Advanced SEO | evan89

Can't find X-Robots tag!

Hi all. I've been checking out http://www.unthankbooks.com/ as it seems to have some indexing problems. I ran a server header check and got a 200 response. However, it also shows the following: X-Robots-Tag: noindex, nofollow. It's not in the page HTML though. Could it be being picked up from somewhere else?
Intermediate & Advanced SEO | Blink-SEO

Robots.txt 404 problem

I've just set up a WordPress site with a hosting company who only allow you to install your WordPress site in http://www.myurl.com/folder as opposed to the root folder. I now have the problem that the robots.txt file only works at http://www.myurl.com/folder/robots.txt. Of course Google is looking for it at http://www.myurl.com/robots.txt and getting a 404 error. How can I get around this? Is there a way to tell Google in Webmaster Tools to use a different path to locate it? I'm stumped.
Intermediate & Advanced SEO | SamCUK

Removing Duplicate Content Issues in an Ecommerce Store

Hi all. OK, I have an ecommerce store and there is a load of duplicate content, which is pretty much the norm with ecommerce store setups. For example, this is my problem:
http://www.mystoreexample.com/product1.html
http://www.mystoreexample.com/brandname/product1.html
http://www.mystoreexample.com/appliancetype/product1.html
http://www.mystoreexample.com/brandname/appliancetype/product1.html
http://www.mystoreexample.com/appliancetype/brandname/product1.html
All of the above lead to the same product. I also want to keep the breadcrumb path to the product.
Here's my plan:
Add a canonical URL to the product page, e.g. http://www.mystoreexample.com/product1.html. This way I have a short product URL.
Noindex all duplicate pages but do follow the internal links so the pages are spidered.
What are the other options available and recommended? Does that make sense? Is this what most people are doing to remove duplicate content pages? Thanks 🙂
Intermediate & Advanced SEO | ChriSEOcouk