How can I make it so that robots.txt is not ignored due to a URL re-direct?

rodelmo4

Recently a site moved from blog.site.com to site.com/blog with an instruction like this one:

/etc/httpd/conf.d/site_com.conf:94: ProxyPass /blog http://blog.site.com
/etc/httpd/conf.d/site_com.conf:95: ProxyPassReverse /blog http://blog.site.com

It's a Wordpress.org blog that was set as a subdomain, and now is being redirected to look like a directory. That said, the robots.txt file seems to be ignored by Google bot. There is a Disallow: /tag/ on that file to avoid "duplicate content" on the site. I have tried this before with other Wordpress subdomains and works like a charm, except for this time, in which the blog is rendered as a subdirectory. Any ideas why? Thanks!

rodelmo4

Hi there,

No, haven't tried it yet, but we'll give it a shot. Thanks!

JordanLowry

Have you thought about adding rel canonicals by chance? Also, how do you know the robots.txt is being ignored are the page showing up in search results? If so maybe the syntax is incorrect in your robots.txt file. Check out robotstxt.org

Gaston Riera

Hi Rocio,

Have you tried YOAST SEO plugin? It has an option to ad to the tags.
That's the easiest way I'd go for.

Best Luck.
GR.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

How can I make it so that robots.txt is not ignored due to a URL re-direct?

Browse Questions

Explore more categories

Related Questions

Robots.txt Disallow: / in Search Console

Robots.txt vs. meta noindex, follow

Meta-robots Nofollow

301 permanent re-direct

301 Re-direct as referral traffic

How can I best find out which URLs from large sitemaps aren't indexed?

Can you 404 any forms of URL?

Url re-write / minimal subfolders