/$1 URL Showing Up

HDI

Whenever I crawl my site with any kind of bot or a sitemap generator over my site. it comes up with /$1 version of my URLs. For example:

It gives me hdiconference.com & hdiconference.com/$1 and hdiconference.com/purchases & hdiconference.com/purchases/$1

Then I get warnings saying that it's duplicate content. Here's the problem: I can't find these /$1 URLs anywhere. Even when I type them in, I get a 404 error. I don't know what they are, where they came from, and I can't find them when I scour my code.

So, I'm trying to figure out where the crawlers are picking this up. Where are these things? If sitemap generators and other site crawlers are seeing them, I have to assume that Googlebot is seeing them as well.

Any help? My developers are at a loss as well.

HDI

Perfect. Thanks for the help, guys!

STPseo

If you can't find them, you could put a disallow in your robots.txt files to keep them from being crawled.

Daylan

I had a similar issue and found it was due to (in the case of a MozPro crawl at least) the bot crawling a JS command in the head. One of the commands included an anchor tag that was being read as a link rather than in context of the java script command. Check your JS files/scripts. It might be in there somewhere.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

/$1 URL Showing Up

Browse Questions

Explore more categories

Related Questions

What do you do with product pages that are no longer used ? Delete/redirect to category/404 etc

Home Page Being Indexed / Referral URLs /

Tools/Software that can crawl all image URLs in a site

Google not indexing /showing my site in search results...

Disallow: /search/ in robots but soft 404s are still showing in GWT and Google search?

What are the SEO implications of URLs that use a # in them?

Getting multiple errors for domain.com/xxxx/xxxx/feed/feed/feed/feed...

Duplicate Homepage: www.mysite.com/ and www.mysite.com/default.aspx