Can anyone explain why and how these odd URLs could be working?
-
In our GWT and Google Analytics traffic reports, I often see some very oddly formed URLs. Here's an example
http://www.ccisolutions.com/storefront/www.ccisolutions.com
and here's another
http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/CEW.catWhat strikes me about this particular URL is two things:
- It renders this page http://www.ccisolutions.com/StoreFront/category/on-disc-printing, but not with that URL, the URL stays http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/CEW.cat
- When I break this URL into pieces
http://www.ccisolutions.com/StoreFront/category/CEW.cat
and www.ccisolutions.com/StoreFront/CEW.cat,
both redirect to: http://www.ccisolutions.com/StoreFront/category/on-disc-printingThis makes me wonder, is there something (a rule?) in the
backend (maybe the .htaccess file?)that was set up that sayshttp://www.ccisolutions.com/StoreFront/category/CEW.cat
= www.ccisolutions.com/StoreFront/CEW.cat
(or maybe vice versa?), and as a result an odd URL for the page is being
written automatically?This scenario worked on every category page I checked. All had the same results. For example, I tried:
http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/AAA.cat
and it rendered the Live Sound category page, but without redirecting to the
user friendly URL. This URL stayed unchanged in the address barWhen I broke it into pieces, like
http://www.ccisolutions.com/StoreFront/category/AAA.cat
and www.ccisolutions.com/StoreFront/AAA.cat, both of these redirected to http://www.ccisolutions.com/StoreFront/category/sound-video-lighting-equipment-expertsHave any of you ever encountered a problem like this? Any sugeestions as to what might be causing it and how to remedy the problem? It is definitely causing us a duplicate content headache. Thanks!
Dana
-
Thanks George! Fantastic detail and I think between your suggestions and Ben's too we are going to get further to solving this than we've ever gotten before. Perhaps we'll even solve this. That would be so great. As I mentioned, the company identified this problem 4 years before they hired me, and it's never been solved. I feel like part of why I am there as there SEO strategist is to pound away at these problems until they're fixed.
Thanks so much to you both. I can't wait to go in on Monday morning and use these suggestions to solve a five year old problem! Awesome.
I'll let you know what happens. If we fix it, I owe you and Ben dinner! (at the very least)
-
Thanks Ben. No apology necessary, it's all good. Your suggestion in combination with George's could lead us to an answer. This is definitely going to get us closer to finding the problem than we've ever gotten before. The company has been aware of this problem for almost 5 years but hasn't ever identified how to fix it. I've only been there a year now and I'm on the warpath to fix these technical issues. There are so many of them causing duplicate content problems that any SEO I do is undermined by problems like these.
I really really appreciate your reply and suggestions!
Dana
-
I'm not sure what CMS you are using, but I've seen this before with Joomla when setting the SEO Settings in the Global Configuration section of the Administration panel. Specifically, when working with the Apache mod_rewrite setting; which is related to the .htaccess question you had.
There are a number of things wrong with the way some CMS's have set up their redirects and how they present content. You may end up playing with each combination to fix your issue (depending on how you want to fix it).
If I were looking into this, I would do the following:
- I would determine if I was using Joomla. If so, check your configuration.php file and see if you have your domain name provided in the property for "live_site". If you do, try changing this from 'www.ccisolutions.com' (or whatever is there) to the empty string '' (aka just two single quotes).
- If you are not using Joomla, see if there is a configuration file for the CMS you are using and look for something similar to the above.
- If there is not a configuration setting that is providing for this "duplication" of domain name, look at the .htaccess file itself to see why it redirects when you break the URL up, but not when it has a second domain string in the URL (e.g. the second "www.ccisolutions.com").
- Then look at the code for the CMS and see how it interprets your URLs. To me it looks like you are using some sort of MVC framework which is taking each piece of the URL and translating it into variables to determine what content to show (REST-like). When it is parsing the URL, it seems to be looking for the end of the domain name and then taking anything off the end to translate into content.
However you figure out the issue, I suggest looking at how your CMS is actually producing the canonical tag. Right now this URL (http://www.ccisolutions.com/StoreFront/category/www.ccisolutions.com/StoreFront/CEW.cat) is using the following canonical:
rel="canonical" href="on-disc-printing"/>
I don't think that is what you are looking for in your canonical tags.
I hope this helps and answers your questions.
-
Hi Dana,
I wrote the following after assuming , for no reason at all, that you didn't know much about SEO. However having looked at your profile I realized that I was wrong and that my tone is probably a little patronizing. That being said it's 1am over here and I really don't want to rewrite it so please accept my apologies.
If I had to guess (and it is a guess as I'm not technical) I would say it was some badly formed links.
You know how some of your error pages have an Origin parameter (like this one) that say where the page was generated? Well these URLs follow the same format as the error pages that you're finding. It looks like rather than using an absolute link (like http://www.ccisolutions.com/page) the onclick action is actually generating a relative link (so just /page).
When you use a relative link your site adds the partial URL (/page) onto the end of your domain to give you a full URL (http://www.ccisolutions.com + /page = http://www.ccisolutions.com/page). It looks like you're using relative links as if they were static ones. Which is why you have "www.ccisolutions" in each URL twice.
If I had to blame anything it would be whatever is powering your IAFDispatcher however as I haven't been able to replicate your problem I couldn't be certain. If you can track how these URLs were generated by looking at the preceding pages that are sending traffic/bots to them then you should be able to narrow it down to which links are broken.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Broken URL Links
Hi everyone, I have a question regarding broken URL links on my website. Late last year I move my site from an old platform to Shopify, and now have broken URL links giving out 4xx errors. When I look at Moz Pro>Campaigns>Insights>links, I can see the top broken URL links, however there is a difference if copy & paste URL directly from Moz Pro and by Export CSV file. For example below, If I copy and paste links direct from Moz Pro, it has the “http://” in front as below: http://www.thehairhub.com.au/WebRoot/ecshared01/Shops/thehairhub/57F3/1D8F/D244/C675/E27D/AC10/003F/35AD/manic-panic-colours.jpg But when I export the list of links as an CSV file, the http:// is removed. www.thehairhub.com.au/WebRoot/ecshared01/Shops/thehairhub/57F3/1D8F/D244/C675/E27D/AC10/003F/35AD/manic-panic-colours.jpg Another Example below: By copy & paste URL direct from Moz Pro
Technical SEO | | johnwall
http://thehairhub.com.au/Shop-Brands/Vitafive-CPR/CPR-Rescue By export CSV file.
thehairhub.com.au/Shop-Brands/Vitafive-CPR/CPR-Rescue Which one do I use to enter into the “Redirect From” field in Shopify URL Redirects? Do I need to have the http:// in front of the URL? Or is it not required for redirects to work? Kind Regards, John Wall
The Hair Hub0 -
After you remove a 301 redirect that Google has processed, will the new URL retain any of the link equity from the old URL?
Lets say you 301 redirect URL A to URL B, and URL A has some backlinks from other sites. Say you left the 301 redirect in place for a year, and Google had already replaced the old URL with the new URL in the SERPs, would the new URL (B) retain some of the link equity from URL A after the 301 redirect was removed, or does the redirect have to remain in place forever?
Technical SEO | | johnwalkersmith0 -
Dynamic vs. static URLs
Hello Everyone, I'm new here on MOZ and just getting back into SEO (a little bit) after not doing anything 'myself' for a couple of years. Currently my individual URLs show as: https://www.example.com/index.php?l=product_detail&p=107 (dynamic responsive site). I can switch it to a static site, so the individual product pages read as:https://www.example.com/catalog/category name/product name-107.html It's still a long URL, but it would be keyword rich. Some of my current dynamic pages are indexed,and due to an upgrade I had to do several months back, I already have some redirects (301) from my php extensions to the one listed above. This is my long explanation to my following questions: Does having a dynamic or static site matter when ranking in search engines I already have some redirects coming my older site to this dynamic site, so I would have to make more directs from the dynamic site to my static site - is this okay to do? I'm really at a loss, a couple of years ago, I ranked 1-3 (on Page 1) on Google for all my keywords, (all White Hat work), and now I'm into great abyss of no mans land of the internet (ranked on Page 3+) Thank you for any and all help from everyone! ~Sandra
Technical SEO | | rankmenow0 -
301 redirect not working
Hi there! I have recently moved a domain that has been indexed by google and setup redirects so that it forwards to the new domain. It seems like the only redirect that actually is working is the canonical and main domain but every other page and or page nested within a folder are not working. Here is an example of some of the redirects. Am I doing this wrong? It seems to be going to the new domain but can't find the actual pages.... RewriteEngine On
Technical SEO | | twotd
RewriteBase /
RewriteCond %{HTTP_HOST} !agoodsweep.com$ [NC]
RewriteRule ^(.*)$ http://agoodsweep.com/$1 [L,R=301]
redirect 301 woodstoveservicerepair.html http://agoodsweep.com/woodstoveservicerepair/
redirect 301 /westchesterchimney.html http://agoodsweep.com/west-chester-chimney/ Thanks in advance for any help!!0 -
Landing Page URL Structure
We are finally setting up landing pages to support our PPC campaigns. There has been some debate internally about the URL structure. Originally we were planning on URL's like: domain.com /california /florida /ny I would prefer to have the URL's for each state inside a "state" folder like: domain.com /state /california /florida /ny I like having the folders and pages for each state under a parent folder to keep the root folder as clean as possible. Having a folder or file for each state in the root will be very messy. Before you scream URL rewriting :-). Our current site is still running under Classic ASP which doesn't support URL rewriting. We have tried to use HeliconTech's ISAPI rewrite module for IIS but had to remove it because of too many configuration issues. Next year when our coding to MVC is complete we will use URL rewriting. So the question for now: Is there any advantage or disadvantage to one URL structure over the other?
Technical SEO | | briankb0 -
Why Do Transparent Networks Still Work
Hi Mozzers, My client has a major competitor that dominates several industry head terms. A check of their link profile reveals that they have 50 low DA domains that are identical to the main site, the only difference being that they all link to the main domain for these terms. They're not even attempting to disguise the network but it works. Can anyone tell me why? See: www.omega.com/vhpc/
Technical SEO | | waynekolenchuk0 -
Can anyone tell me where my site can be improved
My website is www.theradiatorgallery.com I have been trying to get on 1st page for designer radiators. I am currently ranked 21st just dropped to the 3rd page within the past hour, i was 20th before this. There are some sites that i would think i would rank better than due do i have better on page optimization, higher PA, DA and PR based on seo moz reports. Is there something wrong with my site that i am missing? the sites i feel i should rank better than are; radsnrails.co.uk, warmrooms.co.uk and a couple others Any help would be great! Thanks Guys
Technical SEO | | myloseo0 -
Why google index my IP URL
hi guys, a question please. if site:112.65.247.14 , you can see google index our website IP address, this could duplicate with our darwinmarketing.com content pages. i am not quite sure why google index my IP pages while index domain pages, i understand this could because of backlink, internal link and etc, but i don't see obvious issues there, also i have submit request to google team to remove ip address index, but seems no luck. Please do you have any other suggestion on this? i was trying to do change of address setting in Google Webmaster Tools, but didn't allow as it said "Restricted to root level domains only", any ideas? Thank you! boson
Technical SEO | | DarwinChinaSEO0