Staging site - Treated as duplicate?
-
Last week (exactly 8 days ago to be precise) my developer created a staging/test site to test some new features. The staging site duplicated the entire existing site on the same server.
To explain this better -My site address is - www.mysite.com
The path of the new staging site was www.mysite/staging
I realized this only today and have immediately restricted robot text and put a no index no follow on the entire duplicate server folder but I am sure that Google would have indexed the duplicate content by now?
So far I do not see any significant drop in traffic but should I be worried? and what if anything can I do at this stage?
-
Yes, it would show up in your analytics as an active user but the fact that the query returns no results means it's not been indexed. All good.
Peter
-
Hey Peter,
The Analytics code could have helped to get the site indexed. Or even a G +1/Facebook Like/share/Stumble/etc button clicked by error.
@Rajat
Doing the search Peter suggested should return any indexed page.
-
Got it. No, no results show up but interestingly when I go to www.mysite.com/staging, it does show up as 1active user on analytic report, which is what got me worried and made me realize of this problem.
-
Hi Rajat,
No what I mean is put the following query into the search box
site:<yourdomainname>/<yourstagingfolder></yourstagingfolder></yourdomainname>
where yourdomainname is your domain name (e.g. mysite.com) and yourstagingfolder is your staging folder (e.g. staging), so ike this:
site:mysite.com/staging
Peter
-
Thanks Pete. When I search for mysite.com/staging on google, I only see mysite.com as first result...and nothing at all on staging. Is that what you mean I should check?
-
Hi Rajat
The analytics code may have given some signals to Google of pages to index but to test it the staging server's pages are in Google use site:mysite.com/staging (NB. no spaces between site and the domain name).
Peter
-
Thanks Federico. That's re-assuring. Also, a related point, since the whole site was duplicated, so was the Google analytic code.
Does that have any impact?
Also, is there a way to check if the test server was in fact indexed or not?
-
Hi Rajat
I agree with Federico. Also, if there was no active link on mysite.com to mysite.com/staging then it's unlikely Google would have found it unless the staging site had been submitted to Google via a sitemap for indexing. You should be fine.
Peter
-
You have done the necessary steps (disallowing in robots plus setting a noindex tag). There's shouldn't be anything to worry about. If you want to be entirely sure, you can add some HTTP authentication to the folder so only those knowing the credentials can access (you could find that some robots may not follow the disallow flag or noindex tag).
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Rel canonical on other page instead of duplicate page. How Google responds?
Hi all, We have 3 pages for same topics. We decided to use rel canonical and remove old pages from search to avoid duplicate content. Out of these 3 pages....1 and 2 type of pages have more similar content where 3 type don't have. Generally we must use rel canonical between 1 and 2. But I am wondering what happens if I canonical between 1 and 3 while 2 has more similar content? Will Google respects it or penalise as we left the most similar page and used other page for canonical. Thanks
Algorithm Updates | | vtmoz0 -
Will Google penalize 2 sites for targeting "like" keyword phrases?
I own (2) different websites, one an HTML site that has been live for 20 years and a ecommerce site that has been live for 7 years. We sell custom printed (branded) tents for use at trade shows and other indoor and outdoor events. While our ecomm site targets "trade show" tents our HTML site targets "event" tents. I believe that the keyword phrases are dissimilar enough that targeting "trade show tents" on one site and "event tents" on the other should not cause Google to penalize one or the other or both sites for having similar content. The content is different on both sites. I'm wondering if anyone has experience with, or opinions on, my thoughts... either way. Thanks,
Algorithm Updates | | terry_tradeshowstuff
Terry Hepola0 -
Ecommerce sites 30% drop in organic since the spring
I help manage SEO for a number of large retail websites and we've seen a significant drop in organic traffic (upwards of 30%) since around May 2015. It's likely we were hit my Google's Phantom Quality update, but I don't understand why it had such a big impact. Can anyone explain that Google update in more depth and advise on steps to take to recover from it? Thank you.
Algorithm Updates | | JimLynch0 -
Your search - site:domain.com - did not match any documents.
I've recently started work on a new clients website and done some preliminary work with on-page optimisation, and there is still plenty of work to be done and issues to resolve. They are ranking ok on Bing, but they are not getting any ranking on Google at all (except paid) - I tried the site:domain.com search and comes up with no results... so this confirms that something is going on with the google search rank! Can anyone shed light on what can cause this or why this would happen? My next step is to look at their webmaster tools (haven't had access yet), but if anyone has any tips to resolve this or where to look, that would be great! Thanks!
Algorithm Updates | | ElevateCreativeAU0 -
Should I use canonical tags on my site?
I'm trying to keep this a generic example, so apologies if this is too vague. On my main website, we've always had a duplicate content issue. The main focus of our site is breaking down to specific, brick and mortar locations. We have to duplicate the description of product/service for every geographic location (this is a legal requirement). So for example, you might have the parent "product/service" page targeting the term, and then 100's of sub pages with "product/service San Francisco", "product/service Austin", etc. These pages have identical content except for the geographic location is dynamically swapped out. There is also additional useful content like google map of area, local resources, etc. As I said this was always seen as an SEO issue, specifically you could see in the way that googlebot would crawl pages and how pagerank flowed through the site that having 100's of pages with identical copy and just swapping out the geographic location wasn't seen as good content, however we still always received traffic and conversions for the long tail geographic terms so we left it. Las year, with Panda, we noticed a drop in traffic and thought it was due to this duplicate issue so I added canonical tags to all our geographic specific product/service pages that pointed back to the parent page, that seemed to be received well by google and traffic was back to normal in short order. However, recently what I notice a LOT in our SERP pages is if I type in a geographic specific term, i.e. "product/service san francisco", our deep page with the canonical tag is what google is ranking. Google inserts its own title tag on the SERP page and leaves the description blank as it doesn't index the page due to the canonical tag on the page. Essentially what I think it is rewarding is the site architecture which organizes the content to the specific geo in the URL: site.com/service/location/san-francisco. Other than that there is no reason for it to rank that page. Sorry if this is lengthy, thanks for reading all of that! Essentially my question is, should I keep the canonical tags on the site or take them off since Google insists on ranking the page? If I am ranking already then the potential upside to doing that is ranking higher (we're usually in the 3-6 spot on the result page) and also higher CTR because we can get a description back on our resulting page. The counter argument is I'm already ranking so leave it and focus on other things. Appreciate your thoughts on this!
Algorithm Updates | | edu-SEO0 -
Is there a utility that can tell me what keywords my site already ranks high for?
Ok... so I'm looking for a way to understand what my site already ranks high for.. I don't necessarily want to have to manually type in keywords. The purpose of this exercise is to demonstrate to a client what keywords they're already ranking high for. Is there an easy way / tool to go about doing this? Thanks in advance, Gene
Algorithm Updates | | BGroup0 -
Large site with faceted navigation using rel=canonical, but Google still has issues
First off, I just wanted to mention I did post this on one other forum so I hope that is not completely against the rules here or anything. Just trying to get an idea from some of the pros at both sources. Hope this is received well. Now for the question..... "Googlebot found an extremely high number of URLs on your site:" Gotta love these messages in GWT. Anyway, I wanted to get some other opinions here so if anyone has experienced something similar or has any recommendations I would love to hear them. First off, the site is very large and utilizes faceted navigation to help visitors sift through results. I have implemented rel=canonical for many months now to have each page url that is created based on the faceted nav filters, push back to the main category page. However, I still get these damn messages from Google every month or so saying that they found too many pages on the site. My main concern obviously is wasting crawler time on all these pages that I am trying to do what they ask in these instances and tell them to ignore and find the content on page x. So at this point I am thinking about possibly using robots.txt file to handle these, but wanted to see what others around here thought before I dive into this arduous task. Plus I am a little ticked off that Google is not following a standard they helped bring to the table. Thanks for those who take the time to respond in advance.
Algorithm Updates | | PeteGregory0 -
Risks associated with having multiple similar ecom sites together under the same analytics account?
Any downsides to having multiple (similar) eCommerce sites linked to the same Google Analytics account? Traffic splitting or other penalties? I've heard a range of answers from "Yes, traffic was split between my two first-page ranked sites, it was awful" to "no, Google couldn't care less/ they'd be able to tell if your sites were related outside of having them in the same account anyways" Any info would be much apprecaited 🙂 Thanks!
Algorithm Updates | | apo11o1770