VisitSweden indexing error
-
Hi all
Just got a new site up about weekend travel for VisitSweden, the official tourism office of Sweden. Everything went just fine except som issues with indexing.
The site can be found here at weekend.visitsweden.com/no/
For some weird reason the "frontpage" of the site does not get indexed. What I have done myself to find the issue:
- Added sitemaps.xml
- Configured and added site to webmaster tools
- Checked 301s so they are not faulty
By doing a simple site:weekend.visitsweden.com/no/ you can see that the frontpage is simple not in the index. Also by doing a cache:weekend.visitsweden.com/no/ I see that Google tries to index the page without the trailing /no/ for some reason.
http://webcache.googleusercontent.com/search?q=cache:http://weekend.visitsweden.com/no/
Any smart ideas to get this fixed or where to start looking?
All help greatly appreciated
Kind regards
Fredrik
-
Oh my God Fred!! the weekend sub-domain has been completely blocked from being crawled using a robots.txt file sitting in the root of the sub-domain.
http://weekend.visitsweden.com/robots.txt
User-agent: * Disallow: / Please remove the '/' from there **immediately**
-
Hi Fred,
I just copied my first response:
Here is your redirection setup:
http://weekend.visitsweden.com/ being redirected via 301 to
http://weekend.visitsweden.com/no being redirected via 301 to
http://weekend.visitsweden.com/no/
So, I would suggest you to remove the interim URL without the trailing slash after 'no'. Let the original homepage, http://weekend.visitsweden.com/ also be redirected to http://weekend.visitsweden.com/no/ (the one with trailing slash) via 301.
So your redirection setup should be as follows:
http://weekend.visitsweden.com/ - via 301 to - http://weekend.visitsweden.com/no/
Essentially, we are eliminating the redirection loop here. Please try this and post back.
Best regards,
Devanur Rafi
-
Hi
Again thanks for your quick response. Unfortunately we still have the same issue even though we have performed many checks and tests. Any more smart ideas on how this can be corrected?
Regards
Fredrik
-
Hi Fred,
Please wait for at least 2 weeks for the change to reflect in Google. This happens and depends on how popular your site is in terms of link profile, DA, PA etc..I still see "http://weekend.visitsweden.com/no" (without trailing slash) in Google's index. Let us wait for sometime. Nothing to worry about it.
-
Hi again
The weirdest this is that it does not seem to update. When I do a site:weekend.visitsweden.com/no/ the page is still nowhere to be found.
https://www.google.no/?gws_rd=ssl#q=site:weekend.visitsweden.com%2Fno%2F
Any ideas?
Again thanks
Fredrik
-
Hi Fred, now its perfect. It should soon reflect in Google and you will be able to see it in site: search. Good Luck my friend.
Best regards,
Devanur Rafi
-
Thanks for the great input! Have now tried to do the changes as per your suggestion.
Does it look better now?
Again thanks
Fredrik
-
Dear Fred,
Here is your redirection setup:
http://weekend.visitsweden.com/ being redirected via 301 to
http://weekend.visitsweden.com/no being redirected via 301 to
http://weekend.visitsweden.com/no/
So, I would suggest you to remove the interim URL without the trailing slash after 'no'. Let the original homepage, http://weekend.visitsweden.com/ also be redirected to http://weekend.visitsweden.com/no/ (the one with trailing slash) via 301.
So your redirection setup should be as follows:
http://weekend.visitsweden.com/ - via 301 to - http://weekend.visitsweden.com/no/
This should fix the issue. Essentially, we are eliminating the redirection loop here.
By the way, both the URLs, with and without trailing slash appear in Google when searched with the following queries:
Best regards,
Devanur Rafi
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Fetch as Google -- Does not result in pages getting indexed
I run a exotic pet website which currently has several types of species of reptiles. It has done well in SERP for the first couple of types of reptiles, but I am continuing to add new species and for each of these comes the task of getting ranked and I need to figure out the best process. We just released our 4th species, "reticulated pythons", about 2 weeks ago, and I made these pages public and in Webmaster tools did a "Fetch as Google" and index page and child pages for this page: http://www.morphmarket.com/c/reptiles/pythons/reticulated-pythons/index While Google immediately indexed the index page, it did not really index the couple of dozen pages linked from this page despite me checking the option to crawl child pages. I know this by two ways: first, in Google Webmaster Tools, if I look at Search Analytics and Pages filtered by "retic", there are only 2 listed. This at least tells me it's not showing these pages to users. More directly though, if I look at Google search for "site:morphmarket.com/c/reptiles/pythons/reticulated-pythons" there are only 7 pages indexed. More details -- I've tested at least one of these URLs with the robot checker and they are not blocked. The canonical values look right. I have not monkeyed really with Crawl URL Parameters. I do NOT have these pages listed in my sitemap, but in my experience Google didn't care a lot about that -- I previously had about 100 pages there and google didn't index some of them for more than 1 year. Google has indexed "105k" pages from my site so it is very happy to do so, apparently just not the ones I want (this large value is due to permutations of search parameters, something I think I've since improved with canonical, robots, etc). I may have some nofollow links to the same URLs but NOT on this page, so assuming nofollow has only local effects, this shouldn't matter. Any advice on what could be going wrong here. I really want Google to index the top couple of links on this page (home, index, stores, calculator) as well as the couple dozen gene/tag links below.
Intermediate & Advanced SEO | | jplehmann0 -
How long should it take for indexed pages to update
Google has crawled and indexed my new site, but my old URLS appear in the search results. Is there a typical amount of time that it takes for Google to update the URL's displayed in search results?
Intermediate & Advanced SEO | | brianvest0 -
Images Sitemap GWT - not indexed?
So we went ahead and created an image sitemap of 2387 images, one for each product - I was hoping it would give us better exposure in image results. No joy, over 7 days and they only showing as "sent" but not "indexed". Any ideas?
Intermediate & Advanced SEO | | bjs20100 -
Town and County pages taking months to index.
Hi, At http://www.general-hypnotherapy-register.com/regional-hypnotherapy-directory/ we have a load of town and county pages for all of the hypnotherapists on the site a) I have checked all of these links and they are spiderable. b) About a month back I noticed after the site changes, not entirely sure why, but the site was generating rogue pages, eg http://www.general-hypnotherapy-register.com/hypnotherapists/page/5/?town=barnsley instead of http://www.general-hypnotherapy-register.com/hypnotherapists/?town=barnsley We have added meta no index, no follow to these rogue pages around 4 weeks ago..however these pages still have a google cache date of Oct 4th predating these meta changes c) There are examples of the pages we do want, indexed, and ranking too on page 1, site:www.general-hypnotherapy-register.com/hypnotherapists eg http://www.general-hypnotherapy-register.com/hypnotherapists/?town=ockham however these pages are few and far between, these have a recent google cache date of Nov 1 **d) **The xml sitemap has all of the correct URLS, but in webmaster tools, the amount of pages indexed has been stubbornly flat at 2800 out of 4400 for 4 weeks now e) Query Paramaters: for ?town and ?county in webmaster tools, are set to Yes/Specifies Would love any suggestions, Thanks. Mark.
Intermediate & Advanced SEO | | Advantec0 -
Is this link being indexed?
link text Deadline: Monday, Sep 30, 2013 link text I appreciate the help guys!
Intermediate & Advanced SEO | | jameswalkerson0 -
Wordpress error
On our Google Webmaster Tools I'm getting a Severe Health Warning regarding our Robot.txt file reading: User-agent: *
Intermediate & Advanced SEO | | NileCruises
Crawl-delay: 20 User-agent: 008
Disallow: / I'm wondering how I can fix this and stop it happening again. The site was hacked about 4 months ago but I thought we'd managed to clear things up. Colin0 -
Indexing an e-commerce site
Hi all, My client babyblingstreet.com. She sells baby and toddler clothing. Now a lot of the links on her site contain the same products. For instance: if you go to "What's new" you can find those same products in let's say her "Sale Items" link category. The real problem with this is let's say my client sells a green dress and someone accesses it through the "baby and toddler dresses" category. And let's say this URL has 10 links pointing to it. Now, let's say someone else accesses this same green dress through the "What's new" category. And let's say this particular URL has 10 links pointing to it. Instead of having 20 links pointing to one URL about the green dress, I now have 10 links pointing to one URL and 10 pointing to another URL even though both URLs feature the exact same green dress. In this particular example I would want to make the URL of the green dress in the "baby and toddler clothing" section be the canonical URL. So that means I would have to use this canonical tag on the green dress URL that's in the "what's new" category and let's say also the "sale items" category. This could get very tedious if my client has 200+ products. So I am wondering if I have to place a canonical tag on every URL that displays the green dress? More importantly, I would like to know other people's strategies for indexing e-commerce sites that have the same product featured in multiple categories throughout the site. I hope this makes sense. Thanks for your time.
Intermediate & Advanced SEO | | jenga110 -
202 error page set in robots.txt versus using crawl-able 404 error
We currently have our error page set up as a 202 page that is unreachable by the search engines as it is currently in our robots.txt file. Should the current error page be a 404 error page and reachable by the search engines? Is there more value or is it a better practice to use 404 over a 202? We noticed in our Google Webmaster account we have a number of broken links pointing the site, but the 404 error page was not accessible. If you have any insight that would be great, if you have any questions please let me know. Thanks, VPSEO
Intermediate & Advanced SEO | | VPSEO0