SSL and robots.txt question - confused by Google guidelines
-
I noticed "Don’t block your HTTPS site from crawling using robots.txt" here: http://googlewebmastercentral.blogspot.co.uk/2014/08/https-as-ranking-signal.html
Does this mean you can't use robots.txt anywhere on the site - even parts of a site you want to noindex, for example?
-
Hi Luke,
Just make sure that your robots.txt file located at https://www.example.com/robots.txt doesn't block search engine spiders. Of course there may be some folders or filetypes you want to block but it certainly shouldn't look like below which would block everything:
User-agent: *
Disallow: /
Hope that helps
-
No that's not what they mean - it means Google recommends you allow the secure version of your site(where applicable) to be crawled. You can still block certain pages/sections should you choose to do so.
With regards to noindexing you could also place this on the actual page as an alternative.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Scary bug in search console: All our pages reported as being blocked by robots.txt after https migration
We just migrated to https and created 2 days ago a new property in search console for the https domain. Webmaster Tools account for the https domain now shows for every page in our sitemap the warning: "Sitemap contains urls which are blocked by robots.txt."Also in the dashboard of the search console it shows a red triangle with warning that our root domain would be blocked by robots.txt. 1) When I test the URLs in search console robots.txt test tool all looks fine.2) When I fetch as google and render the page it renders and indexes without problem (would not if it was really blocked in robots.txt)3) We temporarily completely emptied the robots.txt, submitted it in search console and uploaded sitemap again and same warnings even though no robots.txt was online4) We run screaming frog crawl on whole website and it indicates that there is no page blocked by robots.txt5) We carefully revised the whole robots.txt and it does not contain any row that blocks relevant content on our site or our root domain. (same robots.txt was online for last decade in http version without problem)6) In big webmaster tools I could upload the sitemap and so far no error reported.7) we resubmitted sitemaps and same issue8) I see our root domain already with https in google SERPThe site is https://www.languagecourse.netSince the site has significant traffic, if google would really interpret for any reason that our site is blocked by robots we will be in serious trouble.
Intermediate & Advanced SEO | | lcourse
This is really scary, so even if it is just a bug in search console and does not affect crawling of the site, it would be great if someone from google could have a look into the reason for this since for a site owner this really can increase cortisol to unhealthy levels.Anybody ever experienced the same problem?Anybody has an idea where we could report/post this issue?0 -
Fetch as Google
I have odd scenario I don't know if anyone can help? I've done some serious speed optimisation on a website, amongst other things CDN and caching. However when I do a Search Console Fetch As Google It is still showing 1.7 seconds download time even though the cached content seems to be delivered in less than 200 ms. The site is using SSL which obviously creams off a bit of speed, but I still don't understand the huge discrepancy. Could it be that Google somehow is forcing the server to deliver fresh content despite settings to deliver cache? Thanks in advance
Intermediate & Advanced SEO | | seoman100 -
Use Canonical or Robots.txt for Map View URL without Backlink Potential
I have a Page X with lots of unique content. This page has a "Map view" option, which displays some of the info from Page X, but a lot is ommitted. Questions: Should I add canonical even though Map View URL does not display a lot of info from Page X or adding to robots.txt or noindex, follow? I don't see any back links coming to Map View URL Should Map View page have unique H1, title tag, meta des?
Intermediate & Advanced SEO | | khi50 -
Google snippet chosen why?
We have a page about buying property in the Megeve area of the Alps in France. We are No.2 on Google.co.uk for the term "megeve property for sale" and No.1 for "megeve property". http://www.prestigeproperty.co.uk/MegeveProperty/Properties.asp If you search for "megeve property for sale", Google serves our META description as the snippet: Ski chalets, homes and apartments for sale in this exclusive, prestigious Rhone Alpes village - 520000-16500000 EUR. However, we noticed that searching for just "megeve property" serves up a much better snippet taken from the text on the page: A crucial factor for potential property buyers is that there is a strong rental market in Megève and this remains high all year around with properties close to the ... Does anyone know why Google would serve this particular snippet instead of the META description. Is it the number of strong and descriptive words used, or some other reason?
Intermediate & Advanced SEO | | PPGUKLTD0 -
Google Fetch Issue
I'm having some problems with what google is fetching and what it isn't, and I'd like to know why. For example, google IS fetching a non-existent page but listing it as an error: http://www.gaport.com/carports but the actual url is http://www.gaport.com/carports.htm. Google is NOT able to fetch http://www.gaport.com/aluminum/storage-buildings-10x12.htm. It says the page doesn't exist (even though it does) and when I click on the not found link in Google fetch it adds %E@%80%8E to the url causing the problem. One theory we have is that this may be some sort of server/hosting problem, but that's only really because we can't figure out what we could have done to cause it. Any insights would be greatly appreciated. Thanks and Happy Holidays! Ruben
Intermediate & Advanced SEO | | KempRugeLawGroup0 -
Information Architecture Question
I've got a site architecture / branding / SEO question for my own site (http://www.strikemodels.com/). In brief, the site sells kits and accessories for model warships that shoot and sink each other. My husband (Stephen) runs the business, and makes many of the parts we sell in our workshop/garage. Stephen wants to have a section where he talks about the equipment he is building/ using, and give updates on each of the pieces. This is equipment we use to make products, not equipment that we sell. For example, he's building an EDM machine, and getting a plastic injection molding machine and an ultrasonic welder up and running. We have a blog section where we post about updates about items that we sell, how to use our products, etc. This is more of a place for him to talk about what he's doing in the shop, and would also serve in future years as something he could point people to regarding his skills as an engineer if needed. I'm looking for opinions and options as to where to put this. Is there a way to use a different category in the blog and have items in the blog show up under a different page if they're in the "Stephen's Corner" category? Other options would be a separate site just for that, or to do threads on the a forum dedicated to the hobby. I'd prefer to keep things on our own site to keep all of the benefits together. Thoughts on structure or ways to make this work? Things I hadn't thought about? Thanks!
Intermediate & Advanced SEO | | KeriMorgret0 -
Not ranking on Bing but is on Google?
Hi What are the main differences between Bing and Google in terms of ranking sites? My site is ranking well in Google but in Bing it is very low down and does not deliver much traffic. In Bing webmaster tools there are no warning messages and I had sent in a sitemap back in 2011 and 77 pages are listed, but I had not submitted a URL could this be why my pages are not ranking highly? Or does anybody have a checklist on what a site should offer to get ranking on Bing?
Intermediate & Advanced SEO | | ocelot0 -
Canonical Tag - Question
Hey, I will give a thumbs up and best answer to whoever answers my question correctly. The Canonical Tag is supposed to solve Duplication which is fine. My questions are: Does the Canonical Tag make the PR / Link Juice flow differently? If I have john.long.com/home and john.long.com but put a Canonical Tag on john.long.com/home reading john.long.com then what does this do? Does it flow the Link Equity back to john.long.com? Can you use the Canonical Tag to change PR flow in any means? If I had john.long.com/washing-machines and john.long.com/kids-toys... If I put a Canonical Tag on john.long.com/kids-toys reading john.long.com/washing-machines then would the PR from /kids-toys flow to /washing-machines or would Google just ignore this? (The pages are completely different in this example and content is completely different). Thank you.
Intermediate & Advanced SEO | | AdiRste0