How was cdn.seomoz.org configured?
-
The SEOmoz CDN appears to have a "pull zone" that is set to the root of the domain, such that any static file can be addressed from either subdomain:
http://www.seomoz.org/q/moz_nav_assets/images/logo.png
http://cdn.seomoz.org/q/moz_nav_assets/images/logo.png
The risk of this configuration is that web pages (not just images/CSS/JS) also get cached and served by the CDN. I won't put the URL here for fear of Google indexing it, but if you replace the 'www' in the URL below with 'cdn', you'll see a cached copy of the original:
http://www.seomoz.org/ugc/the-greatest-attribution-ever-graphed
The worst-case scenario is that the homepage gets indexed. But this doesn't happen here:
That URL issues a 301 redirect back to the canonical www subdomain. As it should.
Here's my question: how was that done?
Because maxcdn.com can't do it. If you set a "pull zone" to your entire domain, they'll cache your homepage and everything else. googlebot has a field day with that; it will reindex your entire site off the CDN.
Maybe the SEOmoz CDN provider (CloudFront) allows specific URLs to be blocked? Or do you detect the CloudFront IPs and serve them a 301 (which they'd proxy out to anyone requesting cdn.seomoz.org)?
One solution is to create a pull zone that points to a folder, like example.com/images... but this doesn't help a complex site that has cacheable content in multiple places (do you Wordpress users really store ALL your static content under /wp-content/ ?).
Or, as suggested above, dynamically detect requests from the CDN's proxy servers, and give them a 301 for any HTML-page request. This gets complex quickly, and is both prone to breakage and very difficult to regression-test.
Properly retrofitting a complex site to use a CDN, without creating a half-dozen new CDN subdomains, does not appear to be easy.
-
its a SEOmoz secret...
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should We Add the W3.org Language Tag To Every Page Or Just The Home Page?
Greetings, We have five international sites around the world, two of which are in difference languages. Currently we have the following line of html code on the home page of each of the sites: Clearly, we need to change the "en" portion for the sites that aren't in English, but, should we include that meta tag in each of the site's pages, or will the home page suffice. Thanks!
Intermediate & Advanced SEO | | CSawatzky0 -
Where does Schema.org Microdata go on a page?
Say you've got a Magento e-commerce site and you want to add Schema.org Microdata to it to take advantage of Google's Rich Snippets feature. Would the markup be part of the page's HTML TITLE . . . or somewhere in the bare-bones description (usually wrought by inputting data into separate fields in the CMS), e.g., Item: Something
Intermediate & Advanced SEO | | RScime25
Price: $00.00
Short Description: blah, blah, blah Or, hidden somewhere in the header? Or, can it be marked-up somewhere beneath my lengthy (and Panda-friendly) content and subsequently extracted by Google and highlighted in the SERPs? Admittedly, I'm more than a bit late to the Schema.org party and I'm a content-guy anyway; and not much good at under-the-hood stuff. I figure I'd better get my chops together, now. I've searched Moz.com's Q&A as well as Google's and Schema.org's and haven't come up with an answer yet that doesn't require that I learn a whole new vocabulary.0 -
How can I improve my rankings in Google with help of seomoz
Hey guys, I have joined seomoz today and set up campaigns for my sites. I got reports about keyword rankings, errors, notices etc. But I am still confused about how to use seomoz in order to improve my rankings. My point is does seomoz provides any services for improving position in google or simply seomoz provides only reporting? These reports are good but my ultimate goal to join seomoz is to improve my rankings for my each website and each post. Please help. BJ
Intermediate & Advanced SEO | | intmktcom0 -
SEOMoz and Facebook Graph Search
Are SEOMoz looking to integrate Facebook Graph Search (the web search section) into the product? At the moment we can measure and track rankings for Google, Bing/Yahoo, but not Facebook graph search. What are the general thoughts among the community? Do you think it will be adopted as a real search engine? I'm not overly concerned - I reckon it will take a lot to change people behaviour and have them moving away from the other search engines. It's throwing up some interesting results though in searches!
Intermediate & Advanced SEO | | littlesthobo0 -
Schema.org and YouTube Videos
Hi, Does anyone know how to implement schema.org markup with YouTube embedded videos? Thanks Carlos
Intermediate & Advanced SEO | | Carlos-R0 -
All In One SEO PACK Configuration - Index or Noindex?
I'm finding conflicting information about the right way to configure the All in One SEO Pack wordpress plugin. Do I index or noindex for the items below? Use noindex for Categories - yes or no? Use noindex for Archives - yes or no? Use noindex for Tag Archives - yes or no?
Intermediate & Advanced SEO | | webestate0 -
Microdata / Schema.org and HTTPS
I have a quick question regarding Microdata / Schema.org files that are not hosted on secure connections. I receive a receive a security error from my e-commerce site because the code references the schema over HTTP instead of HTTPS.<div< span="">itemscope itemtype="http://schema.org/Product"></div<>This is not the first time I have run into this issue. We also use MRSS schema for an RSS feed from yahoo and the same thing happens.<div< span="">xmlns:media="http://search.yahoo.com/mrss"></div<>The problem mainly lies in the fact that these schemas are not hosted over HTTPS. If you add HTTPS to the beginning of both you will get a security error.Just wondering if anyone else has dealt with this or similar issue and what the "best practices" are around this?Is it ok to obtain the schema directly and then host it on our server, over our secure connection?Thanks!
Intermediate & Advanced SEO | | AnthonyMangia0 -
Setting up .org and .net supplements
To help with DSEO we are consdering setting up a .org and .net of our brand name (the primary site runs off .com obviously). We are thinking that the net/org sites will: Be one page that is optimized as we need (standard stuff like copy, title, alt-tags, meta desc, etc.) Will be hosted on separate C-blocks Will have a google sitemap that is submitted to Google Webmaster Central Provides a link back to .com Any other suggestions? Are separate C-blocks necessary? Thanks, b
Intermediate & Advanced SEO | | bcmull0