Noindex, nofollow on a blog since 2009
-
Just reviewed a WordPress blog that was launched in 2009 but somehow the privacy setting was to not index it, so all this time there's been a noindex, nofollow meta tag in the header. The client couldn't figure out why masses of content wasn't showing up in search results.
I've fixed the setting and assume Google will spider in short order; the blog is a subdirectory of their main site. My question is whether there is anything else I can or should do. Can Google recognize the age of the content, or that it once had a noindex meta tag? Will it "date" the blog as of today? Has the client lost out on untold benefits from the long history of content creation? I imagine that link juice from any backlinks to the blog will now flow back to the main site; think that's true?
Just curious what others might think of this scenario and whether any other action is warranted.
-
Thanks Dan. One thing I found interesting is that Google Webmaster Tools doesn't offer any alerts about pages that aren't indexed because of meta tags, only about those included in the robots.txt file.
-
Hi
Great responses Matt and Ben, thanks!! Only things I could add are;
Webmaster Tools
- Check google webmaster tools every few days for the first 2-3 weeks.
- You may turn up some 404s or other types of errors that should be corrected.
- And keep your eyes out for any other warnings
Analytics
- You're going to spike your traffic (potentially, hopefully) in analytics big time, or at least skew the data
- Use filters and advanced segments to separate blog traffic so you can still analyze things even after a potential spike in blog search traffic.
- At minimum make an annotation of the date you made it indexable.
Dates
- Regarding the dates, I did come across this recently - I have not tested, so please take it with a grain of salt - removing dates from the SERPs - I would only recommend trying it if the content was not "time sensitive" (like a cooking recipe).
Hope all this helps!
-Dan
-
Thanks for the clarification Ben. I think I'll leave older posts as is. They've been actively posting several times a week, so there should be enough fresh content. My hope is that Google recognizes the age of the blog because it's my understanding that age factors in the ranking algorithm.
-
Ahh yeah my bad, ignore that bit. I think you'd still want to make a subtle change to each post so WordPress can set the date updated flag on the sitemap to today, that way Google will put a higher priority on the content when indexing your site.
-
Thanks, the site maps are a good idea. Ben, I'm not sure what you mean about making the content different to what Google has in its index. Because of the meta tag, it doesn't have any content in its index, right?
-
You've done the most important step (removing the noindex/nofollow) tags. The only additional thing I would do is submit (or resubmit) the XML sitemap to Google. Make sure that XML sitemap is perfect and error free so that you don't create any additional errors.
Google should be smart enough to recognize the dates. I've never had a situation where it was years between publish and index. I have however had situations where it was days or weeks in between publish and index and in those situations Google has recognize the date. I'd imagine the same is true here (assuming of course, you have the date in a recognizable format and don't change the date to today).
I'd be curious to find out what happens. Definitely update this Q&A when you find out what happens!
-
I would probably re-arrange some of the paragraphs (or add some more content) to the old posts and update them in WordPress, this then makes the content different to what Google has in its index.
I would then use the Yoast WordPress SEO plugin to regenerate your sitemap. Since you've updated and added new content to the posts their last updated date would have changed so Google will probably see this as revised content. I would submit to all major search engines as your first port of call.
In terms of the "link juice", I would say that Google will still count links to the article as a ranking factor, but because you have noindex the content wont appear in search results. So the content will have a fairly good page rank (possibly) but its being held back by the exclusion of the search engine index.
Now that the setting has been changed and the sitemap / content has been updated you should start to see the results in the search results in due time.
You could also add a few new articles of content to the blog and publicise that over social media to help get back in the game a bit quicker.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved Using NoIndex Tag instead of 410 Gone Code on Discontinued products?
Hello everyone, I am very new to SEO and I wanted to get some input & second opinions on a workaround I am planning to implement on our Shopify store. Any suggestions, thoughts, or insight you have are welcome & appreciated! For those who aren't aware, Shopify as a platform doesn't allow us to send a 410 Gone Code/Error under any circumstance. When you delete or archive a product/page, it becomes unavailable on the storefront. Unfortunately, the only thing Shopify natively allows me to do is set up a 301 redirect. So when we are forced to discontinue a product, customers currently get a 404 error when trying to go to that old URL. My planned workaround is to automatically detect when a product has been discontinued and add the NoIndex meta tag to the product page. The product page will stay up but be unavailable for purchase. I am also adjusting the LD+JSON to list the products availability as Discontinued instead of InStock/OutOfStock.
Technical SEO | | BakeryTech
Then I let the page sit for a few months so that crawlers have a chance to recrawl and remove the page from their indexes. I think that is how that works?
Once 3 or 6 months have passed, I plan on archiving the product followed by setting up a 301 redirect pointing to our internal search results page. The redirect will send the to search with a query aimed towards similar products. That should prevent people with open tabs, bookmarks and direct links to that page from receiving a 404 error. I do have Google Search Console setup and integrated with our site, but manually telling google to remove a page obviously only impacts their index. Will this work the way I think it will?
Will search engines remove the page from their indexes if I add the NoIndex meta tag after they have already been index?
Is there a better way I should implement this? P.S. For those wondering why I am not disallowing the page URL to the Robots.txt, Shopify won't allow me to call collection or product data from within the template that assembles the Robots.txt. So I can't automatically add product URLs to the list.0 -
"Noindex, follow" for thin pages?
Hey there Mozzers, I have a question regarding Thin pages. Unfortunately, we have Thin pages, almost empty to be honest. I have the idea to ask the dev team to do "noindex, follow" on these pages. What do you think? Has someone faced this situation before? Will appreciate your input!
Technical SEO | | Europarl_SEO_Team0 -
Robots.txt & meta noindex--site still shows up on Google Search
I have set up my robots.txt like this: User-agent: *
Technical SEO | | RoxBrock
Disallow: / and I have this meta tag in my on a Wordpress site, set up with SEO Yoast name="robots" content="noindex,follow"/> I did "Fetch as Google" on my Google Search Console My website is still showing up in the search results and it says this: "A description for this result is not available because of this site's robots.txt" This site has not shown up for years and now it is ranking above my site that I want to rank for this keyword. How do I get Google to ignore this site? This seems really weird and I'm confused how a site with little content, that has not been updated for years can rank higher than a site that is constantly updated and improved.1 -
All images are noindex will opening this at once be an issue?
Hi, All images are noindex will opening this at once be an issue? Not sure how a few months ago all my images were set as noindex which i realized last week. We have 20K images which were indexed fine but now when i check Site:sitename it shows 10 or 12 and the inspect element via Chrome i see the noindex is set for all images. We have been renaming the images and adding ALT tags for most of them and would it be an issue if we change the noindex in one shot or should we do them few at a time? Thanks
Technical SEO | | mtthompsons0 -
What is better for SEO, keeping a current blog or creating a new one?
We are working with a client to build a new website. They are asking if we should keep their existing blog or create a new one. If they keep the current blog, should we move it so we can incorporate it into their new site or keep it as a separate URL? Or, should we build the new site into the current blog––they are using Wordpress, so we would be able to build out the blog to incorporate the new website.
Technical SEO | | thinkcreativegroup0 -
Noindex Pages indexed
I'm having problem that gogole is index my search results pages even though i have added the "noindex" metatag. Is the best thing to block the robot from crawling that file using robots.txt?
Technical SEO | | Tedred0 -
Redirecting blog.<mydomain>.com to www.<mydomain>.com\blog</mydomain></mydomain>
This is more of a technical question than pure SEO per se, but I am guessing that some folks here may have covered this and so I would appreciate any questions. I am moving from a WordPress.com-based blog (hosted on WordPress) to a WordPress installation on my own server (as suggested by folks in another thread here). As part of this I want to move from the format blog.<mydomain>.com to www.mydomain.com\blog. I have installed WordPress on my server and have imported posts from the hosted site to my own server. How should I manage the transition from first format to the second? I have a bunch of links on Facebook, etc that refer to URLs of the blog..com format so it's important that I redirect.</mydomain> I am running DotNetNuke/WordPress on my own IIS/ASP.Net servers. Thanks. Mark
Technical SEO | | MarkWill0 -
Seomoz is showing duplicate page content for my wordpress blog
Hi Everyone, My seomoz crawl diagnostics is indicating that I have duplicate content issues in the wordpress blog section of my site located at: http://www.cleversplash.com/blog/ What is the best strategy to deal with this? Is there a plugin that can resolve this? I really appreciate your help guys. Martin
Technical SEO | | RogersSEO0