Noindex, nofollow on a blog since 2009
-
Just reviewed a WordPress blog that was launched in 2009, but somehow the privacy setting was set to block indexing, so all this time there's been a noindex, nofollow meta tag in the header. The client couldn't figure out why masses of content weren't showing up in search results.
I've fixed the setting and assume Google will spider in short order; the blog is a subdirectory of their main site. My question is whether there is anything else I can or should do. Can Google recognize the age of the content, or that it once had a noindex meta tag? Will it "date" the blog as of today? Has the client lost out on untold benefits from the long history of content creation? I imagine that link juice from any backlinks to the blog will now flow back to the main site; think that's true?
Just curious what others might think of this scenario and whether any other action is warranted.
-
Thanks Dan. One thing I found interesting is that Google Webmaster Tools doesn't offer any alerts about pages that aren't indexed because of meta tags, only about those blocked by the robots.txt file.
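Since Webmaster Tools won't flag this, a small script can check a page's HTML for the tag directly. Here is a minimal sketch in Python using only the standard library. The function names and the sample markup are made up for illustration, and it only looks at meta tags, not at an X-Robots-Tag HTTP header.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots"> or <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute names arrive lowercased; values can be None for bare attributes.
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for token in attr.get("content", "").split(","):
                token = token.strip().lower()
                if token:
                    self.directives.add(token)


def robots_directives(html):
    """Return the set of robots directives found in the given HTML string."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


def is_blocked_from_index(html):
    """True if the page asks search engines not to index it ("none" implies noindex)."""
    directives = robots_directives(html)
    return "noindex" in directives or "none" in directives


# Hypothetical page head, similar to what the WordPress privacy setting produces.
sample = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
print(is_blocked_from_index(sample))
```

In practice you would feed it the HTML of a handful of blog URLs (homepage, a category page, a few posts) after changing the setting, to confirm the tag is really gone everywhere.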
-
Hi
Great responses Matt and Ben, thanks!! The only things I could add are:
Webmaster Tools
- Check Google Webmaster Tools every few days for the first 2-3 weeks.
- You may turn up some 404s or other types of errors that should be corrected.
- And keep your eyes out for any other warnings
Analytics
- You're going to spike your traffic (potentially, hopefully) in analytics big time, or at least skew the data
- Use filters and advanced segments to separate blog traffic so you can still analyze things even after a potential spike in blog search traffic.
- At minimum make an annotation of the date you made it indexable.
Dates
- Regarding the dates, I recently came across a technique for removing dates from the SERPs. I have not tested it, so please take it with a grain of salt, and I would only recommend trying it if the content is not "time sensitive" (like a cooking recipe).
Hope all this helps!
-Dan
-
Thanks for the clarification Ben. I think I'll leave older posts as is. They've been actively posting several times a week, so there should be enough fresh content. My hope is that Google recognizes the age of the blog, because it's my understanding that age factors into the ranking algorithm.
-
Ahh yeah my bad, ignore that bit. I think you'd still want to make a subtle change to each post so WordPress sets the last-modified date in the sitemap to today, which should encourage Google to give the content a higher priority when crawling your site.
-
Thanks, the sitemaps are a good idea. Ben, I'm not sure what you mean about making the content different to what Google has in its index. Because of the meta tag, it doesn't have any content in its index, right?
-
You've done the most important step (removing the noindex/nofollow tags). The only additional thing I would do is submit (or resubmit) the XML sitemap to Google. Make sure that XML sitemap is perfect and error free so that you don't create any additional errors.
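As a rough way to sanity-check the sitemap before submitting it, something like the following Python sketch could help. It uses only the standard library; the function name and the sample entries are hypothetical, and the date check is a simplification that only accepts values starting with YYYY-MM-DD (the sitemap protocol also allows shorter forms such as a bare year).

```python
import xml.etree.ElementTree as ET
from datetime import date

# Namespace defined by the sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def check_sitemap(xml_text):
    """Return a list of human-readable problems found in a <urlset> sitemap."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        return [f"not well-formed XML: {exc}"]

    problems = []
    for i, url in enumerate(root.findall("sm:url", NS), start=1):
        loc = (url.findtext("sm:loc", default="", namespaces=NS) or "").strip()
        if not loc.startswith(("http://", "https://")):
            problems.append(f"entry {i}: missing or relative <loc>")

        lastmod = (url.findtext("sm:lastmod", default="", namespaces=NS) or "").strip()
        if lastmod:
            try:
                # Simplified check: only the leading YYYY-MM-DD part is validated.
                date.fromisoformat(lastmod[:10])
            except ValueError:
                problems.append(f"entry {i}: unparseable <lastmod> {lastmod!r}")
    return problems


# Hypothetical sitemap: the first entry is fine, the second has two problems.
sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/blog/first-post/</loc><lastmod>2009-06-01</lastmod></url>
  <url><loc>/blog/second-post/</loc><lastmod>June 2010</lastmod></url>
</urlset>"""

for problem in check_sitemap(sample):
    print(problem)
```

This is not a full validator. It only catches malformed XML, non-absolute URLs, and dates it cannot parse, so Google's own sitemap report is still the final word.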
Google should be smart enough to recognize the dates. I've never had a situation where it was years between publish and index. I have, however, had situations where it was days or weeks between publish and index, and in those situations Google has recognized the date. I'd imagine the same is true here (assuming, of course, you have the date in a recognizable format and don't change the date to today).
I'd be curious to find out what happens. Definitely update this Q&A when you find out what happens!
-
I would probably re-arrange some of the paragraphs (or add some more content) to the old posts and update them in WordPress, this then makes the content different to what Google has in its index.
I would then use the Yoast WordPress SEO plugin to regenerate your sitemap. Since you've updated and added new content to the posts, their last updated date will have changed, so Google will probably see this as revised content. I would submit to all major search engines as your first port of call.
In terms of the "link juice", I would say that Google will still count links to the article as a ranking factor, but because you had noindex the content won't have appeared in search results. So the content will have fairly good PageRank (possibly), but it's been held back by its exclusion from the search engine index.
Now that the setting has been changed and the sitemap / content has been updated you should start to see the results in the search results in due time.
You could also add a few new articles of content to the blog and publicise that over social media to help get back in the game a bit quicker.