I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e. non PDF) will be a graphic rich article that is easy for the reader to go through. I plan to have the PDF have all of the same text, but it won't have as many graphics - it will look more like a scientific research article.
So, should I exclude the pdf from search engines so that it isn't duplicate content? Or does that even matter seeing as it is a duplicate of my own content? I want people to link to the main article, not the pdf.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking contacting webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but attach a copy of the pdf for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
PDF documents can accumulate pagerank and pass that pagerank though any links in the PDF document. (Be sure to place a few links to your website in the PDF. Because....pdf, .ppt, .xls and many other file times display in my google webmaster tools backlinks).
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
Its a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Loads of Blog Search Results showing up in SERPs - What's the best way to remove?
Our client has a good number of results showing up in SERPs that are search results pages produced by Blog posts. Unfortunately all these results have exactly the same Title tag and it has nothing to do with the blog content which means they are unlikely to help us much. We can’t create a 301 redirect because there is no page to redirect. There is no blog page we can re=canonical to either. The content on these pages is a short list of blog posts by each author. They are not true “Author” pages that would have a URL structure like this: your company.com/author/joeblow Our plan is to use GWMT's URL removal tool to request remove of these pages. (and then try to stop new results from being created) We are doing this to get low-value content out of the SERP. Is there a better way to remove these search results? Any drawback in removing them in GWMTs? Thanks.
Content Development | | RosemaryB1 -
Can unreliable server hurt your serps?
Hi I have a small personal blog about food and wine that I recently moved from blogger to wordpress and it is currently being hosted by the company who moved the blog over from one system to the other. This week I've noticed an 80% drop in organic traffic thanks to losing pretty much every first page SERP, there's no messages in WMT, i dont pay for links, all that's on the site is original content about food and wine that I enjoy. I've never had a previous drop in ranking/traffic like it. The one thing I can say, is that the guy who took over the hosting is hosting it himself on his own server and the website has been down more times than I would consider reasonable, often for hours at a time (this is when I catch it and I don't check often). Would the site be penalised for this? If I move servers to a reliable co i normally use, how long will it take to recover?
Content Development | | xoffie0 -
Google Slower to Trust New Pages than One Year Ago?
It seems to me that Google is slower to trust (and rank) new pages today than in the past. I used to be able to put up a new page and it would go right to the top of a competitive SERP. For about the past year when I launch a new page it starts deep in the SERPs, sits there for a few weeks, then starts slowly moving up. These pages still eventually rank on the first page of Google - often at #1 or #2 after wikipedia or another strong site - but it can take a few months to get there, several months in a competitive SERP. These are not "hot news" topics where freshness is an important factor. Instead they are product pages or general information articles. Anybody else seeing this? [ Just stabbing in the dark here... I am wondering if Google is relying more on visitor behavior these days and the delay is while they collect data?... Just stabbing in the dark.]
Content Development | | EGOL0 -
Multiply pages of similair subject not showing up in serps?
Hi, I have a website with a lot of similair subjects. Its a website with around 1000s pages.
Content Development | | vulonl
For example this website is about "cars" so I wrote a pages about: Green classy cars
Green cars for the summer
Green car show 2013 What happens is that on the query of "Green car show". The page green classy cars shows up. I have the feeling that google takes one page about a similair subject and only put on in the serps. I checked this on multiply pages and this seems the case. If I search on exact "Green car show 2013" + MY URL, still then its shows indexed but only position 4. Places 1,2,3 shows again other pages of my website with similair subject. Now my feeling says the other pages have more authority and thats why they show up higher. But then...again now all the content Im adding it isnt showing up.. The last months I added around 300 pages and I did not got one visitor more daily and I have the feeling it is because it are all similair pages and google does not want to show them. My question is: Is their something I can still make them show up? Because they do have all 100% unique content and 100% unique images they only have similair subjects. or Is their some way I can tell Google that his are really different pages, so this would maybe help?0 -
How to optimize content pages with ecommerce?
Some content pages act as buyers guides for certain products for example Used Paddle Boards for Sale - http://www.islesurfboards.com/used-paddle-boards-for-sale.aspx this is a content page that gets huge amount of traffic and is pure content with no products on the page, but we also have a ecommerce section of the site that is Used Paddle Boards for Sale -http://www.islesurfboards.com/buy-used-paddle-boards-for-sale.aspx however this page just has a small paragraph and all the ecommerce product related to this section on the page. The content only page above gets all the traffic and rank and then they click over to the actual ecomm section wiht the products from a link on that page. Should i merge these two together so its just one page and put the content on the ecom page? If i do all the content with push the ecommerce products down which is not good so what does anyone recommend as a best practice? Also will this mess up the content pages rank is i merge them assuming i redirect? or Keep them seperate like i have with a content page regarding "used paddle boards for sale" and an ecommerce page that sells acutal "used paddle boards for sale"
Content Development | | isle_surf0 -
Correction Duplicate Page Title Problems for a Blog
EDITED: To just focus on the issue at hand. I am trying to figure out the SEO rules instead of just working on the content. Please bear with me. I am adept technically. I just do not know the rules of the SEO process or even some of the termology. So I’m trying to attack problems one at time. Today’s problem – **Duplicate Page Titles ** We evidently have thousands of Duplicate Page Titles. We are using Joomla 2.5 & Easyblog. Our sitemap is automated from XML Sitemap Easyblog takes the title of the sites and uses it for a name of the summary pages. We post 5 blog items per page and all the names are the same. http://www.OursiteName.com/?start=5 Page Title = Site Name http://www.OursiteName.com/?start=10 Page Title = Site Name A similar thing happens on the sorting by Author or Category etc etc. Basically non-duplicate pages are looking like duplicates. What is the best practice / approach? Using the Robot.txt or XML Sitemap to tell Google not to crawl these pages? Writing a script or edit the Easyblog code to edit the 2000 duplicate Page Titles? Other thoughts?
Content Development | | Romana0 -
Adding a picture page - Good or Bad?
I have a lot of cool pics that just did not quite make it on one of my pages. Not necessarily because I did not want to, but space reasons they just happened to lose out to another photo. What I was thinking was, maybe I can add like a gallery page? Possibly with links back to the pages that each photo was considered for? Would this be a decent idea or just a page deemed as having low quality/value and end up hurting my site. Or maybe you can add an idea that may make it work for me!
Content Development | | VictorVC0 -
Can someone define what a low quality blog is supposed to look like?
I know the recent Google update devalued a lot of low quality blogs, but i'm having a hard time understanding what can be considered low quality? My site is www.247VirtualAssistant.com and it wa sitting in the top 3 for all my keywords(virtual assistant, virtual assistants etc etc). Last month everything tanked and now on the 2nd and 3rd page for my keywords. I'm thinking this is because of a lot of my links got devalued but with my limited SEO knowledge, i'm having a hard time identifiyng these. Please help!
Content Development | | Shajan0