I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e. non PDF) will be a graphic rich article that is easy for the reader to go through. I plan to have the PDF have all of the same text, but it won't have as many graphics - it will look more like a scientific research article.
So, should I exclude the pdf from search engines so that it isn't duplicate content? Or does that even matter seeing as it is a duplicate of my own content? I want people to link to the main article, not the pdf.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking contacting webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but attach a copy of the pdf for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
PDF documents can accumulate pagerank and pass that pagerank though any links in the PDF document. (Be sure to place a few links to your website in the PDF. Because....pdf, .ppt, .xls and many other file times display in my google webmaster tools backlinks).
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
Its a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Hi, on SEO article submissions, do I only include the link to the page I am trying to promote or is it best practice to also include a link to home page or parent page?
Good day. I am writing articles for submission, I would just like some help with the page structure. Do I only include the link for the page that I would like to promote or is it advisable to include other page links, such as home page or the parent category too? Any help would be appreciated
Content Development | | thebedguy0 -
Do you know any website you can get in touch with bloggers?
I would like to get in touch with some good bloggers who would be happy to write a blog about my company services. Is there any website where is the list of blogs so you can get in touch with them or buy blog post on their blogs? Thanks Lukas
Content Development | | Lukas-ST0 -
Pages and categories with the same name?
I manage a wordpress based site that is needing to under go a site architecture overhaul. the site is christ.org and one of the problems is it has 89 pages but really only 4 are navigatable (not a word apparently). The site also has over 400 posts so categories and parent pages are both definitely needed. One option is I convert a lot of the pages into posts, but would that happen to break any links pointing to those pages turned posts? Or another option is to keep the pages and posts and create a bunch of subpages, then I would most likely end up with similarly named categories and top level pages. I would guess the name of the category needs to be unique from page titles right? And not just unique but very much differentiated than any page title (not posts but page titles). Maybe what I need to do is convert the pages that are not really unique into posts and put them in the category it fits with. And then keep those that are unique as top level pages. The architecture needs some serious work I think 🙂 Any help would be greatly appreciated.
Content Development | | ThridHour0 -
Gallary Pages
We have multiple Gallery Pages on a website and they are all being indexed as duplicate content. I am assuming it's because there's no content on those pages. So, it's picking up the pages header/footer navigation and considering it content. I am not sure what the best way is to deal with Gallery pages. I want the images to get indexed, but not sure how to do this if I need to set the gallery pages with the thumbnails on it to noindex. Would it be smart to set the pages to "noindex, follow" or "index, nofollow" or do you have any other suggestions?
Content Development | | cmaseattle0 -
Duplicate Pages Different Content
Will duplicate pages different content hurt rankings/seo E-commerce Site is plugin style with WYSIWYG editor allowing for full customization, all pages are setup with basic default content. Ive created custom pages with content/keywords to begin seo on them I have two pages www.domain.com/sports/hockey and www.domain.com/nhl-tickets The first url is default, with a single H1tag, + Default Meta+Title tags, the second is the content rich page, and structured properly, both of which show up on the site, should I block the first url from displaying at all? The reason I am asking is because ive also setup breadcrumb links, which makes all of these category url's accessible on the site, I cannot edit breadcrumb links, we can either have them there or remove them. Thank you Very Much!!
Content Development | | TP_Marketing0 -
How to handle product pages with similar information
We have thousands of product pages with similar information but differentiating variables such as length/width. Example: http://www.savvyboater.com/store/p/2100-Cover-for-V-Hull-Fishing-Boat-with-Side-Console-O-B-14-X-74-.aspx http://www.savvyboater.com/store/p/2101-Cover-for-V-Hull-Fishing-Boat-with-Side-Console-O-B-15-X-76-.aspx http://www.savvyboater.com/store/p/2102-Cover-for-V-Hull-Fishing-Boat-with-Side-Console-O-B-16-X-92-.aspx http://www.savvyboater.com/store/p/2103-Cover-for-V-Hull-Fishing-Boat-with-Side-Console-O-B-17-X-92-.aspx We built individual products instead of grouped products because we recommend specific part numbers for specific make, model, year boats through our finder tool. These pages have recently started showing up in SEOmoz as duplicate content and we are looking for solutions to solve it. We have considered creating a "parent page" that lists all sizes and then using a rel canonical on each individual page to tell google that the parent page is the preferred page. Any thoughts or other ideas on this?
Content Development | | ironpac0 -
Duplicate Text on Blog & Internal News Page
I have two places I post news for our company. Our blog - typically more informal posts
Content Development | | seo-hunter
mycompany.wordpress.com & Our news page - typically more newsworthy than the blog
mycompany.com/news My question is, It is okay to just copy the exact text from my wordpress blog and paste to my news area of my site and vice versa? Does this hurt ranking potential for either page?0 -
Wordpress Duplicate Pages/ URL's - Help !
Hi guys, I have been running SEOMoz for just over a month and slowly cleaning up one of my Wordpress Blogs. While going through the crawl reports I have noticed that I have duplicate pages showing on the crawl. For example, the main post would be; www.xxxxx.com/blog/post-title Then I see another URL which would be; **www.xxxx.com/blog/page/59 ** When I click on either URL it goes back to the actual post title URL. What's with these page URL's ? Isn't these two URL's showing duplicate content to the search engines ? Any suggestions would be greatly appreciated.
Content Development | | dcc0