PDF's - Dupe Content
-
Hi
I have some pdfs linked to from a page with little content. Hence thinking best to extract the copy from the pdf and have on-page as body text, and the pdf will still be linked too. Will this count as dupe content ?
Or is it best to use a pdf plugin so page opens pdf automatically and hence gives page content that way ?
Cheers
Dan
-
Should be different, but you would have to look at them to make sure.
-
ps - is a pdf to html coverter different from a plugin that loads the pdf as an open page when you click it ? or same thing ?
-
That is what I was going to suggest - setting up a canonical in the http header of the PDF back to the article
https://support.google.com/webmasters/answer/139394?hl=en
As another option, you can just block access to the PDFs to keep them out of the index as well.
-
thanks Chris
yes you can canonicalise the pdf to the html (according to the comments of that article i just linked to anyway)
-
Hi Dan,
Yes PDFs are crawlable (sorry for confusion!) if you were to put it into say a .zip or .rar (or similar) it wouldn't be crawled or you could no index the link i guess. You would need to stick the PDF (download) behind some thing that couldn't be crawled. You could try rel= canonical but I've never tried it with a PDF so i'm not sure how that would go.
Hope that enlightens you a bit.
-
Thanks Chris although i thought PDFS were crawlable??: http://www.lunametrics.com/blog/2013/01/10/seo-pdfs/
Hence why im worried about dupe content if use content of pdf as body text too OR are you saying should no-follow the link to the pdf if use its content as body text because it is considered dupe content in that scenario ?
Ideally i want both - the copy on it used as body text copy on page and the pdf a linkable download, or page as embed of open pdf via a plugin.
-
What would give the user the best experience is the really question,I would;d say put it on page then if the user is lacking a plugin they can still read it, if you have it as a downloadable PDF is shouldn't be able to get crawled and thus avoiding the problem.
Hope that helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content with tagging and categories
Hello, Moz is showing that a site has duplicate content - which appears to be because of tags and categories. It is a relatively new site, with only a few blog publications so far. This means that the same articles are displayed under a number of different tags and categories... Is this something I should worry about, or just wait until I have more content? The 'tag' and 'category' pages are not really pages I would expect or aim for anyone to find in google results anyway. Would be glad to here any advice / opinions on this Thanks!
On-Page Optimization | | wearehappymedia1 -
Would a free PDF download diminish SEO benefits of HTML content?
Dear readers, This post is a duplicate of one I just put up. Sorry about that. If you are interested in commenting or seeing other responses, please go to http://moz.com/community/q/would-a-free-pdf-download-diminish-seo-benefits-of-html-content. Thanks. Hello, I am doing SEO for a company that, as a sideline business, sells four books written by the principals; the content is directly relevant to the company's primary business focus. Book sales are a tiny fraction of our overall revenue, and we don't expect that to change, although we will continue to sell the books. In addition to selling them, we have decided to convert the books to HTML and post them for free on our website (laid out by chapter and section). The hope is that this will result in goodwill, links, traffic, and ultimately improved search rankings. My question: Would offering free PDF downloads of the books (in addition to posting the HTML content) diminish the SEO benefits of the HTML content? If we don't offer the PDF option, people would have to visit our site to read the content (unless they bought a hard copy). If visitors were able to download a free PDF, they wouldn't need to return to our site to read it. If our corporate clients (nearly all of our clients are corporations) could download a PDF, they could then post it on an intranet instead of posting a link to our site. In general, do you think a visitor would be less likely to link to our site if he or she were able to download the PDF? Or would the appeal of the PDF option make it more likely that people would visit and link to the site? Also, if we offer the PDF option, are there any SEO issues related to duplicate content? Finally, if we did offer the free PDF download, would you recommend that we ask for an email address before giving the PDF? Thank you very much!
On-Page Optimization | | nyc-seo0 -
How to view all 'followed internal links' on a page
I am trying to view all the followed internal links on a few pages of my website. The MOZ toolbar just gives me the total number of internal followed links. What is the best way to actually see all the internal links that are followed by the google bot from any particular page? Thanks in advance.
On-Page Optimization | | rjchugh0 -
How do I cure 'overly dynamic' url's on an e-commerce website?
I've just launched an e-commerce website selling hosiery and have received aa report from SEO Moz regarding overly dynamic URL's. How do I resolve this issue - in words of one syllable please, I'm new to SEO! Here are three exapmles of over 120: http://www.yosassy.com/index.php?route=product/category&path=1&page=2 http://www.yosassy.com/index.php?route=product/product&filter_tag=&page=1&product_id=57 http://www.yosassy.com/index.php?route=product/product&filter_tag=&page=1&product_id=64 Thank you.
On-Page Optimization | | lindsayjhopkins0 -
Content Update
Hello, If I update the existing content i.e.I added some content to the already existing indexed content in a post,how will it effect SEO wise? Venkee
On-Page Optimization | | Venkee0 -
Would adding a line break tag into the product name affect SEO ranking and Google's ability to read the entire title?
Our client would like to include a link break so that part of the product name always showed up on a second line. Would this affect how Google bots crawl the product name? Would it also affect how Google would show the product name in a search result page? Thanks!
On-Page Optimization | | BrandLabs0 -
Duplicate Content
What I can do to avoid the duplicate content on the index and in the categorys, I cant block my categorys, cause are pages with big autorithy, so what i can do ?
On-Page Optimization | | nafera20 -
Value of PDF's in SEO
I have a client who has a lot of information in PDF form. They think they should move some of it over into HTML pages so it indexes better. Is there a benefit to converting these PDF's into HTML pages? It seems to me that HTML pages would be good, IF they are relevant pages that could be used online.
On-Page Optimization | | lvstrickland0