Rel="canonical" for PFDs?
-
Hello there,
We have a lot of PDFs that seem to end up on other websites. I was wondering if there was a way to make sure that our website gets the credit/authority as the original creator. Besides linking directly from the PDF copy to our pages, is anyone aware of strategy for letting Google know that we are the original publishers?
I know search engines can index HTML versions of PDFs, so is there anyway to get them to index a rel="canonical" tag as well?
Thoughts/Ideas?
-
I stand corrected on that point.
Thank you Jassy for sharing the link. I was not aware Google made that change.
-
I'm not sure that statement about rel canonical only working within your own domain - if you have some test data/similar that shows this to be the case, I'd love to hear about it.
Matt Cutts specifically says that cross-domain rel canonical is supported, see: the webmaster video on: iwww.youtube.com/watch?v=zI6L2N4A0hA
-
Canonical tags are only effective within your domain. They have no value if someone else was to take your work and share it elsewhere.
A few things you can do to establish yourself as the original content creator:
-
publish it first on your site. Wait until you see your content in Google before actively distributing the pdf to others. This would be one indicator that can be used to demonstrate you are the original author.
-
as you shared, ensure there are links back to your site within the PDF. This would be another good indicator to Google that you are the content creator.
-
lock the PDF so changes cannot be made to the content.
-
Earlier today Google announced the new schema.org microdata offers an author tag so you can determine the original author. That system has been tested and is available to use now.
-
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Rel=Canonical For Landing Pages
We have PPC landing pages that are also ranking in organic search. We've decided to create new landing pages that have been improved to rank better in natural search. The PPC team however wants to use their original landing pages so we are unable to 301 these pages to the new pages being created. We need to block the old PPC pages from search. Any idea if we can use rel=canonical? The difference between old PPC page and new landing page is much more content to support keyword targeting and provide value to users. Google says it's OK to use rel=canonical if pages are similar but not sure if this applies to us. The old PPC pages have 1 paragraph of content followed by featured products for sale. The new pages have 4-5 paragraphs of content and many more products for sale. The other option would be to add meta noindex to the old PPC landing pages. Curious as to what you guys think. Thanks.
Technical SEO | | SoulSurfer80 -
Will this URL structure: "domain.com/s/content-title" cause problems?
Hey all, We have a new in-house built too for building content. The problem is it inserts a letter directly after the domain automatically. The content we build with these pages aren't all related, so we could end up with a bunch of urls like this: domain.com/s/some-calculator
Technical SEO | | joshuaboyd
domain.com/s/some-infographic
domain.com/s/some-long-form-blog-post
domain.com/s/some-product-page Could this cause any significant issues down the line?0 -
"Yet-to-be-translated" Duplicate Content: is rel='canonical' the answer?
Hi All, We have a partially internationalized site, some pages are translated while others have yet to be translated. Right now, when a page has not yet been translated we add an English-language page at the url https://our-website/:language/page-name and add a bar for users to the top of the page that simply says "Sorry, this page has not yet been translated". This is best for our users, but unfortunately it creates duplicate content, as we re-publish our English-language content a second time under a different url. When we have untranslated (i.e. duplicate) content I believe the best thing we can do is add which points to the English page. However here's my concern: someday we _will_translate/localize these pages, and therefore someday these links will _not _have duplicate content. I'm concerned that a long time of having rel='canonical' on these urls, if we suddenly change this, that these "recently translated, no longer pointing to cannonical='english' pages" will not be indexed properly. Is this a valid concern?
Technical SEO | | VectrLabs0 -
Is "Above the Fold Content" still a thing?
Many of our pages have the textual content stuffed at the bottom of the page because the manager doesn't think anybody reads it and it is an eyesore to have at the top: http://www.stevinsontoyotawest.com/schedule-service For some light reading here is Google’s official blog talking about content quality:
Technical SEO | | MEllsworth
http://googlewebmastercentral.blogspot.co.uk/2012/01/page-layout-algorithm-improvement.html This references Ads vs Content showing above the fold. However, in our case it has to do with images vs ads and stuffing text at the bottom of pages. Here is a bit of heavier reading. You can do a quick search for "Fold" to see their interpretation.
http://macedynamics.com/research/content-quality-score/ I understand that images are still content, however hardly any of the images have Alt text and they are not even named with keywords so Google really can't distinguish what the page is about through images alone. I'm not about to go through the entire site and add Alt text and rename images because I have much more to do on my plate. So, the questions is: Is stuffing content at the bottom of the page, below all images/inventory/widgets ok to do or should we stick with the eyesore content at the top of the page? Thoughts?0 -
If I want clean up my URLs and take the "www.site.com/page.html" and make it "www.site.com/page" do I need a redirect?
If I want clean up my URLs and take the "www.site.com/page.html" and make it "www.site.com/page" do I need a redirect? If this scenario requires a 301 redirect no matter what, I might as well update the URL to be a little more keyword rich for the page while I'm at it. However, since these pages are ranking well I'd rather not lose any authority in the process and keep the URL just stripped of the ".html" (if that's possible). Thanks for you help! [edited for formatting]
Technical SEO | | Booj0 -
Jigoshop "add to cart" producing 302 redirects
Hi, My site is throwing thousands of 302 redirect warnings on crawl for the add to cart process in my Wordpress/Jigshop online store. A sample url the crawl references is: | | https://morrowsnuts.com/product/the-best-of-the-best-8-oz/?add-to-cart=6117&_n=9773652185 | | I have read several other threads here that are similar in nature but haven't discovered a way to eliminate this. I am a store owner and with only partial technology skills and I don't know what to try next. I posted the problem with Jigoshop but I am not sure if they will provide a solution since this was the first time they heard of this. The site is Morrow's Nut House located at: https://morrowsnuts.com Thanks in advance for any direction or suggestions for me on next steps, John
Technical SEO | | MorrowsCandyMan0 -
Is it OK for a sitemap to appear as a "Top URL" in Google Webmaster?
I'm using Google Webmaster (alongside other tools) to understand how Google is indexing my site. One of the tools is "Content Keywords", where it lists keywords that Google sees as significant for your site. The keywords shown are generally fine, but when I click on an individual word, I am often seeing our sitemap as one of the "Top URLs" that the keyword is found on (our sitemap is at system/sitemap1.xml.gz) - is this OK? Obviously I don't want to add the sitemap URL to robots.txt, but I also want to ensure that 'real' user-focused pages (e.g. our homepage) appear higher in the "Top URLs" list for the keywords, as I'm assuming this is an indicator of how the site is performing in search. Any help appreciated!
Technical SEO | | anilababla0 -
Rel canonical to dissimilar pages
Is there a penalty for implementing a rel canonical between to pages that don't have the same content? I was told that you should avoid using a rel canonical if the pages score lower than 50-60 on the Similar Page Checker: http://www.webconfs.com/similar-page-checker.php The overall theme of the pages I am considering this for are similar but the actual content is different.
Technical SEO | | ryanwats0