Google Rewriting PDF Titles
-
Has anyone else noticed Google rewriting the title of PDF documents?
-
Sure Wayne.
While a web page and a PDF differ in format, there is little difference in how Google handles the data. A crawler reads the text and processes it, and the document is then ranked and appears in search results. The same basic rules apply.
Here is an example:
-
Go to the following URL: http://centerforhealthysex.com/wp-content/uploads/. You can see this site allows the contents of this folder to be displayed (not a recommended practice).
-
Notice the first pdf file in the list: "alexandra-katehakis-biography.pdf"
-
Go to Google.com and search for the following without quotes: ".pdf site:centerforhealthysex.com". Notice the title shows as "download bio pdf - Center for Healthy Sex".
-
Return to Google.com and search for "alexandra katehakis biography". You will see the same file now has a title of "Alexandra Katehakis is a licensed Marriage, Family Therapist ...". In this case, Google grabbed the first line of text from the PDF and used it as the title, so the same file can show a different title depending on the query.
You can repeat this type of testing with almost any pdf or web page.
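To repeat the test on another site, the two searches above can be generated rather than typed. This is only a rough sketch: the helper names `search_url` and `pdf_title_test_urls` are made up for illustration, and it assumes the standard `https://www.google.com/search?q=...` URL form. It builds the URLs to open in a browser; it does not fetch or parse results.

```python
from urllib.parse import urlencode

# Assumed base URL for a plain Google web search.
GOOGLE_SEARCH = "https://www.google.com/search"


def search_url(query: str) -> str:
    """Return a Google search URL for the given query (hypothetical helper)."""
    return f"{GOOGLE_SEARCH}?{urlencode({'q': query})}"


def pdf_title_test_urls(domain: str, phrase: str) -> dict[str, str]:
    """Build the two searches used in the test above (hypothetical helper).

    site_query   -> lists PDFs on the domain using the site: operator
    phrase_query -> searches for words taken from the PDF's file name
    """
    return {
        "site_query": search_url(f".pdf site:{domain}"),
        "phrase_query": search_url(phrase),
    }


if __name__ == "__main__":
    urls = pdf_title_test_urls(
        "centerforhealthysex.com", "alexandra katehakis biography"
    )
    for name, url in urls.items():
        print(f"{name}: {url}")
```

Open both URLs and compare the title shown for the same PDF in each result list.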
-
-
Yes, I've seen it with web pages, but this is my first experience with PDFs. Anyone else seeing this?
-
Google reserves the right to change titles to whatever it feels is most appropriate for the user. A PDF document online is similar to a web page in that regard.