Is there a way to prevent Google Alerts from picking up old press releases?
-
I have a client who wants a lot of old press releases (PDFs) added to their news page, but they don't want these to show up in Google Alerts. Is there a way for me to prevent this?
-
Thanks for the post, Keri.
Yep, the OCR option would make the image approach to hiding the PDFs moot.
-
Harder, but certainly not impossible. I had Google Alerts come up on scanned PDF copies of newsletters from the 1980s and 1990s that were images.
The files recently moved and aren't showing up for the query, but I did see something else interesting. When I went to view one of the newsletters (https://docs.google.com/file/d/0B2S0WP3ixBdTVWg3RmFadF91ek0/edit?pli=1), it said "extracting text" for a few moments, then had a search box where I could search the document. On the fly, Google was doing OCR work and seemed decently accurate in the couple of tests I had done. There's a whole bunch of these newsletters at http://www.modelwarshipcombat.com/howto.shtml#hullbusters if you want to mess around with it at all.
-
Well, that is how to exclude them from an alert that they set up themselves, but I think they are talking about anyone who might set up an alert that finds the PDFs.
One other idea I had that I think may help: if you set up the PDFs as images instead of text, it would be harder for Google to "read" the PDFs and therefore to catalog them properly for the alert. But this would have the same net effect as not having the PDFs in the index at all.
Danielle, my other question would be: why do they give a crap about Google Alerts specifically? There have been all kinds of issues with the service, and if someone is really interested in finding out info on the company, there are other ways to monitor a website than Google Alerts. I used to use services that simply monitor a page (say, the news release page) and let me know when it is updated. This was often faster than Google Alerts, and I would find stuff on a page before others who only used Google Alerts. I think they are being kind of myopic about the whole approach, and blocking for Google Alerts may not help them as much as they think. Way more people simply search on Google than use Alerts.
-
The easiest thing to do in this situation would be to add negative keywords or advanced operators to your Google Alert that prevent the new pages from triggering the alert. You can do this by adding advanced operators that exclude an exact-match phrase, a file type, the client's domain, or just a specific directory. If all the new PDF files will be in the same directory or share a common URL structure, you can exclude them using the "-inurl:" operator.
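To make the operators above concrete, here is a minimal sketch in Python that assembles an alert query with exclusions. The helper name `build_alert_query`, the company name "Example Corp", and the directory name "press-archive" are all made up for illustration, and it assumes Google Alerts honors the same exclusion operators as regular web search, which is worth testing on a real alert.

```python
def build_alert_query(terms, exclude_phrases=(), exclude_filetypes=(),
                      exclude_sites=(), exclude_inurl=()):
    """Join search terms with negative operators (hypothetical helper)."""
    parts = [terms]
    # A leading minus sign negates each operator.
    parts += [f'-"{phrase}"' for phrase in exclude_phrases]
    parts += [f"-filetype:{ext}" for ext in exclude_filetypes]
    parts += [f"-site:{site}" for site in exclude_sites]
    parts += [f"-inurl:{fragment}" for fragment in exclude_inurl]
    return " ".join(parts)


# Hypothetical example: skip PDFs and anything under a /press-archive/ path.
query = build_alert_query(
    '"Example Corp"',
    exclude_filetypes=["pdf"],
    exclude_inurl=["press-archive"],
)
print(query)  # "Example Corp" -filetype:pdf -inurl:press-archive
```

The resulting string is what you would paste into the alert's search box. Keep in mind this only filters alerts that you create yourself.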
-
That also presumes Google Alerts is anything near accurate. I've had it come up with things that have been on the web for years and for whatever reason, Google thinks they are new.
-
That was what I was thinking would have to be done... It's a little complicated why they don't want them showing up in Alerts. They do want them showing up on the web, just not as an Alert. I'll let them know they can't have it both ways!
-
Use robots.txt to exclude those files. Note that this blocks Google from crawling them at all, so they will generally not show up in regular searches either.
You need to ask your client why they are putting things on the web if they do not want them to be found. If they do not want them found, don't put them up on the web.
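For what the robots.txt exclusion might look like, here is a rough sketch that checks a rule with Python's standard `urllib.robotparser`. The directory `/news/archive/` and the domain `www.example.com` are assumptions for illustration; use whatever path the old PDFs actually live under.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block every crawler from the archive directory.
ROBOTS_TXT = """\
User-agent: *
Disallow: /news/archive/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# An old release inside the blocked directory should not be fetchable.
archive_ok = parser.can_fetch(
    "Googlebot", "https://www.example.com/news/archive/2009-release.pdf"
)
# The main news page stays open to crawlers.
news_ok = parser.can_fetch("Googlebot", "https://www.example.com/news/")

print(archive_ok, news_ok)  # False True
```

This only confirms how a well-behaved crawler would read the rule. It keeps the PDFs out of regular search as well as Alerts, which is the trade-off described above.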