Tool which checks cache date of pages?
-
Does anyone know of a tool which can check the cache date of each page of a site?
i can get each page of the site into a .csv or xml file
-
HI Wildner
Thanks for the ideas, i was thinking along these lines. Thanks for your input!
-
I think you will have to write your own application. Google would provide such a tool if they wanted to...
Here is the idea: one one side, take the xml-sitemap, on the other side you have goolge query cache:domain/path. Now you have to write a php code and combine these elements. In the response you'll get from Google, you will have to find the part with the date. For example the answer is: "This is Google's cache of http://www.seomoz.org/. It is a snapshot of the page as it appeared on 18 May 2011 09:06:09 GMT. The current page could have changed in the meantime. Learn more"
And dont't forget to incorporate a sort of timeout. Google doesn't like those queries...
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What will happen if all our website content has the date created amended to the migration date?
HI, We will be migrating all our website content soon to a new CMS and at the moment the
Technical SEO | | alzheimerssoc1 -
Should I deindex my pages?
I recently changed the URLs on a website to make them tidier and easier to follow. I put 301s in place to direct all the previous page names to the new ones. However, I didn't read moz's guide which says I should leave the old sitemap online for a few weeks afterwards. As I result, webmaster tools is showing duplicate page titles (which means duplicate pages) for the old versions of the pages I have renamed. Since the old versions are no longer on the sitemap, google can no longer access them to find the 301s I have put in place. Is this a problem that will fix itself over time or is there a way to quicken up the process? I could use webmaster tools to remove these old urls, but I'm not sure if this is recommended. Alternatively, I could try and recreate the old sitemap, but this would take a lot of time.
Technical SEO | | maxweb0 -
Pages not being cached have a negative effect?
Hi all! I look after a website where it's been discovered a section of the website has the noarchive robots meta tag active on it causing it to not get cached but has been indexed. Out of curiosity has anyone seen any negative effects from Google for having pages that aren't cached? It's not the strongest section on the website so makes it tricky to judge myself but interested if anyone had any thoughts on the matter. Cheers,
Technical SEO | | thisisOllie0 -
Duplicate pages
Hi Can anyone tell me why SEO MOZ thinks these paes are duplicates when they're clearly not? Thanks very much Kate http://www.katetooncopywriter.com.au/how-to-be-a-freelance-copywriter/picture-1-58/ http://www.katetooncopywriter.com.au/portfolio/clients/other/ http://www.katetooncopywriter.com.au/portfolio/clients/travel/ http://www.katetooncopywriter.com.au/webservices/what-i-do/blog-copywriter/
Technical SEO | | ToonyWoony0 -
Indexed pages and current pages - Big difference?
Our website shows ~22k pages in the sitemap but ~56k are showing indexed on Google through the "site:" command. Firstly, how much attention should we paying to the discrepancy? If we should be worried what's the best way to find the cause of the difference? The domain canonical is set so can't really figure out if we've got a problem or not?
Technical SEO | | Nathan.Smith0 -
Different links to to the same page
Hi, Based on the user's actions we post activity into users Facebook timeline. And each activity has link back to our particular page on our website. For example if original page was: www.Domain.com from Facebook timeline it would be like this: www.Domain.com?Ffb_action_ids=101508953168 Do you think this will have a negative effect on our page rankings as we will eded up having a lot of different URL's to the same page? www.Domain.com?Ffb_action_ids=101508953168 www.Domain.com?Ffb_action_ids=456788765609 etc.. Thank you, Karen Bdoyan
Technical SEO | | showme0 -
Duplicate Pages Issue
I noticed a problem and I was wondering if anyone knows how to fix it. I was a sitemap for 1oxygen.com, a site that has around 50 pages. The sitemap generator come back with over a 2000 pages. Here is two of the results: http://www.1oxygen.com/portableconcentrators/portableconcentrators/portableconcentrators/services/rentals.htm
Technical SEO | | chuck-layton
http://www.1oxygen.com/portableconcentrators/portableconcentrators/1oxygen/portableconcentrators/portableconcentrators/portableconcentrators/oxusportableconcentrator.htm These are actaully pages somehow. In my FTP there in the first /portableconentrators/ folder there is about 12 html documents and no other folders. It looks like it is creating a page for every possible folder combination. I have no idea why you those pages above actually work, help please???0 -
Missing Cache - very strange
Anyone have experience with a cache going missing from a page that had a cache in the past? We’re overhauling a page and noticed the cache was gone from Google results. We don’t know if this event is good/bad/doesn’t matter but I am curious why this happens. I am positive the cache was missing before we updated the page today because a programmer mentioned they did try checking for a cache for a historical load time prior to today for a different project. I have attached two screenshots to illustrate two things: 1) What google delivers for a cache: of the page instead of the normal cache page 2) Even though you can see a cache of any of the indexed pages we have from a serp, the cached link is missing in serps for the mentioned page Has anyone seen this before? thanks! IhnAf SL8ax
Technical SEO | | CouponCactus0