What is a good crawl budget?
-
Hi Community!
I am in the process of updating sitemaps and am trying to find a standard for what is considered a "strong" crawl budget. All the documentation I've found covers how to improve it or what to watch out for. However, I'm looking for a target to aim for (e.g., 60% of the sitemap crawled, 100%, etc.).
-
@blueprintmarketing I have a large website with WordPress image folders going back to 2009.
I am currently redesigning my website, and I am trying to determine if there is any benefit to trying to shrink down / delete those images and image folders which I am no longer using.
I really do not have time to go through all of those image folders and see which ones I am still using and which ones I am not. I am hoping this does not matter.
Does anyone here know if this matters when it comes to Google's Crawl Budget?
All of the images are completely optimized and crunched, however, my question is whether it would be worth the time investment to go through every single folder and thousands of images and try to delete the ones which are not being referenced on any of my pages?
Does anyone have a definitive answer regarding Crawl Budget?
-
Can you give some input on the site https://indiapincodes.net/? I have tried all the recommendations, but only 30% of the URLs have been indexed. I would appreciate your time.
-
@yaelslater
Unless you have a huge site (I'm talking about half a million to one million pages), I would not worry about Google's crawl budget anymore. However, if only 60% of the URLs in your XML sitemap are being indexed, make sure they are indexable URLs. If they are not, check the Coverage section of Search Console; it will tell you why a URL submitted via your XML sitemap was excluded (for example, noindex).
A recent study showed that about 20% of URLs across all the websites studied were not indexed for one reason or another. But make sure your XML sitemap contains only URLs that return a 200 status: no 301 or 302 redirects, no 404s, and no noindex/nofollow URLs, because Google will not put those into the index. If Search Console does not tell you the issue and you would like to share your domain with me, I'm sure I could figure it out.
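The sitemap check described above (only 200-status URLs, no redirects or broken pages) can be sketched as a small script. This is a minimal sketch, not part of any tool mentioned here; the function names are illustrative, and the status lookup is a placeholder for whatever HTTP client you use (e.g., a HEAD request that does not follow redirects):

```python
import xml.etree.ElementTree as ET

# Sitemaps declare this namespace, so parsed tags look like "{ns}loc".
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"

def extract_sitemap_urls(xml_text):
    """Return the list of <loc> URLs from a sitemap document."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc")]

def audit_sitemap(xml_text, fetch_status):
    """Group sitemap URLs by HTTP status.

    fetch_status is any callable returning the status code for a URL.
    Only 200s belong in a sitemap; redirect and broken entries should
    be removed or replaced with their final destination URLs.
    """
    report = {"ok": [], "redirect": [], "broken": []}
    for url in extract_sitemap_urls(xml_text):
        status = fetch_status(url)
        if status == 200:
            report["ok"].append(url)
        elif 300 <= status < 400:
            report["redirect"].append(url)
        else:
            report["broken"].append(url)
    return report
```

Anything that lands in the `redirect` or `broken` buckets is a candidate for removal from the sitemap before resubmitting it in Search Console.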
I don't know if you're using a CDN; if you could share a little more with me, especially the domain, I could be a lot more helpful.
You could also use a tool like Screaming Frog to generate a new sitemap and make sure that is not the issue. If you're using Yoast, you can turn its sitemap off and back on to create a new one.
Screaming Frog SEO Spider can generate a sitemap of up to 500 URLs for free (it is paid after that): https://www.screamingfrog.co.uk/xml-sitemap-generator/
Alternatively, you can generate over 1,000 URLs for free online; I would recommend https://www.sureoak.com/seo-tools/google-xml-sitemap-generator
However, please keep in mind that the sureoak site also offers things like a "keyword density checker," which makes me question the information they give out; keyword density is not something Google considers unless you use the same word for nearly every word in the document. It is one of those metrics that is simply not real.
But the XML sitemap generator works just fine. I hope this was of help,
Tom
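For reference, the XML that the generators above produce is simple enough to build yourself. A minimal sketch using only the Python standard library (the function name and URL list are illustrative, not from either tool):

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

def build_sitemap(urls):
    """Build a minimal sitemap.org-compliant XML document from a URL list."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url  # one <loc> per page
    body = ET.tostring(urlset, encoding="unicode")
    # Prepend the declaration so crawlers parse it as UTF-8 XML.
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body
```

Feeding this the list of canonical, 200-status URLs from a crawl gives you a clean sitemap to upload and submit in Search Console.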
Related Questions
-
When rogerbot tries to crawl my site it gets a 404. Why?
When rogerbot tries to crawl my site it tries http://website.com. My website then tries to redirect to http://www.website.com, throws a 404, and ends up not getting crawled. It also throws a 404 when trying to read my robots.txt file for some reason. We allow the rogerbot user agent, so I'm unsure what's happening here. Is there something weird going on when trying to access my site without the 'www' that is causing the 404? Any insight is helpful here. Thanks,
Technical SEO | BlakeBooth
Where are the crawled URLS in webmaster tools coming from?
When looking at the crawl errors in Webmaster Tools/Search Console, where is Google pulling these URLs from? Sitemap?
Technical SEO | SEOhughesm
Webmaster console: increase in the number of URLs we were blocked from crawling due to authorization permission errors
Hi guys, I received this warning in my webmaster console: "Google detected a significant increase in the number of URLs we were blocked from crawling due to authorization permission errors." So I went to the "Crawl Errors" section and found errors like these under the "Access denied" status: ?page_name=Cheap+Viagra+Gold+Online&id=471 ?page_name=Cheapest+Viagra+Us+Licensed+Pharmacies&id=1603 and many more URLs like these. Does anybody know what this is and where it comes from? Thanks in advance!
Technical SEO | odmsoft
Is it good to have a few outbound links to good authority sites?
My site has fewer than 3 outbound links (dofollow). Is it a good idea to give a couple of dofollow links to authoritative sites for domain authority purposes? Thanks guys!
Technical SEO | tm4615
Interesting Dilemma: Making A (Good) Content Scraper Rank
Greetings Moz'ers! I'm in the process of optimizing a non-profit's website that has an interesting dilemma: they lack the resources to create their own content, so they search the web for videos, quotes, blogs, pictures, etc. to post on their website. As the "good guys" their cause is just, but because they are essentially scraping content from other sites they are considered "bad guys" in Google's eyes and are getting hit with duplicate content penalties. What they are doing isn't much different from what the major news sites do; they just want to be a good resource center. The content they are using is relevant to the cause, and they give credit on everything they post by linking to where the content came from. Any recommendations on how I can help them? Thanks, Jason
Technical SEO | DragonSearch
Could somebody suggest a GOOD WordPress XML sitemap generator?
We have been putzing around with Google XML Sitemaps Generator (a plug-in on WordPress) for our WordPress blog and we cannot get it to write an XML sitemap! Could somebody suggest a viable alternative that actually works? Thank you for your help! Jay
Technical SEO | theideapeople
Where can I find a good definition of "link juice"?
I have heard the term link juice being used in many different contexts. Where can I find a good definition for it?
Technical SEO | casper434
Crawling image folders / crawl allowance
We recently removed /img and /imgp from our robots.txt file, thus allowing googlebot to crawl our image folders. Not sure why we had these blocked in the first place, but we opened them up in response to an email from Google Product Search about not being able to crawl images, which can (and has) hurt our traffic from Google Shopping. My question is: will allowing Google to crawl our image files eat up our 'crawl allowance'? We wouldn't want Google to skip crawling/indexing certain pages, and ding our organic traffic, because more of our allotted crawl bandwidth is getting chewed up crawling image files. Outside of the non-detailed crawl stat graphs from Webmaster Tools, what's the best way to check how frequently/deeply our site is getting crawled? Thanks all!
Technical SEO | evoNick