Crawl test. Bot crawled only 200 or so links when it should have crawled thousands
-
Hi everyone,
I just recieved my crawl test report and its only given me 200 or so URL's when my site has thousands, any thoughts?
-
Hi Ryan,
I am the site owner and this is the precise reason im trying to take matters into my own hands.
<meta name="keywords" content="E60,Rear,lamp,set" /> I see what you mean, because this is actually ridiculous, not quite sure how it got into this state either. Whats that saying, "if you want something done you have to do it yourself" Looks like i have to take a crash course in SEO to sort it all out. Thanks very much for all your help.
-
I realize you may not have full control over the site. What I would share is:
"That's how the site is" is not an acceptable response, unless the site owner is satisfied with their current SEO ranking.
The keywords have NOTHING to do with the product being displayed on the page. The link I offered is for a Hella e60 Rear Lamp. The only related in the keyword section is "rear". I am quite certain that is by coincidence.
Your keywords are not dynamically generated to vary with the pages content, nor were they manually altered to fit the pages content. The keyword selection is awful. The numbers "3", "5", and "7" are listed as 3 of the key words.
I want to help you, so don't take this the wrong way. The best thing about that site is it probably qualifies as a textbook case of what NOT to do from a SEO perspective. Perhaps you can appeal to a SEO company to use the site in a case study and turn it around.
-
Thank you very much Ryan, the columns on two sides of the page cant be helped as thats, how the site is, only the central content changes. However the duplicate keywords are for the products themselves, for example i sell 50 different BMW oil filters. Theres not much i can do about duplicating keywords as all of the products are very very similar.
I think you might be right about the site redesign....
-
A few notes about your site:
-
you are using meta keyords in your header. It offers no benefit and I would suggest removing it. It's not related to your inquiry but is something I noticed.
-
your site has a 50 keyword TAG block with the same keywords on every page. This isn't good from a SEO perspective on many levels. You want your keywords to focus the unique content on a given page
-you site pages are likely viewed as all duplicates. I can recognize the main item in the center of the page changes, but would a crawler? Your left and right sidebars are identical on all pages, along with most of your header. The actual content you offer is only a small percentage of the total page.
The large image of the various car parts is not considered as part of the content, aside from the ALT tag.
Look at a random page from your site: http://www.incarmotorfactors.co.uk/content/16-hella-bmw-e60-rear-lamp-set
According to the Analyze tool there are 5975 words on the page. I estimate about 100 of them are unique words addressing your Rear Lamp product, and the remaining 5800+ words are exactly the same as every other product page.
A crawler will see your pages as 98% duplicated data and the result will likely be your site isn't going to be listed. I would recommend a site re-design. Before taking that advice, it would probably be best to hear from others who have a lot more SEO expertise then myself.
-
-
-
What is different about your site? Is it flash or javascript based? Can you share your site URL?
-
Hi Ryan,
I used
On-Page Optimization Tools: Crawl Test. However this problem may be deeper than i first thought, as SEOMoz is not able to read any of my site info properly.
Open site explorer cant read it
Linkscape cant read the links. Crawl test isnt read properly, however my server and robots.txt files are fine, theres no blocking attempts from the server. Very strange.
-
What Crawl Test tool did you use?
Depending on the crawl tool, it may not look at content blocked from your Robots.txt file. You may want to ensure it is configured correctly.
Are there any permission issues? The crawler will look at your site the way a guest would. Any content which requires users to log in would be hidden to the crawler.
Are there any other issues regarding your site's accessibility? Connection or firewall issues? Could a server admin have seen a server performance issue and kicked the crawler before it finished? You can check server logs for this information.
If you check everything and do not locate a definitive cause, I would suggest running the crawl once more and checking the results before pursuing the matter further.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Crawl Diagnostics
My site was crawled last night and found 10,000 errors due to a Robot.txt change implemented last week in between Moz crawls. This is obviously very bad so we have corrected it this morning. We do not want to wait until next Monday (6 days) to see if the fix has worked. How do we force a Moz crawl now? Thanks
Moz Pro | | Studio330 -
Crawl Diagnostics - Historical Summary
As we've been fixing errors on our website, the crawl diagnostic graphs have been showing great results (top left to bottom right for errors). The problem is the graphs themselves aren't very pretty. I can't use them in my internal reports (all internal reports are standardised colours/formats). Is there anyway of exporting the top level summary with historic data so the graphs can be recreated in company colours? I don't want the detailed CSV breakdown of what errors occurred, but rather than on X date there were Y errors, the next month Z errors and so forth. The data must already be in the SEOMoz system in order to create the graphs themselves - I was hoping this can be made available to us if it isn't already? Does anyone know if there is already a way of doing this? I've tried to 'inspect element' and find the underlying data in the source code but to no avail, and can't see any exports that would do this. Thanks in advance Dean
Moz Pro | | FashionLux0 -
Difference Between equity passing and follow links
Hi, I am recently seeing two more new options in the Opensiteexplorer filter options. equity passing links
Moz Pro | | Dexjj
non equity passing links
nofollow
dofollow What is the difference between an equity passing links and dofollow links. Can you guys help me.4 -
Re: Competitive Link Comparison
In Competitive Link Comparison Top 5 contenders... why would the landing page have an HTTP Status showing as Blocked by robots.txt when it is not blocked within the robots.txt file and no files are shown as blocked in Google's webmaster tools. Sorr if I've ticked the incorrect topic categories
Moz Pro | | Hornblower0 -
Inbound Links To Deleted Pages
Hi, I recently deleted some pages from my website and believe that there will be external inbound links pointing to these pages. I would like to find them and put redirects in place - can anybody tell me how to use SEOMOZ to find where external links are poiting to moved/deleted pages Thanks
Moz Pro | | stayin1 -
Issue in number of pages crawled
i wanted to figure out how our friend Roger Bot works. On the first crawl of one of my large sites, the number of pages crawled stopped at 10000 (due to the restriction on the pro account). However after a few weeks, the number of pages crawled went down to about 5500. This number seemed to be a more accurate count of the pages on our site. Today, it seems that Roger Bot has completed another crawl and the number is up to 10000 again. I know there has been no downtime on our site, and the items that we fixed on our site did not reduce or increase the number of pages we had. Just making sure there are no known issues with Roger Bot before I look deeper into our site to see if there is an issue. Thanks!
Moz Pro | | cchhita0 -
SEOmoz bot and "noindex"
As a recent newbie to SEOmoz, I've been implementing some suggestions and doing a general tidy up. I removed URL's from our robots txt, and rolled out instead the noindex meta tag to pages we don't want indexed. But surprised to see issues that are now flagged from the last crawl by the moz bot from pages that have this meta tag? Does the SEOmoz bot not ignore this tag? Just want to make sure I've implemented it correctly, so the google bot does ignore it. Meta tag syntax is and is placed below the title tag. cheers Steve
Moz Pro | | sjr4x40 -
Why doesn't the BBB / Trustlink.org links show up in the Link Analysis?
I am curious why one of my client's main competitors (www.allbayhardwood.com) shows links from the Better Business Bureau and Trustlink.org (associated with BBB) but links from those sources do not show up for his domain (www.sanjosehardwoodfloors.com). He has been a BBB Acredited Business since 12/2010 and on file with them for probably as long as they have had the online version, which seems like plenty of time for the link to have been picked up. BBB has a very nice domain authority and it would be great to see these links show up. (they don't show up in webmaster tools either) Is there something I am missing? Thanks in advance guys and gals! (I know the site has other SEO issues - just getting started on pounding everything out.)
Moz Pro | | SnoBaer0