Only one internal Equity Passing Link
-
Our web site IYBI is reporting only one internal equity passing link. I've somehow never noticed this and now that we are doing a lot more competitor analysis, I'm a little concerned given the numbers some of the other sites in the space are getting. I'm not sure I understand it completely and how it's possible we only have one. Any help would be appreciated.
-
Hello!
We do have another bot for our Mozscape Index (Open Site Explorer) which is dotbot
We do find ourselves blocked by a hosting provider at least once a week but mostly as a result of miscommunication between marketers working with clients with multiple web developers. One or two of them may not be aware of a crawler being setup to crawl a site and will see it as malicious behavior, thus causing our bot to be blocked. This is mostly seen from larger companies.
-
Hi David,
I've since spoken to our host at WP Engine and they confirmed they were blocking rogerbot. Is dotbot another crawler from MOZ I need to make an exception for?
Seems INSANE to me that our host would just block your bots for no reason and without notification. Do you run into this often? Any thoughts on why?
-
These pages can be found under the top pages tab in OSE: http://moz.com/researchtools/ose/pages?page=1&site=http%3A%2F%2Fifyoubuildit.com.au
The pages appear to not like our crawler dotbot when trying to request an HTTP response
curl -A "dotbot" http://ifyoubuildit.com.au/branding/
<title>403 Forbidden</title>
<center>
403 Forbidden
</center>
<center>nginx/1.2.9 WPEngine/6.0.7</center>
-
Isn't a 403 a permissions error? It seems impossible to me that some of our high ranking pages are returning this error and we have not noticed before. Where are you seeing this info?
-
Wow. Seems crazy. Would something like that effect our ranking as well?
-
Hello!
It appears the site returns a 403 HTTP response for many of your top pages which contribute to internal links to your home page. Some we are not able to reach at all and others return the 403 response. So you would want to check with your hosting provider and web developer to make sure the chmod permissions are the same which is the most common issues we've seen for these errors.
Check the permission set for this page http://ifyoubuildit.com.au/2014/07/golden-age-cinema-bar/ which we can reach and compare it to one we cannot such as http://ifyoubuildit.com.au/branding
Hope this helps!
-
Okay thanks for the info Erin. I guess my concern is that our site has been around for years. It's not new and it's the one metric that doesn't seem to make any sense.
Are you able to look at our site and tell me if there are any crawl issues?
-
Hi there! Thanks for reaching out. My name is Erin, and I'm on the Moz Help Team. I appologize that it's taken us so long to get back to you, things have been a little hectic around here!
I know this might look a little wonky in OSE, but it's just because of the way we build our index. It's totally possible that you have more internal equity passing links, but that not all of them are in our index yet. Most new sites and links will be indexed by our spiders and available in Mozscape and Open Site Explorer within 60 days, but some take even longer for many of reasons, including the crawl-ability of sites, the amount of inbound links to them, and the depth of pages in subdirectories.
Just so you know, here's how we compile our index:
- We grab the most recent index.
- We take the top 10 billion URLs with the highest MozRank (with a fixed limit on some of the larger domains).
- We start crawling from the top down until we've crawled 90,000,000,000 pages (which is about 35% the amount in Google's index).
Therefore, if the site is not linked to by one of these seed URLs (or one of the URLs linked to by them in the next update) then it won't show up in our index. Sorry!
We update our Mozscape Index every 4 weeks. Crawling the entire Internet to look for links takes 2-3 weeks, but our crawlers are always collecting data. When we need to put the index together, we grab all the data they have collected and start processing which can take up to 3 weeks to determine which of those links are the most important. You can see our most recently updated schedule here: http://moz.com/products/api/updates
Mozscape focuses on a breadth-first approach. Therefore we almost always have content from the homepage of websites, externally linked-to pages, and pages higher up in a site's information hierarchy. However, deep pages that are buried beneath many layers of navigation are sometimes missed and it may be several index updates before we catch all of these.
If our crawlers or data sources are blocked from reaching those URLs, they may not be included in our index (though links that point to those pages will still be available). Finally, the URLs seen by Mozscape must be linked-to by other documents on the web or our index will not include them.
Whew! Sorry for the long answer, I just wanted to be as thorough as possible. I hope this helps, and if you have any other questions, just shoot us an email at [email protected].
Cheers!
Erin
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to find lost internal links
I've been seeing a large decline in internal followed links over the past couple of months and am trying to figure out the cause. My developer says there have been no changes to the site that would cause this. Is there a way to find what internal links have been lost?
Link Explorer | | rgibson1002 -
Links to non-exisiting pages
Hi, I have tons of incoming links to target pages on my website that do not exist. If you follow the link you get a 404 error. The anchor text is always a "spammy" non related text. Does anyone know what is happening and how to get rid or block these links? Thanks, Dirk
Link Explorer | | FoodJEt0 -
Question about link report
Hey all, SEO noob here, back with another question. I downloaded a link report, and I don't understand the columns Links to Page, Outbound Domains from Page, and Outbound Links from Page. Is Links to Page the number of links from the source page to the target page? Is Outbound Domains from Page the number of domains linked to by the source page? Is Outbound Links from Page the number of outbound links to any domain or page from the source page? Thanks in advance! AK
Link Explorer | | AndyKubrin0 -
Internal Pages marked as Spam by Moz
Hello,
Link Explorer | | Umesh-Chandra
We use Moz for our website,Open site explorer marks many of the internal pages of the website as spam. Even pages such as careers, privacy policy etc. Please let us know why is this the case ? Also does these pages have impact on overall rankings of the website ?, If yes, What should we do about it ? Please clarify. Thanks.0 -
What happened to my links in Open Site Explorer.
Hello, A while ago I had a problem with open site explorer saying that I had a problems with redirects on my home page. There was 302 redirect and it was giving me information from my www domain rather then from non www domain. With your awesome help, this issue is sorted now and when I type in my non www domain, it doesn't give me that weird message anymore. I have a different issue now. For some reason when I enter non www domain it doesn't show links from www version. I presume that this is affecting my site authority for non www version. Does it mean that link juice was not passed on to non www version from www version? Authority dropped by two points, but I'm not sure why. I'm more interested to know why it doesn't show my links from www version on non www version. And is that affecting my sites rankings. Thank you, Regards, Armands
Link Explorer | | A_Fotografy0 -
Identifying Recently Added Links?
Is there a way to see specifically which links were added in my campaign? I noticed that this week we have 133 new links that were added to out website, yet we lost 1 link on the root level. So I am a little confused where that is coming from. I wanted to see if I could see where. When I look at the recent inbound links in the OpenWeb explorer there isn't anything listed. Thanks!
Link Explorer | | HashtagHustler0 -
Links from no-indexed pages in Open Site Explorer
I just looked up a site in Open Site Explorer. After putting in the site, I get a list of domains, links etc which is very useful. At the bottom is a note which I have pasted below. It mentions "24 links from no-indexed pages" and says we can't get any info on these. Is there a way to see what the 24 links are so I can investigate further?"Why does my number of inbounds links listed not match my total links count? The listed number may be slightly different because we only show 25 links per domain (so that you see a variety). Additionally you have 24 links from no-indexed pages. Unfortunately, we can't get info for those links."
Link Explorer | | EdKim0 -
Is there an efficient way to use Open Site Explorer to find unnatural or harmful links
We have a new client with 8 sites that we would like to 301 to a new site. However, before doing so we want to make sure that the backlinks are not unnatural or harmful in any way. Using open site explorer (other than just looking at exact anchor text vs brand anchor text ratio) is there a way to determine low quality inbound sites linking in? or any type of links google will find manipulative?
Link Explorer | | Bryan_Loconto0