A suggestion to help with linkscape crawling and data processing
-
Since you guys are understandably struggling with crawling and processing the sheer number of URLs and links, I came up with this idea:
In a similar way to how SETI@Home (is that still a thing? Google says yes: http://setiathome.ssl.berkeley.edu/) works, could SEOmoz use distributed computing amongst SEO moz users to help with the data processing? Would people be happy to offer up their idle processor time and (optionally) internet connections to get more accurate, broader data?
Are there enough users of the data to make distributed computing worthwhile?
Perhaps those who crunched the most data each month could receive moz points or a free month of Pro.
I have submitted this as a suggestion here:
http://seomoz.zendesk.com/entries/20458998-crowd-source-linkscape-data-processing-and-crawling-in-a-similar-way-to-seti-home -
Sean - I share Rand' sentiments, thanks so much for the suggestion!
We have considered distributed crawling in the past (or even distributed rank checking because then it would be in that user's locale) but there are a whole different set of challenges. For example, you have to handle all the edge cases: what if a user's computer isn't on, or loses connectivity, what if we crawl too fast and the user gets blocked from a site, how do you write all that data securely?
Of course all of these concerns can be overcome, but right now we feel like we have a good handle on the problems, and it will be much faster for us to just fix what we have
Although, I know all of us are so appreciative of the ideas and support, and we will have something really great soon!
-
Thanks a ton Sean! We have considered distributed computing as a way to help crawl, index, process, etc. It's so flattering and humbling to hear that you'd be willing to help out and that the community would, too
For now, we believe we can get to the index size/quality/freshness using our hosted system, but the engineering team will certainly be encouraged to hear that folks in our community might contribute to this. Distributed systems present their own challenges, and we'd have to write that code from scratch, but if we find that we can't do what we want with our existing network, we might reach out.
BTW - I wanted to let folks know that the team here does feel very confident that come December/January, we're going to be producing indices that reach exceptional quality bars. The problems we face are largely known, and we now have the team and the solutions to tackle it, so we're pretty excited.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Keywords Data Tool: Why is volume metrics unavailable for all of my keywords?
When I use the SEOMoz Pro Tool for Keyword Resarch, I get the notice that the tool is getting improvements. But when I run my keywords all of the volume metric data is unavailable. Why is this?
Moz Pro | | seocoppercupimages0 -
Linkscape API Sort calls
I'm using Linkscape API plugin from SEO Gadget. It's all fine but I can't find any documentation on other 'sort' calls. The only request that I knows that works is "domains_linking_page". Any idea where this information is? I was hoping for a list of all the available requests I can and more particularly I want to sort it by Internal Linking Domains.
Moz Pro | | iProspect-397560 -
Is there a way you can determine the time SEOMoz crawls your website
I'm a new user to SEOMoz Pro, and having created a number of campaigns I wanted to know if you can set / schedule when SEOMoz crawls your website? Ideally I want to prevent SEOMoz from crawling my site at certain times, but I can't seem to find a way of doing this. Thanks in advance.
Moz Pro | | Simon_Glanville0 -
Summarize your question.Is it possible to request another unscheduled crawl?
We have just sorted a couple of issues on the website which threw the crawl into spasm and gave us hundreds of hugely long URLs. We are pretty sure that we have corrected this and do not want to wait another week to check what SEOMOZ comes up with. Is there anyway that we can request a special crawl of the website so that we can hopefully just be left any legitimate remaining issues?
Moz Pro | | dmckenzie4560 -
Who wants to help go over my crawl diagnostics via skype?
I have run a crawl diagnostic on my site and have 194 errors and most of them are 404 errors in wordpress. Not sure why, but many of my pages had name changes (possibly a permalinks issue) but I have no idea how to fix it. I had 5 duplicate page titles, and 1 tile missing or empty. 72 crawl notices found (2 permanent redirect, 17 blocked by robots, 53 rel canonical) 19 Crawl warnings were found Who wants to have some fun?
Moz Pro | | starkSEO0 -
Exporting Twitter and FB data in report
Hi Been with this tool for a few days now and enjoying it so far. I do have one query though. In the campaigns section we have various tabs of data, including Social. However while all the other data is exportable in the created report, social is not available to add to the custom reports. Why is this? When I click on the social tab I can download it in CSV but it would be good to be able to export the charts in pdf as per the other analysis data. Would make it much easier when sharing reports with clients. Are there any plans to make the social metrics addable to the custom reports one can create?
Moz Pro | | GrumpyCarl0 -
Drop in Number of Crawled pages by SEOMOZ?
I noticed that the number of Crawled Pages on my website has been 2 pages only over past week. Before that the number of crawled pages was over 1000. My site has numerous pages as it is a Travel website that pulls search results for Flights, Cars, Hotels, Cruises and Vacation packages so there is a huge Database there. Can someone help? Thanks !
Moz Pro | | sherohass0 -
Linkscape Update In Feb?
I have a site i just built and it has well over 2k backlinks, i seen on the schedule that linkscape was updated already. But my site still has authority of 1. Also my pagerank increased.
Moz Pro | | antoniow1870