Any tools for scraping blogroll URLs from sites?
-
This question is entirely in the whitehat realm...
Let's say you've encountered a great blog - with a strong blogroll of 40 sites.
The 40-site blogroll is interesting to you for any number of reasons, from link building targets to simply subscribing in your feedreader. Right now, it's tedious to extract the URLs from the site. There are some "save all links" tools, but they are also messy.
Are there any good tools that will
a) allow you to grab the blogroll (only) of any site into a list of URLs (yeah, ok, it might not be perfect since some sites call it "sites I like" etc.)
b) same, but export as OPML so you can subscribe.
Thanks!
Scott
-
Not at all. I guess my feeling here is that there is a sort of untapped social graph defined by blogrolls. If it were simple to harvest them upon visiting a blog (e.g. this blogger recommends...) one could do a stumble-on-steroids approach to a niche.
-
I thought you might be able to use the outbound link scraper to grab the outbound link onto the page. Pop in your URLS of the pages you want to scrape and it will spit out our a list of those domaind and urls. You can take those urls and put them into the contact finder and it will return the contact details for those sites. Combine the two spreadsheets for an epiuc list of blogs to contact for your outreach.
This is obviously for link building rather than subscribing - sorry if I have misunderstood what you were trying to do
-
Hi Keri,
That is a very cool tool, but is overkill for this. It takes far too many steps to accomplish only part of the desired goal of grabbing all blogroll URLs (within the blogroll DIV tag) and exporting the list to a valid OMPL file or URL list.
thanks!
-
nothing I saw there would do this. It looks like it could manage to list all external links, and I suppose you could manually pick the blogroll out of it.
-
Hi there,
Well, Keris response reminded me of this question and the fact that I found a tool for scraping these kind of lists:
Here it is (with some other cool tools) , have fun:
-
Hi Scott,
I'm going through older questions. Did you ever find a tool to do what you wanted to do here?
-
One thing to look at is Outwit Hub for Firefox. It might be able to help with that. It can scrape data from a page and do a lot with it. http://www.outwit.com/products/hub/. Don't know that it meets all of your needs, but I also haven't seen a response with anything better at the moment.
-
Hey Scott,
What a great question and <sigh>I don't have the answer. I am going to back to find out what people come up with here. Surely there is someone that lurks these parts that can throw something together?</sigh>
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to fix overly dynamic URLs for Volusion site?
We're currently getting over 5439 pages with an 'overly dynamic URL' warning in our Moz scan. The site is run on Volusion. Is there a way to fix this seeming Volusion error?
Moz Pro | | Brandon_Clay0 -
Does SEOmoz have a tool to find mirror sites?
I heard from a company that is trying to get my clients SEO business that they discovered multiple sites mirroring our site's content. Does SEOmoz have a tool to find these websites? Or does Google?
Moz Pro | | thomas.wittine0 -
Does SEOmoz have a Keyword Research tool similar to, say, the Google AdWords tool or the WebCEO Keyword Research Tool? And where might that be? (Sorry, I'm very new to SEOmoz Pro.)
I'm looking for an SEOmoz version of the classic WebCEO Keyword Research that would give you effective suggestions based on a keyword inquiry. I've made the switch from WebCEO, but I'm trying to find something similar to that Keyword Research tool. Am I going to just need to use the Google AdWords tool for this function or does SEOmoz have it's own version?
Moz Pro | | SmokewagonKen0 -
MozRank in Open Site Explorer?
Hi, I wondered why mozRank is not showing in OSE? As this is the "equivalent" of Google's page rank? Thanks
Moz Pro | | CallieGunstinson0 -
Has the relevancy of SEOmoz tools disappeared?
I have A rankings for my on-site grades for my most important keywords. I have no Critical issues and no Warnings with my Crawl Diagnostics. Most of the Competiive Link analysis data shows my site beating out the competition. If all this is accurate, how can my SERPs continue to decrease and lesser pages with terrible optimization and backlinking be ranking higher? I even have a facebook page beating me in the results. If there is nothing left for me to address using SEOmoz, and I keep getting worse & results, doesn't it mean that the SEOmoz tools are not relevant to producing actual results? Or, am I missing something?
Moz Pro | | TOPYX0 -
It would be Great if their were further integrations with Google Webmaster Tools
It would be great if their were plans to integrate Google Webmaster Tools into the mix. Specifically the Errors section. I am currently working on a new Campaign where I am seeing a little bit of overlap, but Google is finding all sorts of different missing pages from 3 redesigns ago but also quite a few current ones. Currently in SEOmoz: 0 Errors, while Google is reporting 12 - 403 Errors for some content the client unpublished. While the addition of Google Analytics was a nice, it would be great to dig further into Webmaster tools and Analytics with features that discover errors and provide actionable next steps. Is anyone else seeing these discrepancies between SEOmoz and Google Webmaster Tools?
Moz Pro | | drewschug0 -
Adding LinkedIn to the new Social Media tool
I am loving the new Social Media data that SEOMoz recently added. I am sure more will come soon, but I wondering if they have plans of adding LinkedIn Company pages as apart of a campaign to track. Does anyone have the inside clue about this? Do you think it would be a good idea as well?
Moz Pro | | nextraq0 -
SEOMoz site crawlers created an issue for our servers
I have set up a number of campaigns with your pro tool. Unfortunately we have 7 sites on our server and our IT dept have said that we had an issue when your site crawlers visited for several sites at the same time - is there any way that I can retain the campaigns but have the sites crawled on request rather than automatically?
Moz Pro | | StephenALee0