Help with Roger finding phantom links
-
It Monday and Roger has done another crawl and now I have a couple of issues:
- I have two pages showing 404->302 or 500 because these links do not exist. I have to fix the 500 but the 404 is trapped correctly.
http://www.oznappies.com/nappies.faq & http://www.oznappies.com/store/value-packs/\
The issue is when I do a site scan there is no anchor text that contains these links. So, what I would like to find out is where is Roger finding them. I cannot see any where in the Crawl Report that tells me where the origin of these links is.
- I also created a blog on Tumblr and now every tag and rss feed entry is producing a duplicate content error in the crawl stats. I cannot see anywhere in Tumblr to fix this issue.
Any Ideas?
-
Thanks again Ryan, you have been very helpful answering al lot of my questions.
-
Someone else asked the same question regarding tag pages yesterday. I would suggest asking a separate Q&A on that topic.
Tag pages & forum category pages are both often used as containers. They don't have any content except links to articles. I would ask for feedback as to the best practice. I suspect noindex, following those pages would be best, but I don't have the experience to feel comfortable offering that advice.
-
I have been looking at the data that Roger is reporting for the duplicate content and in ALL cases there is either a 301 or a NoIndex. So now I do not know why Roger is reporting them as a duplicate, robots should not see the second entry.
-
I did not think of looking at the csv report. I see it now thanks Ryan. There should be a soft 404 handler in place to process the bad urls, I will have to see why it is not working.
With tumblr, I was looking for an easy way to add a blog to the site.
The RSS is coming from tumblr as is all the content.
When we specify Tags in tumblr it creates urls e.g. mypage.com/article/tag1 mypage.com/article/tag2 mypage.com/article/tag3 which all contain the content of mypage.com/article with out a canonical to the original. It is a really strange non-seo friendly approach, and so I wondered if anyone had similar problems.
-
The crawl report offers a "referrer" field. That field offers where Roger found the offending link. In my experience that field has always been accurate.
When I try to access www.oznappies.com/faq I receive a 302 redirect and a 500 error. I would recommend adjusting non-existant pages to a soft 404 page. Still provide a 404 response to browsers, but offer users a friendly way to find information (i.e. links / search) and stay on your site.
A great example of a soft 404 page is http://www.orangecoat.com/a-404-page.html
For the Tumblr issue, I am not clear on the problem. Are you writing content and publishing on both the oznappies.com site and your tumblr site? Then this content is being published again on your site via a RSS import?
-
I removed the links and just left the text so these will cut and paste now. It confuses me where Roger found the links.
Thanks for running the Xenu scan. I have tried other site scanner and come up blank.
-
That second link is anchored to the wrong place.
Regardless I also cannot find the .faq page. I just ran Xenu over it to see what it could find, but no broken links showed up.
Afraid I don't use Tumblr either, so eh, pretty useless post. Sorry.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do you know good company for obtaining back links ?
Hi All Can anybody recommend a company which can produce ethical good quality links and a reasonable price? We need to try and get our rankings higher and if you've used a company I would appreciate if you could let me know etc Thank you Alan Whitby Holiday Cottages
Moz Pro | | alandavidson
http://www.endeavourcottage.co.uk/1 -
Why is Link Count smaller than Internal Links in Crawl Test report?
We recently ran the crawl test report and for most of our pages we are getting 1150 internal links but 40-50 as the link count. Why is there such a big disparity?
Moz Pro | | usdmseo0 -
Good tool to track external links from the website
I am in search of a tool that provides me links generating from my site to another site. Is there a software or tool that can scan the whole site and provide me what are the links of other sites in my site.
Moz Pro | | csfarnsworth0 -
No follow links also been reported in SEOmoz crawl diagnostics
Hi, Why does SEOmoz reports links which has been marked as 'nofollow'. I am getting 'Overly-Dynamic URL' reports on links which I have designated as nofollow which means Google will discount them. So why does SEOmoz still report them. Thanks.
Moz Pro | | malpani0 -
Open Site Explorer and link numbers
I know this question has been asked many times in this forum but I still can't work it out. Why does this link: http://www.opensiteexplorer.org/links?page=1&site=www.bookpal.com.au&sort=page_authority&filter=&source=external&target=subdomain&group=0 Which is showing all links, external, to pages "on this sub domain" show 1,935 external links but this link: http://www.opensiteexplorer.org/links?page=1&site=www.bookpal.com.au&sort=page_authority&filter=follow&source=external&target=subdomain&group=0 which is exactly the same but this time shoing followed + 301 links, says "showing 1 - 50 external links) but won't show the total links (and I know the mouse-over on the question mark says it's won't show the total links, but I don't understand why it can't show the total links when it could show the total links when I requested to see "all links" instead of just "followed+301" links.) but it actually lists 700 links (14 pages, 50 results each page). I know the link list is limited to 25 links per domain but then it means you can NEVER know the total link count unless you download the full report. This makes using OSE to know numbers of links (internal, external, or otherwise) impossible. And if anyone uses the API, why the API (external+follow) returns 1,451 links? I'm sure it's an ongoing issue with people trying to get their head around all of this and I've never really been able to. Any insight would be much appreciated!
Moz Pro | | eatyourveggies0 -
How do you check the outbound links of a site?
There are great tools like http://www.opensiteexplorer.org that will tell you all about the inbound links. What about the more basic and easier question: What outgoing links does this site have?
Moz Pro | | SkinLaboratory2 -
Open Site Explorer WAY Off in Terms of Link Profiles?
Hey, One of our websites is www.inspireeducation.net.au. I have noticed although tools like Raventools capture our links well, Open Site Explorer is doing a terrible job... For example the following page >>> http://www.inspireeducation.net.au/courses/training-and-assessment-courses/certificate-iv-in-training-and-assessment/ has many many more than 8 root domains linking, however Open Site Explorer only presents 8? We are finding the same problem for almost any page we review through Open Site. Does anyone have any idea why the numbers would be so out? The new links are NOT fresh links. Many are well-established (been there for years), and even many newer ones have been there for more that 60 days. I find the same thing when reviewing competitor sites.. is Open Site Explorer working properly at all at the moment?
Moz Pro | | love-seo-goodness0 -
Only 2 internal links in OpenSite Explorer?
In Open Site Explorer´s tab Full list of Metrics I get for the Page Specific Metrics only 2 Internal Followd Links for one of my websites domain URL. I have checked this metric for other websites and I get some surprising results. Most of the websites get an amount which seems logical take the size of the site into account. But there are a couple of sites more for which I get very low results like only 1 or 2 Internal Followed Links! This is strange because the sites do have at least more than a 100 internal pages which are all linking back to the domain and are indexed in Google. I have checked if there is something strange with the robots.txt or htaccess but I havent found anything. So I wonder if this a failure in Open Site Explorer or can there be any other explanation? Anybody with similar experience? aNp83
Moz Pro | | ceesie0