Help with Roger finding phantom links
-
It Monday and Roger has done another crawl and now I have a couple of issues:
- I have two pages showing 404->302 or 500 because these links do not exist. I have to fix the 500 but the 404 is trapped correctly.
http://www.oznappies.com/nappies.faq & http://www.oznappies.com/store/value-packs/\
The issue is when I do a site scan there is no anchor text that contains these links. So, what I would like to find out is where is Roger finding them. I cannot see any where in the Crawl Report that tells me where the origin of these links is.
- I also created a blog on Tumblr and now every tag and rss feed entry is producing a duplicate content error in the crawl stats. I cannot see anywhere in Tumblr to fix this issue.
Any Ideas?
-
Thanks again Ryan, you have been very helpful answering al lot of my questions.
-
Someone else asked the same question regarding tag pages yesterday. I would suggest asking a separate Q&A on that topic.
Tag pages & forum category pages are both often used as containers. They don't have any content except links to articles. I would ask for feedback as to the best practice. I suspect noindex, following those pages would be best, but I don't have the experience to feel comfortable offering that advice.
-
I have been looking at the data that Roger is reporting for the duplicate content and in ALL cases there is either a 301 or a NoIndex. So now I do not know why Roger is reporting them as a duplicate, robots should not see the second entry.
-
I did not think of looking at the csv report. I see it now thanks Ryan. There should be a soft 404 handler in place to process the bad urls, I will have to see why it is not working.
With tumblr, I was looking for an easy way to add a blog to the site.
The RSS is coming from tumblr as is all the content.
When we specify Tags in tumblr it creates urls e.g. mypage.com/article/tag1 mypage.com/article/tag2 mypage.com/article/tag3 which all contain the content of mypage.com/article with out a canonical to the original. It is a really strange non-seo friendly approach, and so I wondered if anyone had similar problems.
-
The crawl report offers a "referrer" field. That field offers where Roger found the offending link. In my experience that field has always been accurate.
When I try to access www.oznappies.com/faq I receive a 302 redirect and a 500 error. I would recommend adjusting non-existant pages to a soft 404 page. Still provide a 404 response to browsers, but offer users a friendly way to find information (i.e. links / search) and stay on your site.
A great example of a soft 404 page is http://www.orangecoat.com/a-404-page.html
For the Tumblr issue, I am not clear on the problem. Are you writing content and publishing on both the oznappies.com site and your tumblr site? Then this content is being published again on your site via a RSS import?
-
I removed the links and just left the text so these will cut and paste now. It confuses me where Roger found the links.
Thanks for running the Xenu scan. I have tried other site scanner and come up blank.
-
That second link is anchored to the wrong place.
Regardless I also cannot find the .faq page. I just ran Xenu over it to see what it could find, but no broken links showed up.
Afraid I don't use Tumblr either, so eh, pretty useless post. Sorry.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
SEO Help in LA
Hi All- We're new to the MOZ community and plan to use the tools to help our site. We sell graphic design retail products globally. We're looking for a recommendation for a MOZ pro in Los Angeles (ideally) to help us build out our campaign(s) and get us started with the MOZ tools. Any of you in LA? Any of you have a rec for a great resource close to us? Thanks so much, Scott
Moz Pro | | freshs0 -
Why is my internal followed links is 0?
My internal followed links is 0. Also for top pages the report say "block by robots.txt" but the Moz bot is able to crawl my site and generate report for 10K pages. Please help me understand why. http://www.opensiteexplorer.org/comparisons?site=www.findyogi.com
Moz Pro | | namansr0 -
Sudden increase in on-page links
Within one week, the site crawl shows my on-page links going from ~10 to ~100+ Does anyone know of a rational reason for this? Sucuri shows my site is clean, so I don't think it's a hack. 10 seems too low to begin with anyway. Any ideas? Thanks 🙂
Moz Pro | | pupstar0 -
Domain Authoriy/Link Authority - Link Building
At what point do you think that an inbound link becomes a bad or low quality link when it comes to the inbound links page domain Authority and page Authority. SO as they are scored 1-100 in each case. At what score would you attempt to get rid of such links.
Moz Pro | | askshopper0 -
How often does SEOMoz refresh their link analysis data?
I know that my link profile has changed within the last month, yet both SEOMoz campaign link analysis and Open Site Explorer have shown the exact same number of links for more than a month. Majestic SEO fluctuates as I would expect, but Moz data is unchanged.
Moz Pro | | Aggie0 -
SEO-Experts help answer my newbie questions!
Hello! Thanks for coming to my rescue; I really appreciate it. I am a newbie at SEO and I still haven't got the full picture of everything in my mind that covers linkbuiling, on page optimization, domain names, website creating, etc. I've really pretty much all the available guides on WarriorForum and BHW without much help on the topics above since most of the information was either provided by some person who didn't know what they were talking about, the information was outdated, or I didn't know what they meant. I have so many questions that I hope some SEO-Expert could answer my newbie questions so that I won't get penalized for my two sites that I currently own. One of them was slapped by Yahoo!, Bing, AOL, and most other search engines with the exception of Google (It's only a matter of time of course). The other site I own is in the making currently without much on-page optimization. Before I start doing some on-page depletion t to my two sites so they both get de-indexed and such because of keyword stuffing, fail link building, and other factors, I wanted to cover the problems I currently have about SEO and this site. 1) Using the SEO-MOZ 'keyword analysis tool', what is the ideal percentage difficulty a newbie like me could take easily with just a few links here and there including 5 posts/pages of great content? (I usually only write 5 posts that cover 500+ words each). 2) Do I need hosting to become successful or to get more on-page optimization for my websites? I currently use Go-Daddy forwarding with masking so that when my site shows up in google the content comes from blogger. 3) What is "Self Cannibalization"? I ALWAYS get that error for both my sites when I use the 'on page optimzation tool' from SEO-MOZ. 4) Using Blogger, What exactly are labels and does using this add any SEO value to my website? 5) I've read so many damn articles about ALT text and Title text for blogger; nothing explained what I put in it though. What am I supposed to put in it that will help me with my on page optimization? (Stuff like do I use spaces or dashes, do I put my keyword in there, how many characters should I not exceed, do I put one word or two words?) 6) On my website, I 'accidentally' didn't know that copy and pasting images from paint straight to blogger would be a bad idea because in the html I saw it was f'ed up because there was literally random characters everywhere for the file name. Since my site is a tutorial site, I have over 50+ images in all my posts combined that I've copied and pasted from paint.exe to the Blogger post. Should I reupload all of them to blogger or keep it the way it is? Will I be penalized for this or nothing would happen? Is there a benefit to fixing it? 7) For a new website that is less than 1 week old, when can I start building backlinks? If I can start building backlinks now, what is the ideal number per day that I should create? 😎 Does post every single page or post of content I have on my website to other Web 2.0s like Squidoo, Wordpress, or Blog penalize my website for duplicate content or what? If I do this, does it give me any SEO benefit? 9) How much stronger is a .gov backlink than a .edu backlink? How much stronger are both of those compared to a regular extension backlink? 10) If I am building backlinks, should I link just my main home page URL or all internal URLs to? 11) I see several people that build backlinks to other websites by typing in a comment and throwing their link in there. Like question 10, should I post ALL the links to my website (Internal + Home) or just my main page link? 12) What is better; .com, .net, or .org? What about .info, .us, and .biz in terms of SEO benefit? 13) I have several pages on my site in which Google indexed them and I deleted the post. Now when I search my site on Google and I click on the link to the post, it is pretty much a 'dead link' because the post cannot be found. Does this harm my site in any way, and how can I take these dead pages off? 14) One of my sites is already indexed by Google but not Yahoo! What should I do to get my site indexed? It's already been about a week. Thanks for attempting to answers my questions. I don't have anything to give, but, I can choose 3 'good answers' for helping me!
Moz Pro | | 6786486312640 -
Why is Followed Linking Root Domains higher than External Followed Links?
Surely there must be at least one external link for each linking root domain? Some results for smaller sites give a higher number of domains linking in than incoming links - e.g. www.forbesandsawyer.co.uk Under Subdomain metrics: External Followed Links - 1 Followed Linking Root Domains - 2 Surely 2 root domains would mean AT LEAST 2 external followed links? Thanks, Andrew
Moz Pro | | Silktide0 -
Crawl Diagnostics finding pages that dont exist. Will Rel Canon Help?
I have recently set up a campaign for www.completeoffice.co.uk. Im the in-house developer there. When the crawl diagnostics completed, i went to check the results, and to my surprise, it had well over 100 missing or empty title tags. I then clicked it to see what pages, and nearly all the pages it say have missing or empty title tags, DO NOT EXIST. This has really confused me and need help figuring out how to solve this. Can anyone help? Attached image is a screen shot of some of the links it showed me on crawl diagnostics, nearly all of these do not exist. Will the relation Canonical tag in the head section of the actual pages help? For example, The actual page that exist is: www.completeoffice.co.uk/Products.php Whereas, when crawled it actually showed www.completeoffice.co.uk/Products/Products.php Will have the rel can tag in the header of the real products.php solve this?
Moz Pro | | CompleteOffice0