Help with Roger finding phantom links
-
It's Monday and Roger has done another crawl, and now I have a couple of issues:
- I have two pages showing 404->302 or 500 because these links do not exist. I have to fix the 500 but the 404 is trapped correctly.
http://www.oznappies.com/nappies.faq & http://www.oznappies.com/store/value-packs/
The issue is that when I do a site scan, no anchor contains these links. So what I would like to find out is where Roger is finding them. I cannot see anywhere in the Crawl Report that tells me the origin of these links.
- I also created a blog on Tumblr, and now every tag and RSS feed entry is producing a duplicate content error in the crawl stats. I cannot see anywhere in Tumblr to fix this issue.
Any ideas?
-
Thanks again Ryan, you have been very helpful answering a lot of my questions.
-
Someone else asked the same question regarding tag pages yesterday. I would suggest asking a separate Q&A question on that topic.
Tag pages and forum category pages are both often used as containers. They don't have any content except links to articles. I would ask for feedback as to the best practice. I suspect applying "noindex, follow" to those pages would be best, but I don't have the experience to feel comfortable offering that advice.
-
I have been looking at the data that Roger is reporting for the duplicate content, and in ALL cases there is either a 301 or a NoIndex. So now I do not know why Roger is reporting them as duplicates; robots should not see the second entry.
-
I did not think of looking at the CSV report. I see it now, thanks Ryan. There should be a soft 404 handler in place to process the bad URLs; I will have to see why it is not working.
With Tumblr, I was looking for an easy way to add a blog to the site.
The RSS is coming from Tumblr, as is all the content.
When we specify tags in Tumblr, it creates URLs, e.g. mypage.com/article/tag1 mypage.com/article/tag2 mypage.com/article/tag3, which all contain the content of mypage.com/article without a canonical pointing to the original. It is a really strange, non-SEO-friendly approach, so I wondered if anyone had similar problems.
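One way to confirm whether a given tag URL really lacks a canonical is to look for a rel="canonical" link in its HTML. The following is only a rough sketch using Python's standard library; the function name find_canonical and the sample markup are made up for illustration, and fetching the page is left out.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Records the href of the last <link rel="canonical"> tag seen."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "link" and (attrs.get("rel") or "").lower() == "canonical":
            self.canonical = attrs.get("href")


def find_canonical(html):
    """Return the canonical URL declared in the HTML, or None if there is none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


# Hypothetical markup for a tag page, with and without a canonical.
with_canonical = '<html><head><link rel="canonical" href="http://mypage.com/article"></head></html>'
without_canonical = "<html><head><title>tag1</title></head></html>"

print(find_canonical(with_canonical))
print(find_canonical(without_canonical))
```

If the tag pages come back with None, that would match the duplicate content reports described above.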
-
The crawl report includes a "referrer" field. That field shows where Roger found the offending link. In my experience that field has always been accurate.
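If the CSV export is large, a few lines of Python can pull out the referrer for a specific URL. This is only a sketch: the column names "URL" and "Referrer" and the sample rows are assumptions for illustration, so check the header row of your own export and adjust them.

```python
import csv
import io


def find_referrers(csv_text, target_url, url_col="URL", referrer_col="Referrer"):
    """Return the referrer of every row whose URL matches target_url.

    The column names are assumptions; pass the ones used in your export.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    target = target_url.rstrip("/")
    return [row[referrer_col] for row in reader if row[url_col].rstrip("/") == target]


# Made-up sample data standing in for a crawl export.
sample = """URL,HTTP Status Code,Referrer
http://www.example.com/nappies.faq,404,http://www.example.com/blog/old-post
http://www.example.com/about,200,http://www.example.com/
"""

print(find_referrers(sample, "http://www.example.com/nappies.faq"))
```

For a real file, read it with open(path, newline="", encoding="utf-8").read() and pass the text in.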
When I try to access www.oznappies.com/faq I receive a 302 redirect and a 500 error. I would recommend serving a soft 404 page for nonexistent pages. Still return a 404 response to browsers, but offer users a friendly way to find information (e.g. links or search) and stay on your site.
A great example of a soft 404 page is http://www.orangecoat.com/a-404-page.html
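To make the idea concrete, here is a minimal sketch of a handler that keeps the 404 status while serving a friendly page, with no 302 and no 500. It uses Python's built-in http.server purely for illustration; your site will use its own platform's error handling, and KNOWN_PAGES, build_response and FriendlyNotFoundHandler are hypothetical names.

```python
from http.server import BaseHTTPRequestHandler

# Hypothetical set of pages that exist on the site.
KNOWN_PAGES = {"/": "<h1>Home</h1>", "/faq": "<h1>FAQ</h1>"}

NOT_FOUND_HTML = (
    "<h1>Sorry, we could not find that page</h1>"
    '<p>Try the <a href="/">home page</a> or the <a href="/faq">FAQ</a>.</p>'
)


def build_response(path):
    """Return (status_code, html) for a request path."""
    if path in KNOWN_PAGES:
        return 200, KNOWN_PAGES[path]
    # Unknown URL: friendly body, but the status stays 404.
    return 404, NOT_FOUND_HTML


class FriendlyNotFoundHandler(BaseHTTPRequestHandler):
    """Serves known pages and a helpful 404 page for everything else."""

    def do_GET(self):
        status, html = build_response(self.path)
        body = html.encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)
```

To try it locally, something like HTTPServer(("", 8000), FriendlyNotFoundHandler).serve_forever() (with HTTPServer imported from http.server) should do.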
For the Tumblr issue, I am not clear on the problem. Are you writing content and publishing it on both the oznappies.com site and your Tumblr site? And is this content then being published again on your site via an RSS import?
-
I removed the links and just left the text so these will cut and paste now. It confuses me where Roger found the links.
Thanks for running the Xenu scan. I have tried other site scanners and come up blank.
-
That second link is anchored to the wrong place.
Regardless, I also cannot find the .faq page. I just ran Xenu over the site to see what it could find, but no broken links showed up.
Afraid I don't use Tumblr either, so eh, pretty useless post. Sorry.