Help with site structure needed - any assistance welcomed!
-
Hi all,
I am currently tasked with finding a better way to optimise our website ukdocumentstorage dot com.
For starters, I would like to know what our site structure actually is at present. So I would like to be able to see which pages are linking to what at the moment & which pages have broken links on which I need to remove from the content. Hopefully I'd then be able to tidy up any errors that the site already has in its internal linking.
Is there a way to do this easily? Or to have a graphical representation of the sites structure?
I have just signed into our Webmaster Tools account and I am faced with a list of 10 'Crawl Errors' which are all 404 errors. Some of them do not actually exist anymore, but are still being linked to from a few pages according to WMT.
For example, /industries_served_legal.htm is still being linked to from 5 of our pages (including /industries_served_local_authority.htm)
However, this doesn't seem to be a case at all on the page as I can't find a link to /industries_served_legal.htm on /industries_served_local_authority.htm. Any advice as to why this is happening? Is there a way to find out easily where these broken links are situated on the page? And if I do actually manage to find our broken links, how would I go about removing them?
The page /document_security.htm doesn't exist in our Sitewizard list of pages anymore, yet still exists online. How do I go about deleting this unecessary page properly? And does this harm our rankings?
The document_security page also has an extra link on the top toolbar to a Document Management page, an addition which is no longer present on our up to date pages. Now this page (and the extra dropdown page when you hover over it) still exist on our list of Sitewizard pages at the moment, but we obviously no longer want to have these online anymore. How should I remove these?
I understand that this is a lot of information, and so I would appreciate any help that can be given on these!
Many thanks
-
Perfect sense thank you! I'll now research how to actually do this re-direct.
-
If this is an internal link on your website, you would want to change the actual path to point to the newer secure-document-storage page.
If this is an external link from another website, you'd create a redirect that will take the incoming request for the old document-security page and push the visitor to the new secure-document-storage page.
Make sense?
Mike
-
So even though the text is different, I should re-direct people clicking on the link to the old document-security to the newer secure-document-storage page?
-
Here is an example that may help:
You have the following pages on your site - /product1.html, /product2.html, and /product3.html.
An external site (externalsite.com) links to the product 2 page on your site (yoursite.com/product2.html).
You decide to no longer sell product 2, so your remove /product2.html from your website; however, externalsite.com is still linking to yoursite.com/product2.html. You see a 404 warning in Google Webmaster Tools referencing this error.
You then have two options:
-
You recently started selling product 4, which is not the same product, but still offers the same solution to a potential customer. You create a /product4.html page and set up a 301 redirect from externalsite.com to yoursite.com/product4.html.
-
You no longer sell this product or solutions like it, because it was not needed by visitors. There is a link from externalsite.com is no longer applicable to your site; therefore, you disregard the warning in Google Webmaster Tools and the link will eventually not be followed by Google.
Now, if the /product2.html page was still accessible online, but you no longer linked to it via yoursite.com, that is kind of a problem, because if externalsite.com is still linking there, visitors could stumble upon your old/outdated/not-used page. You do not need to actively worry about removing the link, but you should work on removing the page if it is no longer used.
Does that help and did I understand your question correctly?
Mike
-
-
Apologies for the overload!
So my take-way from this is that any pages that I have deleted but are still able to be found the internet (e.g. /document_security) I don't need to worry about actively trying to remove from the internet as it will be removed by Google automatically in the future? And having these pages still existing on the internet (despite not having any current links going to them from pages I haven't deleted) will not harm my site?
Thank you for all of your help so far!
-
To add to Mike's answer
2: If the page is deleted and isn't coming back you may want to 301 it to its new equivalent of possible even return a 410 a status code to tell search engines the pages has been permanently removed
For more info on Status codes see the following article
http://www.seomoz.org/learn-seo/http-status-codes -
Whoa! Information overload!!!
-
I don't know of anything that shows you a graphical representation of your site's linking structure; however, I do know of a program that will list out all of the linking pages on your site and the number of in and out links, including anchor text, etc. The number of in links can be an indicator of structurally how your site is organized.
-
404 errors or not bad as long as they are known. If you no longer have a page and you decide not to redirect from the old page to a new one, that is fine. Google is just giving you a heads up that your site or someone else's is linking to a non existent page. If you do nothing to fix these 404 errors, the page will eventually be removed from Google's index and not be a problem.
-
/document_security.htm looks like it is being linked to from /services_storage_fast_retrieval.htm and /services_archive_storage.htm
I would recommend downloading and installing Screaming Frog that is the program I was referencing in my response to #1 and that is how I found the issue in #3.
Seer Interactive also wrote a great blog on all of the things this tool can do.
Hope this helps.
Mike
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
We are migrating a site and are seeing alot of 301s and 302s already in the old site is it ok to leave those as is?
For the 3xx’s I’m not sure if it’s okay for us to redirect to these so please advise on that
Technical SEO | | lina_digital0 -
SEO question: Need help on rel="alternate" hreflang="x"
Hi all, we have webcontent in 3 languages (official belgian yellow pages), we use a separate domain per language, these are also our brands.
Technical SEO | | TruvoDirectories
ex. for the restaurant Wagamamahttp://www.goudengids.be/wagamama-antwerpen-2018/ corresponds to nl-be
http://www.pagesdor.be/wagamama-antwerpen-2018/ corresponds to fr-be
http://www.pagesdor.be/wagamama-antwerpen-2018/ corresponds to en-be The trouble is that sometimes I see the incorrect urls appearing when doing a search in google, ex. when searching on google.be (dutch=nederlands=nl-be) I see the www.pagesdor.be version appearing (french) I was trying to find a fix for this within https://support.google.com/webmasters/answer/189077?hl=nl , but this only seems to apply to websites which use SUBdomains for language purposes. I'm not sure if can work for DOMAINS. Can anyone help me out? Kind regards0 -
Need Urgent Help
I have found one mistake that my place page address is little different than address on all local directories like on place page address is: 10010 S Tryon St #122 Charlotte, NC 28273 and on directories : 10010 South Tryon St 122 Charlotte, NC 28273 so on place page it is just "S" instead of South and "#" is before 122 but on all directories # is missing So what do you suggest ? Should i change address and re verify place page ? Re verify will put down place page value ???
Technical SEO | | mnkpso0 -
Canonical needed after no index
Hi do you need to point canonical from a subpage to main page if you have already marked a no index on the subpage, like when google is not indexing it so do we need canonicals now as is it passing any juice?
Technical SEO | | razasaeed0 -
Yoast settings help
I could use some real help here in my Yoast settings. I had some great settings before but we switched servers and it looks like we lost all our settings. I've taken some screenshots and I'm hoping someone can help! http://d.pr/i/chNQ http://d.pr/i/51TY http://d.pr/i/io7S http://d.pr/i/nak http://d.pr/i/acon The site is run by a couple guys. Please help!
Technical SEO | | ttb0 -
NEED HELP ASAP: SERVER ISSUE
Hey guys, Some of you may be aware of our story. We have a website about or son who was born with Down syndrome. Two days a go a post I wrote went sort of viral, and I woke up this morning to an email from my host saying they had to take my site down as an emergency because of the amount of resources it is using. So now my site is down (noahsdad.com.) ...any ideas how to proceeded? I really need to get my site back online asap. Thank you.
Technical SEO | | NoahsDad0 -
Help internet archive
Hi can anyone help me. I am trying to track information from sites. I am at the moment got solicitiors involved ref trade mark of a name and we are trying to prove that this name was being used before the trade mark application. A company has gone ahead with a trade mark in full knowledge that this is a common name and we have been told if we can prove that the name was being used before it was registered then we can knock out the trade mark. We have been told there is an internet archive but i cannot find one, can anyone please let me know how i can look at sites from before 2010 and see about if they were using this word to help our case.
Technical SEO | | ClaireH-1848860 -
Index forum sites
Hi Moz Team, somehow the last question i raised a few days ago not only wasnt answered up until now, it was also completely deleted and the credit was not "refunded" - obviously there was some data loss involved with your restructuring. Can you check whether you still find the last question and answer it quickly? I need the answer 🙂 Here is one more question: I bought a website that has a huge forum, loads of pages with user generated content. Overall around 500.000 Threads with 9 Million comments. The complete forum is noindex/nofollow when i bought the site, now i am thinking about what is the best way to unleash the potential. The current system is vBulletin 3.6.10. a) Shall i first do an update of vbulletin to version 4 and use the vSEO tool to make the URLs clean, more user and search engine friendly before i switch to index/follow? b) would you recommend to have the forum in the folder structure or on a subdomain? As far as i know subdomain does take lesser strenght from the TLD, however, it is safer because the subdomain is seen as a separate entity from the regular TLD. Having it in he folder makes it easiert to pass strenght from the TLD to the forum, however, it puts my TLD at risk c) Would you release all forum sites at once or section by section? I think section by section looks rather unnatural not only to search engines but also to users, however, i am afraid of blasting more than a millionpages into the index at once. d) Would you index the first page of a threat or all pages of a threat? I fear duplicate content as the different pages of the threat contain different body content but the same Title and possibly the same h1. Looking forward to hear from you soon! Best Fabian
Technical SEO | | fabiank0