Help with facet URLs in Magento
-
Hi Guys,
Wondering if I can get some technical help here...
We have our site britishbraces.co.uk , built in Magento. As per eCommerce sites, we have paginated pages throughout.
These have rel=next/prev implemented but not correctly ( as it is not in is it in ) - this fix is in process.
Our canonicals are currently incorrect as far as I believe, as even when content is filtered, the canonical takes you back to the first page URL. For example,
http://www.britishbraces.co.uk/braces/x-style.html?ajaxcatalog=true&brand=380&max=51.19&min=31.19
Canonical to...
http://www.britishbraces.co.uk/braces/x-style.html
Which I understand to be incorrect.
As I want the coloured filtered pages to be indexed ( due to search volume for colour related queries ), but I don't want the price filtered pages to be indexed - I am unsure how to implement the solution?
As I understand, because rel=next/prev implemented ( with no View All page ), the rel=canonical is not necessary as Google understands page 1 is the first page in the series.
Therefore, once a user has filtered by colour, there should then be a canonical pointing to the coloured filter URL? ( e.g. /product/black )
But when a user filters by price, there should be noindex on those URLs ? Or can this be blocked in robots.txt prior?
My head is a little confused here and I know we have an issue because our amount of indexed pages is increasing day by day but to no solution of the facet urls.
Can anybody help - apologies in advance if I have confused the matter.
Thanks
-
Hi Lewis,
Firstly thank you for taking your time to respond in depth to my question.
Since reading your response, I have done the following...
Identified the parameters that should NOT be indexed, these are; 'brand=', 'min=' and 'max='
The colour filter 'colour=' is to be kept indexed. I have reviewed the website and found that users cannot currently select to filter more than on colour, which eliminates Google from indexing multiple colour filters in one URL.
However, users can still filter by colour and brand, hence why I have requested ours devs to meta noindex any URL that contains the 'brand=' parameter as well as any URLs that have the 'min/max=' parameters as these are price filters.
I have also requested rel=next/prev to be implemented correctly.
The above should drastically reduce our indexed content.
As well as this, I have added the following parameters into Search Consoles' URL Parameter tool as 'No Crawl', 'brand, min, max' - although I understand this is not a guaranteed fix, it was my first option with no immediate dev time over the weekend.
Now the only URLs in need of a canonical is the colour filtered URLs as 'brand, min max' are all noindex. I have asked dev to ensure the canonical points back to page 1 for now, however I am looking into a view-all page option so the canonical would point to that.
A good learning curve all of this!
-
There is a big difference between robots.txt and no index
"Therefore, once a user has filtered by colour, there should then be a canonical pointing to the coloured filter URL? ( e.g. /product/black )
But when a user filters by price, there should be noindex on those URLs Or can this be blocked in robots.txt prior?"
See http://i.imgur.com/114BHcR.png
You need to use a no index tag not robots.txt ideally with a secular canonical pointing to the product.
Please see references one and two below. There are larger versions of the photos below as well
You need to run your site through deep crawl and or screaming frog SEO spider If you would be kind enough to give me the URL privately or publicly I will run a deep crawl and SEO spider
** This topic is difficult to explain without using the ability to show videos and images inside the box while describing this. That's why I recommend you view this YouTube video and slide share.**
Deep crawl is fantastic at solving these issues it has done this for other magenta clients of mine, and I strongly recommend utilizing what you've learned from that webinar and the other references below.
please see one and two below
- https://www.deepcrawl.com/knowledge/webinars/masterclass-webinar-faceted-navigation-for-seo/
- https://www.stonetemple.com/seo-tags-virtual-keynote-with-gary-illyes-and-eric-enge/
-
https://webmasters.googleblog.com/2014/02/faceted-navigation-best-and-5-of-worst.html
-
https://mza.bundledseo.com/blog/building-faceted-navigation-that-doesnt-suck
-
http://searchengineland.com/google-offers-advice-faceted-navigation-infinite-scroll-web-pages-184232
larger versions of the images
I agree with Lewis's recommendation for an extension and have added a couple more.
- http://www.mageworx.com/magento-2-seo-extension.html
- https://ecommerce.aheadworks.com/magento-extensions/ultimate-seo-suite.html
- https://ecommerce.aheadworks.com/magento-2-extensions/layered-navigation
I Hope this helps,
Thomas
78tExl8.png nMrYeUWlslY xJeFTbY.jpg wOHxaEE.jpg QprPUyk.jpg 114BHcR.png
-
Hi!
We do a lot of consultancy for Magento projects and this is a question that comes up quite regularly as it can't really be handled perfectly straight out of the box with Magento.
Every implementation is a little bit different, but I'll put together some recommendations below based on the information available at the moment.
For your faceted navigation, you ideally don't want to index any of these pages, unless you believe that you'll rank in your own right for specific filters (e.g. Colour, like you pointed out in your last message).
That then comes with some additional complications. In Magento, if you have 3 colours available in the faceted nav, you'll have all the different variations indexed in each combination.
For example:
Blue
Black
RedBlue + Black
Blue + Red
Black + Red
Black + Blue
Red + Blue
Red + BlackMagento as standard doesn't always keep the filters in the same order, so you can end up with literally thousands of pages ending up in the index for a relatively small number of attributes being shown on your pages.
There are a few recommendations here:
- Go and look at the MageWorx Ultimate SEO Suite Plugin - http://www.mageworx.com/seo-suite-ultimate-magento-extension.html - For $249, it solves a lot of issues Magneto has straight out of the box and gives you ultimate control over your meta titles.
What you want to do is set all of your facets to 'NOINDEX,FOLLOW' where possible. This will reduce the number of URLs in the index gradually. An example of this would be adding ?min=* and mode=* etc (grid/list variants).
- For your canonicals, you're probably best setting the canonical to the current filtered page (for example, if you're on a category page with colour = blue selected in your faceted nav, you'd have this URL as your canonical). Some sites we work on have it setup so the canonical points to the category URL (like you currently have).
Finally, you probably want to build an extension to allow you to inject content into the filtered content pages. If you're using an extension like ManaDev for your facet navigation, this can be achieved fairly easily and allows you to add a block of text to each filter applied on a page.
You should also look to request each of the incorrectly indexed URLs is removed from the index (although this does take a long time if you have a lot!).
We wrote a really long guide around launching a Magento website last month which may be of interest - https://www.pinpointdesigns.co.uk/the-definitive-guide-to-launching-a-magento-website/. We've also done a guide on Common Magento SEO Issues here - https://www.pinpointdesigns.co.uk/common-magento-seo-issues/ and I previously wrote a guide on setting Magento up for Search Engines on Moz - https://mza.bundledseo.com/ugc/setting-up-magento-for-the-search-engines (Although this is likely to be a little outdated now)
I hope this helps!
Lewis
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
My url disappeared from Google but Search Console shows indexed. This url has been indexed for more than a year. Please help!
Super weird problem that I can't solve for last 5 hours. One of my urls: https://www.dcacar.com/lax-car-service.html Has been indexed for more than a year and also has an AMP version, few hours ago I realized that it had disappeared from serps. We were ranking on page 1 for several key terms. When I perform a search "site:dcacar.com " the url is no where to be found on all 5 pages. But when I check my Google Console it shows as indexed I requested to index again but nothing changed. All other 50 or so urls are not effected at all, this is the only url that has gone missing can someone solve this mystery for me please. Thanks a lot in advance.
Intermediate & Advanced SEO | | Davit19850 -
Duplicate content with URLs
Hi all, Do you think that is possible to have duplicate content issues because we provide a unique image with 5 different URLs ? In the HTML code pages, just one URL is provide. It's enough for that Google don't see the other URLs or not ? Example, in this article : http://www.parismatch.com/People/Kim-Kardashian-sa-securite-n-a-pas-de-prix-1092112 The same image is available on: http://cdn-parismatch.ladmedia.fr/var/news/storage/images/paris-match/people/kim-kardashian-sa-securite-n-a-pas-de-prix-1092112/15629236-1-fre-FR/Kim-Kardashian-sa-securite-n-a-pas-de-prix.jpg http://resize-parismatch.ladmedia.fr/img/var/news/storage/images/paris-match/people/kim-kardashian-sa-securite-n-a-pas-de-prix-1092112/15629236-1-fre-FR/Kim-Kardashian-sa-securite-n-a-pas-de-prix.jpg http://resize1-parismatch.ladmedia.fr/img/var/news/storage/images/paris-match/people/kim-kardashian-sa-securite-n-a-pas-de-prix-1092112/15629236-1-fre-FR/Kim-Kardashian-sa-securite-n-a-pas-de-prix.jpg http://resize2-parismatch.ladmedia.fr/img/var/news/storage/images/paris-match/people/kim-kardashian-sa-securite-n-a-pas-de-prix-1092112/15629236-1-fre-FR/Kim-Kardashian-sa-securite-n-a-pas-de-prix.jpg http://resize3-parismatch.ladmedia.fr/img/var/news/storage/images/paris-match/people/kim-kardashian-sa-securite-n-a-pas-de-prix-1092112/15629236-1-fre-FR/Kim-Kardashian-sa-securite-n-a-pas-de-prix.jpg Thank you very much for your help. Julien
Intermediate & Advanced SEO | | Julien.Ferras0 -
Multi URL treated as one?
I had previous asked this question, where the issue turned out to be that I didn't have all the URLs in Google Search console. Whoops! So I have added 4 properties that are really all the same property: https:// https://www http:// http://www I have added all of these. This has raised a few more questions: Can I get Google Search Console to treat these (and even group these together) to show as one property? Right now they are all listed separately. I know in Site Settings you can set a Preferred Site. Even so, they show as separate sites with data separately. Can I merge these? What about Moz? Should I do something similar to see traffic for each of these in Moz? It looks like we are missing a ton of info. Does Moz get this from GSC automatically? What about sitemaps? Can I fix this in sitemaps? Do I need separate sitemaps for each property?
Intermediate & Advanced SEO | | TapGoods0 -
URL Injection Hack - What to do with spammy URLs that keep appearing in Google's index?
A website was hacked (URL injection) but the malicious code has been cleaned up and removed from all pages. However, whenever we run a site:domain.com in Google, we keep finding more spammy URLs from the hack. They all lead to a 404 error page since the hack was cleaned up in the code. We have been using the Google WMT Remove URLs tool to have these spammy URLs removed from Google's index but new URLs keep appearing every day. We looked at the cache dates on these URLs and they are vary in dates but none are recent and most are from a month ago when the initial hack occurred. My question is...should we continue to check the index every day and keep submitting these URLs to be removed manually? Or since they all lead to a 404 page will Google eventually remove these spammy URLs from the index automatically? Thanks in advance Moz community for your feedback.
Intermediate & Advanced SEO | | peteboyd0 -
Help with htaccess
I just setup a WP install in a subfolder: domain.com/development/ However, there is an existing htaccess file in the root which contains the following: RewriteRule ^([A-Za-z_0-9-]+)$ /index.php?page=$1 [QSA]
Intermediate & Advanced SEO | | SCW
RewriteRule ^([A-Za-z_0-9-]+)/$ /index.php?page=$1 [QSA]
RewriteRule ^([A-Za-z_0-9-]+)/([a-z]+)$ /index.php?page=$1&comp=$2 [QSA]
RewriteRule ^([A-Za-z_0-9-]+)/([a-z]+)/$ /index.php?page=$1&comp=$2 [QSA] I need to leave the rules as-is due to the nature of CMS (not WP) under the root domain. Is it possible to include an exception or condition which allows URL requests containing /development/ to resolve to that folder? I tried to add: RewriteRule ^development/([A-Za-z_0-9-]+)$ /development/index.php?page=$1 [QSA] but this seems to send it in a loop back to the root. Thanks!!!0 -
Canonical url question
i just search seomoz tooll it say duplicate content for www.mysite.com and www.mysite.com/index.php should i use canonical url for this ? is yes then is this right ?
Intermediate & Advanced SEO | | constructionhelpline0 -
How to 301 redirect all URLs with /? in?
I want to redirect all URLs that have /? in it. Indexed in Google is a bunch of urls lik: mysite.com/?674764 mysite.com/?rtf8y78 I want all these URLs to be redirected to my home page. Any ideas?
Intermediate & Advanced SEO | | JohnPeters0 -
Page URL Issue
Hey Friend, I am having sort of a problem. I currently have a subpage with the url of: /musclecars/ I also have a subpage at /muscle-cars/muscle-car-restoration.html Obviously my main url is not listed here. My problem is I am trying to rank for the term Muscle Cars but the first URL does not have the keywords seperated so I rank no where. If I type MuscleCars into google I rank though (but nobody types the keyword in like that). So my question is can I create muscle-cars.mydomainname.com and rank well with that? Or is it better to just use mydomainname.com/muscle-cars/ even though that second term I am ranking for already has that in its url?
Intermediate & Advanced SEO | | shandaman0