Stripping Out Referral Spam From Past Reports
-
Hi,
I'm looking to confirm the best approach for retroactively stripping out referral spam (free buttons, Semalt, etc.). To be clear, I already have filters in place to exclude them from current stats, so moving forward I'm fine. However, I'd love to go back and check untainted stats.
I've set up segments using a regex to strip out the root words, and it seems to be working. The regex strips out things like: social-buttons|seoanalyses|copyrightclaims|classifiedads|jobsense|free-share-buttons|e-buyeasy|acrobats.hol|cheap-online|amezon|search-help|qut-smoking and so forth.
I've been going through my referral data, noticing obvious spam, and adding their domains to my segment. Is this the optimal way for me to get a clear, untainted view of my past stats?
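For anyone who wants to sanity-check a pattern like this before putting it into a segment, here is a minimal Python sketch. It assumes the segment does a plain substring regex match on the referrer hostname; the function name and the sample hostnames are made up for illustration, and only a few of the root words from the list are included.

```python
import re

# A few of the root words from the segment pattern (not the full list).
SPAM_PATTERN = re.compile(
    r"social-buttons|seoanalyses|copyrightclaims|free-share-buttons",
    re.IGNORECASE,
)

def split_referrers(hostnames):
    """Return (spam, clean) lists based on a substring regex match."""
    spam, clean = [], []
    for host in hostnames:
        (spam if SPAM_PATTERN.search(host) else clean).append(host)
    return spam, clean

# Hypothetical referrer hostnames, for illustration only.
sample = ["site1.free-share-buttons.com", "www.example.org", "buttons.social-buttons.com"]
spam, clean = split_referrers(sample)
print("spam:", spam)
print("clean:", clean)
```

Running real referrer exports through something like this makes it easier to spot a root word that is too broad and would also catch legitimate sources.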
-
Sweet, glad to hear our filters will suffice. Thanks for the input, Daniel.
-
Hey, no worries, and you're right that your filters should block them as well. Using .htaccess would just be an additional defense mechanism and may not be necessary.
-
Hi Daniel,
Thanks again for the response. What would be the difference in Analytics data between my filters and going straight to .htaccess? If the data is the same, is there an additional benefit to .htaccess?
For regular users, I'd suspect less bandwidth since they can't load my domain, but I don't think these bots actually load the page or visit.
-
I would use your .htaccess file to block them with the following code (this would, for example, block referrals from semalt.com and its subdomains):
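RewriteEngine On
Options +FollowSymLinks
# Match referrers from semalt.com and any of its subdomains (case-insensitive).
RewriteCond %{HTTP_REFERER} ^https?://([^.]+\.)*semalt\.com [NC]
# Answer matching requests with 403 Forbidden.
RewriteRule .* - [F]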
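A rough way to see what a referrer pattern of this kind matches is to try it outside Apache first. The Python sketch below assumes the pattern is written with the literal dots escaped and matched case-insensitively; the helper name and test URLs are hypothetical, and Python's regex engine is only a stand-in for the one Apache uses.

```python
import re

# Assumed form of the referrer pattern, with literal dots escaped.
REFERRER_RE = re.compile(r"^https?://([^.]+\.)*semalt\.com", re.IGNORECASE)

def is_blocked(referer):
    """True if the Referer header value would match the pattern."""
    return REFERRER_RE.search(referer) is not None

# Hypothetical Referer values, for illustration only.
for url in [
    "http://semalt.com/",
    "https://crawler.semalt.com/page",
    "http://notsemalt.com/",
]:
    print(url, is_blocked(url))
```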
You can also use .htaccess to block IP addresses associated with the spammy sources.
edit: just saw your edit but hope this helps nevertheless!
-
Hi Daniel,
Thanks for the additional tips. I do have the bot filtering feature enabled as another point of protection. I checked my referral exclusion list and apparently set this up about a year ago for the initial wave of referral bots I noticed. I didn't know it added them to direct.
The majority of my spam referral hosts have been added to regular filters. I think with the combination of my retroactive approach and new filters, I should have reliable data going forward.
-
Hi there,
You’re on the right track and the best way to retroactively remove spammy sources is through report filters and advanced segments.
A couple other notes:
- A good way to spot spammy referrers is to sort by bounce rate and eliminate any with 100% bounce and over 10 sessions.
- Avoid using the “referral exclusion list” since this will just count spam traffic as direct traffic instead.
- You should also enable the GA ‘bot filtering’ feature under ‘Reporting view settings’.
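The bounce-rate check from the first note can be sketched in a few lines of Python. This is only an illustration: the row layout and field names (source, sessions, bounces) are assumptions about how an exported referral report might look, not anything GA produces in this exact form.

```python
def suspicious_referrers(rows, min_sessions=10):
    """Flag referrers with a 100% bounce rate and more than min_sessions sessions."""
    return [
        row["source"]
        for row in rows
        if row["sessions"] > min_sessions and row["bounces"] == row["sessions"]
    ]

# Hypothetical rows, e.g. from an exported referral report.
rows = [
    {"source": "spam.example", "sessions": 40, "bounces": 40},
    {"source": "partner.example", "sessions": 25, "bounces": 9},
    {"source": "tiny.example", "sessions": 3, "bounces": 3},
]
print(suspicious_referrers(rows))
```

Anything flagged this way is still worth a manual look before it goes into a filter or segment, since a small legitimate source can also bounce every time.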