The great bot spam experiment (results pending)


Hey Reader!

I've been seriously irked lately. Client sites are getting hammered with spam and bot traffic, primarily from China and Singapore, and I've been running experiments to see what I can filter out in Google Tag Manager before it ever pollutes our client’s production GA4. Here's where I'm at so far:

  1. GeoIP — block based on country. Only worked a little, since the IPs that spammers use typically aren't in geoIP databases.
  2. Browser version — most spammers are using an older version of Chrome, but some are using newer ones, so this only helped a bit.
  3. Time Zone — I'm testing this now to detect the browser's time zone and see if that provides a cleaner signal. Initial results are promising but this only works well if you aren’t expecting to have any visitors from China/Singapore time zones.
  4. Webdriver — most headless browsers report this as true, but smart spammers will report false (as these spammers are).
  5. Languages — spam browsers will usually only have one language set.
  6. Plugin count — spam browsers typically have zero plugins.

The jury is still out on which one or combination of these will actually hold up at scale. I'll report back in the next issue with what I've learned! If you've dealt with this problem and found something that works, I'd love to hear about it.

Dana DiTomaso

Founder
dana@kpplaybook.com


Stop Letting Ad Platforms Guess What "Good" Looks Like

Speaking of traffic quality, how do you actually tell Google Ads or Meta what a quality visit looks like, beyond just waiting for a form fill?

My latest guide walks through how to build a quality traffic metric in GTM: a composite event that requires two things to be true at the same time — a time threshold (like 30 seconds) AND a scroll depth or element visibility trigger. When both conditions are met, the event fires. One Trigger Group in GTM, multiple tags sending the signal to Google Ads, Meta, and GA4.

What I appreciate about this approach is that it sits right in the middle of your measurement hierarchy: more meaningful than "technically engaged," less restrictive than a form submission. And it gives you a diagnostic matrix for troubleshooting campaigns as low quality traffic rate alongside low conversion rate tells you something very different than healthy quality traffic rate with low conversions.

Read the full guide →


Articles Worth Your Time
———•

Log File Analysis Is Back on the Menu (Did It Ever Actually Go Away?)

SearchPilot pulled together a roundup of the most underrated retail SEO levers for 2026 from in-house folks and one point jumped out at me immediately: log file analysis is having a moment again.

Antonis Konstantinidis from Charlotte Tilbury makes the case that with all the talk about LLMs and bots, most teams still aren't actually checking their server logs to see what those bots are doing. Logs tell you whether crawlers are reaching your key pages, whether they're being blocked or slowed down, and whether a recent deployment accidentally cut something off.

Sure, this piece is retail-focused, but the principles apply everywhere. Do you have a way to easily analyze your log files? If not, this might be the nudge to set that up. The rest of the roundup is worth your time too.


From Keywords to Topics: What That Actually Means for Your Analysis

Ahrefs published a thorough explainer on how to focus on topics rather than individual keywords in your SEO strategy. Another SEO post? I promise this one has a very relevant analytics angle.

What I was thinking while I was reading is this is where analysis can really help. If you're working with Google Search Console data, building a blend that groups keywords by topic gives you a much more useful picture of performance than keyword-by-keyword reporting. Instead of watching 50 individual queries go up and down, you can see whether your topical coverage is growing or shrinking as a whole.

Worth reading, and worth thinking about how you'd structure your own reporting to reflect topical performance rather than just individual query rankings.


Why "I Don't Trust Their Data" Is a Positioning Problem

I know, another “this isn’t analytics!!!” post. Hear me out.

April Dunford wrote a guide to advanced B2B positioning on Lenny's Newsletter that made me think immediately about how analytics practitioners are perceived inside organizations.

Analytics can be a weird, siloed discipline that stakeholders either over-rely on or quietly distrust. One of the hardest situations I encounter is when there are known data quality issues and leadership uses those gaps as a reason to dismiss findings entirely ("we can't trust this data") instead of making decisions with appropriate caveats.

April's piece is really about how positioning shapes the lens through which people evaluate what you do. If stakeholders understand why a gap exists and whether it actually affects the specific decision at hand, they're in a much better position to move forward confidently. The difference between "I don't trust their data" and "we're aware of these gaps, but they don't affect this decision" is largely a positioning and communication problem, not just a technical one.


Where You Can Find Me
———•

Women in Tech SEO — Portland

I'll be speaking at the Women in Tech SEO conference in Portland this May! If you're going to be there, come find me — I'd love to meet some of you in person.


That's it for this edition of The Huddle. As always, if you have questions or want to share what you're working on, just hit reply!

Want more? You got it!

📈

Practical GA4

Sign up →

📘

Free Resources

Get yours today →

🛠️

GA4 Workshops

Level up in GA4 →

Was this email forwarded to you? Sign up here!

PO Box 68171 RPO Bonnie Doon Shopping Centre, Edmonton, Alberta T6C 4N6
Unsubscribe · Preferences

Analytics Playbook by Dana DiTomaso

Analytics Playbook gives you the analytics skills you need to land more clients, level up your career, or make smarter marketing decisions. Get bi-weekly insights curated by analytics expert Dana DiTomaso. Each issue includes expert tips, must-read articles, and free resources, all designed to help you take action and see real results.

Read more from Analytics Playbook by Dana DiTomaso

Hey Reader! I read a post recently from Sanity called "Content is Data" and it's been rattling around in my head ever since. The argument: content locked inside a traditional CMS, where it is tied to its presentation, copy-pasted between surfaces, and impossible to query, is a bottleneck to everything you want to do next. Your content isn't really content. It's HTML with some text trapped inside. That got me thinking about how if content is data, how would that interact with marketing data?...

Hey Reader! I want to talk about something I've been deliberately not writing about for a while and that’s server-side tracking. The honest reason I've held off is that for a lot of the people reading this, it hasn't been a practical option. The setup is technical, the cost is real, and most clients, when you describe what's involved, look at you like you've suggested they renovate the kitchen during a dinner party. There's no good way to time that conversation. But at the same time what’s...

Hey Reader! Here is a mantra that I want you to carry with you in your professional career: we are not data accountants. Somewhere along the way, marketers got sold the idea that good analytics means precise analytics. That if the numbers don't add up perfectly, something is broken. That your job is to account for every single session, every click, every conversion, and if you can't, you've failed. The problem with that whole idea is that the data was never complete. It just used to be less...