I spent a while this month in a rabbit hole named “last non-direct click session attribution in GA4.” If you happen to pass by the same rabbit hole, I recommend staying the #$%& away.
My journey down the hole started because I’d set up a GA4 BigQuery data pipeline for a client, but my session numbers weren’t matching GA4. I’d created basically the same pipeline for other clients, and everything matched up well, but this client was different because the majority of their traffic is repeat visitors, and completing a purchase often spans multiple sessions.
If you started to nod off in that last paragraph, I’ll leave you with this and recommend that you skip ahead:
Being good at working with data and understanding data are not the same thing.
If you work with someone who is really proficient at data engineering, don’t let your guard down. Start with the assumption that what they tell you is wrong and look for evidence to support your assumption. I know, I sound like “mr. negative”, but optimism has no place in data analysis.
Here are some of the issues I had to deal with to get my numbers to match GA4:
Now that you’ve made it this far, here’s my second piece of advice: strongly consider using a third-party solution for processing GA4 data. If someone tells you it’s easy to build reporting off of the BigQuery export, they either haven’t done it right or they haven’t done it at all. Last month I wrote about ga4dataform.com, which should be on your short list to consider. And I’m in the process of testing another solution from PipedOut, which I’ll cover in my next roundup. I did eventually make it out of the GA4 attribution rabbit hole I was in, and I learned some useful things while there, but in general I’d rather spend my time solving new problems than replicating GA4 metrics.
I usually write about GA4, Looker Studio and Google Tag Manager updates that stand out, but I guess even Google engineers get a little down time now and then. I was pretty excited when they announced a major update to charts in Looker Studio last week, but they unannounced it shortly thereafter:
I’m not sure what happened, but hopefully they’ll announce it again soon.
Google Meridian is an open-source marketing-mix-modeling framework and toolset that has just been made generally available.
If you are not familiar with marketing mix modeling, here is how Wikipedia describes it:
Marketing Mix Modeling (MMM) is a forecasting methodology used to estimate the impact of various marketing tactic scenarios on product sales. MMMs use statistical models, such as multivariate regressions, and use sales and marketing time-series data. They are often used to optimize advertising mix and promotional tactics with respect to sales, revenue, or profit to maximize their return on investment.
I haven’t had a chance to play with it yet, but it fits perfectly with my priorities and goals for 2025 so expect to hear more about it in months to come.
Sign up for our newsletter to get Nico’s monthly Analytics Roundups delivered to your email box.
Nico loves marketing analytics, running, and analytics about running. He's Two Octobers' Head of Analytics, and loves teaching. Learn more about Nico or read more blogs he has written.
Learn what your customers really want by studying Google search results. Search pages reveal what…
Our top-read & viewed content in 2024 shows marketers working on mastering GA4. The learning…
This month we cover an intriguing study on search behavior and a fascinating trove of…