Who does L&Y Decision work with?

L&Y Decision works with e-commerce brands, DTC companies, and retail businesses, typically with revenues between $500K and $50M, who are serious about making evidence-based marketing decisions. They specialise in small and medium businesses ready to graduate from gut feeling.

What data access does L&Y Decision need to get started?

Read-only access to ad platforms (Meta Ads, Google Ads), analytics (GA4 or equivalent), and commerce data (Shopify, WooCommerce, etc.). L&Y Decision never touches campaign settings, budgets, or creative.

How quickly will e-commerce brands see results from decision science?

Most engagements deliver initial insights and recommendations within 2–4 weeks. Implementation of those recommendations typically produces measurable impact within 60–90 days. Causal inference and attribution model projects take 4–8 weeks by design.

Do you need an in-house data team for marketing attribution modeling?

No. L&Y Decision handles data extraction, transformation, and modeling entirely. If you can grant read access to your platforms, they do the rest. Many clients have no dedicated data function at all.

What is causal inference in marketing?

Causal inference in marketing is a statistical method that isolates the true incremental impact of advertising: measuring what would have happened without the ads (the counterfactual). Unlike correlation-based attribution, causal methods such as geo-holdout experiments, difference-in-differences, and Bayesian structural time-series prove that ads caused revenue, not merely correlated with it.

What is marketing mix modeling (MMM)?

Marketing mix modeling (MMM) is a statistical technique that quantifies how each marketing channel, paid search, social, email, TV, out-of-home, contributes to sales. Unlike last-click attribution, MMM uses aggregate time-series data to measure true channel contribution, accounting for external factors like seasonality and price changes.

How do CMOs and founders justify marketing analytics investment to their CFO or board?

L&Y Decision builds board-ready marketing ROI models that CFOs and investors can audit. Platform-reported attribution overclaims by 30–200% due to cross-channel double-counting. Causal models, using geo-holdout experiments and difference-in-differences analysis, isolate true incremental revenue. Clients typically discover they can reallocate 20–35% of ad spend to higher-performing channels, reducing customer acquisition cost by 25–35% on average.

Is L&Y Decision suitable for small and medium-sized businesses or small-cap public companies?

Yes. L&Y Decision was built specifically for brands that want enterprise-grade measurement without an in-house data science team. Typical clients are e-commerce, DTC, or retail brands with revenues between $500K and $50M, spending $10K–$100K per month on paid advertising. L&Y Decision handles all data extraction, modeling, and interpretation. No data engineering resources required.

← All Articles

MeasurementMay 2026 · 6 min read

Incrementality Testing 101: What Every E-Commerce CMO Needs to Know

By L&Y Decision

Every attribution platform in your stack has a financial stake in the number it reports to you. Incrementality testing ignores those numbers entirely and asks the only question that should determine where your budget goes: how many purchases occurred because of your advertising that would not have happened otherwise?

The Measurement Problem Underneath Your Dashboard

Your attribution model answers a question you never asked: which platform was present when a conversion happened? The question that drives budget decisions is different, which platform caused a conversion that would not have happened otherwise? Confusing these two is not a minor data quality issue. It is a structural flaw that compounds every quarter you act on the wrong answer.

The mechanism is easy to see once stated clearly. A customer who would have bought from you regardless, because she is a brand loyalist, because she found you through a word-of-mouth referral, because she was already deep in your checkout flow before any retargeting ad reached her, will still generate an attribution event. That event gets credited to a channel. Your dashboard shows a conversion. Your CPA model treats it as a win. The spend that reached this customer was, in every meaningful sense, wasted. But your reporting has no way to distinguish it from spend that moved a purchase decision.

Research on multi-touch attribution models has repeatedly shown that most digital advertising environments produce significant over-crediting of retargeting and lower-funnel channels. These channels are excellent at being present at conversions. They are not always the cause of them. The difference, summed across a year of media spend, can represent 20 to 50 percent of budget allocated to channels that are measuring correlation rather than causing it.

What Incrementality Testing Measures

Incrementality testing is a controlled experiment. You take a target audience, a media market, a customer segment, a device cohort, and split it into two groups. The test group receives advertising as normal. The holdout group does not. You measure the conversion rate difference between the two groups over a fixed window. The gap in conversion rates, multiplied by the holdout population, gives you your incremental conversions. Dividing your spend by that number gives you your true incremental CPA.

The critical output is not just the incremental CPA. It is the relationship between your incremental CPA and your attributed CPA. If your attribution model reports a channel at a $45 CPA and your incrementality test reveals a $120 incremental CPA, the gap represents the non-incremental spend embedded in that channel. Every dollar closing that gap is being deployed against conversions that would have happened without you.

This calculation directly reprices your channel mix. A channel with a strong attributed CPA and a weak incremental CPA looks worse under incrementality measurement than it appeared before. A channel with a weak attributed CPA, perhaps a top-funnel video channel that attribution never credits with conversions, can look dramatically better when measured incrementally, because its influence appears downstream in the funnel rather than at the moment of conversion.

Three Methods, Three Tradeoffs

There is no single incrementality methodology. Three approaches are in wide use among performance marketers, each with different data requirements, cost structures, and confidence levels.

Geo-lift testing. The most accessible method for most e-commerce brands. You divide your target geography into matched market pairs (cities or DMAs with similar baseline conversion rates, seasonality patterns, and demographic profiles) and withhold advertising from one market in each pair while running normally in the other. After a fixed test window, you compare conversion lift across paired markets. Geo-lift tests require no special platform access, can be run across multiple channels simultaneously, and produce results independent of platform-reported data. The main limitation is geographic heterogeneity: no two markets are perfectly matched, and confounding factors can contaminate results if the test window is too short.

Platform-native holdout studies. Most major advertising platforms (Meta, Google, TikTok) offer in-platform holdout testing where a percentage of your target audience is randomly excluded from your campaigns. Conversion lift is measured within the platform's own data. The advantage is precision: platform-native holdouts use proper randomization within the platform's identity graph, eliminating geographic heterogeneity. The disadvantage is that you are measuring incrementality as reported by the platform being tested, which has a financial incentive to surface positive results.

Ghost ads (PSA holdouts). The cleanest methodology from a statistical standpoint, though operationally more complex. The holdout group sees a neutral public-service advertisement placed by you, matching the impressions and targeting of your real campaign, but with no commercial content. This eliminates selection bias while preventing the holdout from converting due to your real ad. Ghost ads are used primarily by large advertisers with dedicated measurement teams. For most DTC brands, geo-lift testing delivers comparable confidence at lower operational cost.

Running a Geo-Lift Test: What You Need

A geo-lift test requires four things: a geographic split with comparable baseline conversion rates, a clean test period with no major promotions or external shocks, a minimum hold period of three to four weeks to accumulate statistical power, and a clean way to measure conversions by geography.

The most common failure mode is impatience. Geo-lift tests require a minimum period to accumulate enough conversions in the holdout markets to detect a real lift. For brands with high transaction volume, three weeks is often sufficient. For brands with fewer than 200 daily orders across all test markets, four to six weeks may be required to achieve the statistical power needed to distinguish signal from noise. Starting with too short a window produces results that cannot be interpreted.

Match quality matters more than market size. The goal is not to find your largest markets and split them, it is to find pairs of markets where baseline conversion behavior is as similar as possible before the test begins. Building the comparison on eight or more weeks of pre-test conversion data and checking for parallel trends is the standard quality check. If your paired markets were not moving together before the test, any divergence during the test cannot be attributed to your advertising.

Reading Your Incrementality Results

Your test produces two numbers: the conversion rate in the test market and the conversion rate in the control market. The difference, expressed as a percentage lift, is your measured incrementality. If your test market converted at 2.4% and your control market at 1.8%, your measured lift is 33%. Roughly one-third of conversions in the test market were incremental to your advertising.

A 33% incrementality rate sounds like a success. Whether it is depends entirely on context. If your channel generated $80,000 in attributed revenue on $10,000 in spend, and your incrementality test shows 33% of that revenue was incremental, your actual incremental revenue is $26,400 against $10,000 in spend. That is still a positive return. But if your previous budget decisions were made assuming $80,000 in incremental revenue, you have been significantly overvaluing this channel in your allocation model.

The Metric That Changes Everything

The number that should govern budget allocation decisions is incremental ROAS, not attributed ROAS, not blended ROAS, not platform-reported ROAS. Incremental ROAS is the revenue that would not have existed without your advertising, divided by the cost of generating it. Every other ROAS definition includes conversions that happened for reasons unrelated to your advertising.

Setting incrementality targets requires knowing your unit economics. If your contribution margin on a first purchase is 35%, you need an incremental ROAS above approximately 2.9x to cover acquisition costs against a one-purchase payback window. The correct hurdle rate changes as your LTV/CAC ratio shifts and as your payback period assumption changes. But the input, incremental ROAS, is the only version of that metric that maps to actual causal impact.

Most brands discovering incrementality measurement for the first time find their incremental ROAS significantly below their attributed ROAS. The gap is not a measurement error. It is the cost of years of decisions made using metrics that were answering the wrong question.

Building a Measurement Calendar Around Incrementality

Incrementality testing is not a one-time audit. It is a recurring cadence. Channel performance changes as media markets shift, as your customer base matures, and as platform algorithms evolve. A test result from eighteen months ago may not reflect current performance.

The practical cadence for most DTC brands: one major geo-lift test per channel per year, covering your highest-spend channels on a rotating basis. Supplement with in-platform holdout studies when platform conditions allow. Use the results to set quarterly budget allocations, rather than relying on weekly attribution reports for strategic decisions. Attribution data remains useful for day-to-day platform optimization, it is the wrong tool for deciding which channels deserve capital.

Source

Danaher, Peter J. and Rust, Roland T. 'Determining the Optimal Return on Investment for an Advertising Campaign.' European Journal of Operational Research (2017). Gordon, Brett R., et al. 'Inefficiencies in Digital Advertising Markets.' Journal of Marketing (2021). Nielsen, 'Marketing Effectiveness and Incrementality' (2024).

View all →

Analytics

The Attribution Models Are Getting Smarter. Most Brands Are Still Too Broke to Use Them.

New causal marketing mix models, a 4,200-test CRO study, and fresh CAC benchmarks all say the same thing: sophistication isn't the bottleneck for most brands. Broken tracking, underpowered tests, and blended CAC reporting are.

July 2026 · 8 min read

Guide

Geo Holdout Testing: A Step-by-Step Guide for E-Commerce Brands

Geo holdout tests answer the question attribution dashboards can't: does your ad spend cause revenue? This guide covers market selection, test design, runtime, and reading the results, with no data science team required.

July 2026 · 12 min read

Attribution

The Platforms Grading Their Own Homework: Why Your Attribution Data Is Structurally Broken

A peer-reviewed paper from NeurIPS 2025 formally proves what performance marketers have suspected for years: the mechanism that decides which of your ad platforms gets credit for your conversions is mathematically designed to be gamed.

June 2026 · 8 min read

Currently accepting new clients

Ready to prove your marketing ROI?

Book a free 30-minute consultation. Bring your ad account numbers; leave knowing which channels earn their budget.

Review your current attribution and measurement setup

Identify the highest-priority evidence gaps

Leave knowing which channels earn their budget

Free. No commitment.