Further Reading
This is a curated list of recommended further reading — books, papers, datasets, and code repositories that go deeper on each topic, organized by part and chapter. These are suggestions for going further, not the works formally cited in the text; for the book’s curated bibliography see References.
Part 1: Introduction
What is Marketing Science?
- Provost, F. & Fawcett, T. (2013). Data Science for Business. O’Reilly Media. — Frames data science around business decisions rather than model accuracy — the book’s core stance; worth revisiting here in Part 1.
- Taddy, M. (2019). Business Data Science: Combining Machine Learning and Economics to Optimize, Automate, and Accelerate Business Decisions. McGraw Hill. — Connects machine learning to business decision-making, closely aligned with Part 1’s thesis.
- Davenport, T. H. & Harris, J. G. (2007). Competing on Analytics: The New Science of Winning. Harvard Business School Press. — On how organizations actually collect, analyze, and act on data — a fitting introduction beyond “run the analysis.”
- Agrawal, A., Gans, J. & Goldfarb, A. (2018). Prediction Machines: The Simple Economics of Artificial Intelligence. Harvard Business Review Press. — Prediction is only one input to a decision; pairs well with the “don’t stop at AUC 0.92” message.
Three Characteristics of Marketing Data
- Pearl, J. & Mackenzie, D. (2018). The Book of Why: The New Science of Cause and Effect. Basic Books. — An intuitive guide to confounding and unobserved causes; supports the “observed data is the tip of the iceberg” framing (also relevant to Part 3).
- Chan, D. & Perry, M. (2017). “Challenges and Opportunities in Media Mix Modeling.” Google Inc. — Concurrent campaigns, multicollinearity, and extrapolation risk illustrate why marketing data is messy (also cited in Part 6).
- Muller, J. Z. (2018). The Tyranny of Metrics. Princeton University Press. — On how measurement distorts behavior; relevant to data as a byproduct of operations (and to KPI design in 2.2).
Part 2: Analysis for Better Decisions
Framing the Right Question
- Shron, M. (2014). Thinking with Data: How to Turn Information into Insights. O’Reilly Media. — A framework for scoping a project, the data it needs, and the problem actually worth solving — the best single fit for this chapter.
- Hubbard, D. W. (2014). How to Measure Anything: Finding the Value of Intangibles in Business. Wiley. — On reducing a fuzzy business question to something measurable at the precision a decision requires.
- Rumelt, R. P. (2011). Good Strategy/Bad Strategy: The Difference and Why It Matters. Crown Business. — Bad strategy is vague aspiration; good strategy names the real obstacle — the same discipline applied to problem framing.
Designing KPIs for Diagnosis
- Farris, P. W., Bendle, N. T., Pfeifer, P. E. & Reibstein, D. J. (2020). Marketing Metrics: The Manager’s Guide to Measuring Marketing Performance. Pearson. — The standard reference for marketing metrics, ROI, and quantifying marketing’s contribution to profit.
- Croll, A. & Yoskovitz, B. (2013). Lean Analytics: Use Data to Build a Better Startup Faster. O’Reilly Media. — The “One Metric That Matters” idea pairs naturally with building a KPI tree.
- Goodhart, C. A. E. (1975). “Problems of Monetary Management: The U.K. Experience.” Papers in Monetary Economics. — The origin of Goodhart’s Law: a measure that becomes a target stops being a good measure.
- Muller, J. Z. (2018). The Tyranny of Metrics. Princeton University Press. — The downside of metrics — not just designing KPIs, but the danger of managing people by them.
Turning Data into Story
- Knaflic, C. N. (2015). Storytelling with Data: A Data Visualization Guide for Business Professionals. Wiley. — The most natural fit for this chapter; foundational guidance on data visualization and communication.
- Minto, B. (2009). The Pyramid Principle: Logic in Writing and Thinking. Pearson. — The source of the executive Situation–Complication–Resolution structure, close to this chapter’s storytelling arc.
- Duarte, N. (2019). DataStory: Explain Data and Inspire Action Through Story. Ideapress Publishing. — Argues a data story should inspire action — a decision story, not a list of findings.
- Tufte, E. R. (2001). The Visual Display of Quantitative Information. Graphics Press. — The classic on statistical graphics, charts, and tables.
- Few, S. (2013). Information Dashboard Design: Displaying Data for At-a-Glance Monitoring. Analytics Press. — A practical complement to Knaflic for dashboard and business-review audiences.
- Wexler, S., Shaffer, J. & Cotgreave, A. (2017). The Big Book of Dashboards: Visualizing Your Data Using Real-World Business Scenarios. Wiley. — Real-world dashboard examples that connect to the outputs shown to decision-makers.
Part 3: Causal Inference for Marketing
Causal Thinking & Selection Bias
- Pearl, J. & Mackenzie, D. (2018). The Book of Why: The New Science of Cause and Effect. Basic Books. — Accessible introduction to DAGs, do-calculus, and the counterfactual framework.
- Angrist, J. D. & Pischke, J.-S. (2009). Mostly Harmless Econometrics. Princeton University Press. — The standard reference for the potential outcomes framework and practical identification strategies.
- Hernán, M. A. & Robins, J. M. (2020). Causal Inference: What If. Chapman & Hall/CRC. — Graduate-level treatment of causal inference from observational and experimental data; freely available online.
- Cunningham, S. (2021). Causal Inference: The Mixtape. Yale University Press. — Practitioner-friendly coverage of DiD, IV, RDD, and synthetic control with code examples.
A/B Testing: Design and Pitfalls
- Kohavi, R., Tang, D. & Xu, Y. (2020). Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing. Cambridge University Press. — The definitive practitioner reference, covering design, analysis, pitfalls, and organizational considerations.
- Deng, A., Xu, Y., Kohavi, R. & Walker, T. (2013). “Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-Experiment Data.” WSDM ’13. — The original CUPED paper, showing how pre-experiment covariates reduce variance.
- Johari, R., Koomen, P., Pekelis, L. & Walsh, D. (2017). “Peeking at A/B Tests: Why It Matters, and What to Do About It.” KDD ’17. — Formalizes the peeking problem and proposes always-valid inference as a solution.
Quasi-Experiments in Practice
- Angrist, J. D. & Pischke, J.-S. (2009). Mostly Harmless Econometrics. Princeton University Press. — Core reference for DiD, IV, and RDD with accessible derivations.
- Abadie, A., Diamond, A. & Hainmueller, J. (2010). “Synthetic Control Methods for Comparative Case Studies.” Journal of the American Statistical Association, 105(490), 493–505. — The foundational synthetic control paper.
- Cattaneo, M. D., Idrobo, N. & Titiunik, R. (2020). A Practical Introduction to Regression Discontinuity Designs. Cambridge Elements. — Covers sharp and fuzzy RDD with the
rdrobustpackage. - Cunningham, S. (2021). Causal Inference: The Mixtape. Yale University Press. — Practitioner-oriented treatment of DiD, RDD, and synthetic control with code examples in R and Stata.
Meta-Learners for CATE
- Künzel, S. R., Sekhon, J. S., Bickel, P. J. & Yu, B. (2019). “Metalearners for Estimating Heterogeneous Treatment Effects Using Machine Learning.” Proceedings of the National Academy of Sciences, 116(10), 4156–4165. — Defines the S/T/X-learner framework and compares performance across settings.
- Chernozhukov, V. et al. (2018). “Double/Debiased Machine Learning for Treatment and Structural Parameters.” The Econometrics Journal, 21(1), C1–C68. — Introduces DML with cross-fitting to remove regularization bias.
- EconML — Microsoft’s Python library for CATE estimation (DML, DR-learner, Causal Forest, and more) with DoWhy integration.
- DoWhy — Causal inference library providing a four-step workflow: model → identify → estimate → refute.
Uplift Modeling: Find the Persuadables
- Gutierrez, P. & Gérardy, J.-Y. (2017). “Causal Inference and Uplift Modelling: A Review of the Literature.” JMLR Workshop and Conference Proceedings, 67, 1–13. — Comprehensive survey of uplift modeling methods and evaluation metrics.
- Radcliffe, N. J. & Surry, P. D. (2011). “Real-World Uplift Modelling with Significance-Based Uplift Trees.” White Paper, Stochastic Solutions. — Introduces uplift trees with statistical significance-based splitting criteria.
- CausalML — Uber’s Python library for uplift modeling and CATE estimation, including
UpliftRandomForestClassifierand Qini/AUUC evaluation tools. - Diemert, E., Betlei, A., Renaudin, C. & Amini, M.-R. (2018). “A Large Scale Benchmark for Uplift Modeling.” KDD ’18 Workshop. — Describes the Criteo uplift dataset (13M samples) used in benchmarking throughout this chapter.
Part 4: Customer Analytics
Customer Segmentation
- Bult, J. R. & Wansbeek, T. (1995). “Optimal Selection for Direct Mail.” Marketing Science, 14(4), 378–394. — the foundational paper on optimal selection for direct mail using RFM-based targeting. Establishes the breakeven framework for evaluating segment-targeted campaigns.
- Provost, F. & Fawcett, T. (2013). Data Science for Business. O’Reilly Media. — comprehensive introduction to data science for business decisions, including practical coverage of segmentation and clustering in a business context.
- Vizard, S. (2019). “P&G shifts from targeting ‘generic demographics’ to ‘smart audiences’.” Marketing Week, July 30, 2019.
- Marketing Week (2019). “Why behaviour beats demographics when it comes to segmentation.” April 15, 2019.
Customer Lifetime Value Modeling
- Fader, P. S., Hardie, B. G. S., & Lee, K. L. (2005). “Counting Your Customers” the Easy Way: An Alternative to the Pareto/NBD Model. Marketing Science, 24(2), 275-284. https://www.brucehardie.com/papers/bgnbd_2004-04-20.pdf
- Fader, P. S. & Hardie, B. G. S. (2013). The Gamma-Gamma Model of Monetary Value. https://www.brucehardie.com/notes/025/gamma_gamma.pdf
- PyMC-Marketing: https://github.com/pymc-labs/pymc-marketing
- Online Retail Dataset (License: CC BY 4.0): https://archive.ics.uci.edu/dataset/352/online+retail
Part 5: Commercial Analytics
Commercial Metrics
- Byron Sharp, How Brands Grow (2010) — the empirical case for penetration-driven growth.
- Tan, Steinbach, & Kumar, Introduction to Data Mining — chapter on association analysis for a deeper treatment of support, confidence, and lift.
Price Elasticity and Pricing Decisions
- Hoch, S. J., Kim, B., Montgomery, A. L., & Rossi, P. E. (1995). Determinants of store-level price elasticity. Journal of Marketing Research, 32(1), 17–29.
- Tellis, G. J. (1988). The price elasticity of selective demand: A meta-analysis of econometric models of sales. Journal of Marketing Research, 25(4), 331–341.
- Hermann Simon, Confessions of the Pricing Man (2015) — accessible treatment of pricing strategy, written by the founder of Simon-Kucher.
- The JCPenney “Fair and Square” pricing case — a cautionary tale about eliminating promotions without understanding reference price effects. Customers revolted not because the new prices were higher, but because the feeling of getting a deal disappeared.
Product Assortment Optimization
- Kök, A. G., Fisher, M. L., & Vaidyanathan, R. (2015). “Assortment Planning: Review of Literature and Industry Practice.” In Retail Supply Chain Management, Springer. — The comprehensive academic review of assortment optimization methods.
- Iyengar, S. S., & Lepper, M. R. (2000). “When Choice Is Demotivating: Can One Desire Too Much of a Good Thing?” Journal of Personality and Social Psychology. — The original “jam study.” Influential but contested.
- Scheibehenne, B., Greifeneder, R., & Todd, P. M. (2010). “Can There Ever Be Too Many Options? A Meta-Analytic Review of Choice Overload.” Journal of Consumer Research. — The meta-analysis that found the average choice-overload effect is close to zero, with high variability across contexts. Treat the paradox of choice as a hypothesis to test, not a universal law.
- Train, K. E. (2009). Discrete Choice Methods with Simulation, 2nd ed., Cambridge University Press. — The standard textbook for MNL and Mixed Logit. Free online at https://eml.berkeley.edu/books/choice2.html.
pylogit(Python) andxlogit(Python, GPU-accelerated) — accessible libraries for discrete choice modeling.
Demand Forecasting
- Hyndman, R. J., & Athanasopoulos, G., Forecasting: Principles and Practice (3rd ed.) — free online at otexts.com/fpp3. The standard reference for time-series forecasting. Chapters on ETS, ARIMA, and hierarchical reconciliation are particularly relevant.
- Nixtla open-source libraries: statsforecast, mlforecast, hierarchicalforecast — fast, scalable, well-documented. The unified
unique_id / ds / yformat is a genuine productivity advantage. - Makridakis, S., Spiliotis, E., & Assimakopoulos, V. (2022). M5 accuracy competition: Results, findings, and conclusions. International Journal of Forecasting. — The competition that demonstrated gradient boosting’s advantage over statistical models for large-scale retail forecasting.
- Ansari, A. F., et al. (2024). Chronos: Learning the Language of Time Series. — The architecture and pretraining approach behind Chronos 2. Relevant for understanding zero-shot forecasting.
- Olivares, K. G., et al. (2022). HierarchicalForecast: A Reference Framework for Hierarchical Forecasting in Python. — Theory and implementation of MinTrace, ERM, and other reconciliation methods.
- Fildes, R., & Goodwin, P. (2007). Against your better judgment? How organizations can improve their use of management judgment in forecasting. Interfaces, 37(6), 570–576. — The evidence on human override bias and FVA.
- GIFT-Eval (Salesforce Research). github.com/SalesforceAIResearch/gift-eval — A benchmark for fair comparison of Foundation Models on time-series tasks.
Part 6: Media Investment and Optimization
Attribution: Concepts and Limits
- Google. “About Data-Driven Attribution.” Google Ads Help.
- Berman, R. (2018). “Beyond the Last Touch: Attribution in Online Advertising.” Marketing Science, 37(5), 771-792.
- Dalessandro, B. et al. (2012). “Causally Motivated Attribution for Online Advertising.” ADKDD ’12.
MMM Fundamentals
- Jin, Y., Wang, Y., Sun, Y., Chan, D. & Koehler, J. (2017). “Bayesian Methods for Media Mix Modeling with Carryover and Shape Effects.” Google Inc. — Google’s paper on Bayesian MMM with hierarchical priors and geo-level data, the methodological foundation for Meridian.
- Chan, D. & Perry, M. (2017). “Challenges and Opportunities in Media Mix Modeling.” Google Inc. — Companion paper covering common pitfalls in MMM (multicollinearity, extrapolation, insufficient variation) and how to address them.
- Meridian — Google’s actively maintained Bayesian MMM library (successor to LightweightMMM), used in the walkthrough in this chapter.
- Robyn — Meta’s open-source MMM library using Ridge regression with hyperparameter optimization via Nevergrad.
- PyMC-Marketing — Bayesian MMM and CLV library built on PyMC, with flexible model specification and saturation/adstock transforms.
Budget Optimization
- Jin, Y., Wang, Y., Sun, Y., Chan, D. & Koehler, J. (2017). “Bayesian Methods for Media Mix Modeling with Carryover and Shape Effects.” Google Inc. — Describes the Bayesian MMM framework that underpins Meridian’s budget optimizer, including the posterior-based optimization approach.
- Chan, D. & Perry, M. (2017). “Challenges and Opportunities in Media Mix Modeling.” Google Inc. — Discusses practical challenges in MMM-based optimization, including extrapolation risks and multicollinearity.
- Meridian Budget Optimization Guide — Documentation for Meridian’s
BudgetOptimizer, including per-channel constraints and scenario analysis. - Fischer, M., Albers, S., Wagner, N. & Frie, M. (2011). “Dynamic Marketing Budget Allocation Across Countries, Products, and Marketing Activities.” Marketing Science, 30(4), 568–585. — Multi-level budget allocation framework with diminishing returns and cross-effects.
Calibrating MMM with Experiments
- Jin, Y., Wang, Y., Sun, Y., Chan, D. & Koehler, J. (2017). “Bayesian Methods for Media Mix Modeling with Carryover and Shape Effects.” Google Inc. — Google’s Bayesian MMM paper describing the prior calibration framework that Meridian implements.
- Vaver, J. & Koehler, J. (2011). “Measuring Ad Effectiveness Using Geo Experiments.” Google Inc. — The foundational paper on geo-lift testing methodology.
- Brodersen, K. H., Gallusser, F., Koehler, J., Remy, N. & Scott, S. L. (2015). “Inferring Causal Impact Using Bayesian Structural Time-Series Models.” The Annals of Applied Statistics, 9(1), 247–274. — Bayesian structural time-series approach to geo experiments, an alternative to DID.
- Meridian Calibration Guide — Official documentation for
roi_mpriors androi_calibration_period. - GeoLift — Open-source R package for designing and analyzing geo-lift experiments.