Add Continuous Time CLV Calculations #76

ColtAllen · 2022-11-11T15:26:15Z

Unrealistically high CLV estimations have been a common complaint with the legacy lifetimes CLV formulations. I believe these discrepancies are due to calculations being made in discrete rather than continuous time.

Follow this link for a primer on comparing summation to integration,:

https://math.stackexchange.com/questions/2089929/comparing-discrete-sums-and-integrals

Assume in the graphic below that the red line is our customer value function, and integrating over time will give us our total CLV for the customer:

The bars represent discrete value measurements at regular time intervals. Summing up the area of the bars is essentially what the existing customer_lifetime_value method is doing. However, note the corners of the bars protruding beyond the red line - this will inflate total CLV estimations; the wider the bar, the greater the inflation. This width is fixed to monthly intervals in the current customer_lifetime_value implementation; the freq parameter only reflects the time intervals in which data was aggregated for model training, which is daily for most use cases. In theory training models on weekly or monthly data would improve the accuracy of CLV estimates.

Fortunately continuous-time CLV expressions exist. In addition to accuracy they also have the advantage of summing over the total lifetime of the customer rather than a user-specified time period. However, implementation is model-specific.

An expression for the Pareto-NBD model is provided as equation (2) on page 8 of this paper:

http://brucehardie.com/papers/rfm_clv_2005-02-16.pdf

I've also found implementations for the Beta-Geometric/Beta-Binomial model and a few other models that haven't been added yet to btyd, but none for the BG/NBD model. I'll try reaching out to Fader himself on LinkedIn for assistance on this.

The text was updated successfully, but these errors were encountered:

CinelliGucci · 2023-03-03T16:10:28Z

Hi @ColtAllen,
first of all thank you for the work you are doing on this project.
Is there any news on the implementation of continuous-time CLV calculation? Did you find a formulation for the BG/NBD model?

Thanks!
Alfredo

ColtAllen added bug Something isn't working enhancement New feature or request labels Nov 11, 2022

ColtAllen self-assigned this Nov 11, 2022

This was referenced Nov 11, 2022

CLV value too high in btyd #73

Closed

Predicted LTV Too High CamDavidsonPilon/lifetimes#313

Closed

ColtAllen mentioned this issue Jan 10, 2023

Add utility to compute customer lifetime value pymc-labs/pymc-marketing#125

Merged

ColtAllen mentioned this issue Feb 24, 2023

Add ParetoNBDModel pymc-labs/pymc-marketing#177

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Continuous Time CLV Calculations #76

Add Continuous Time CLV Calculations #76

ColtAllen commented Nov 11, 2022

CinelliGucci commented Mar 3, 2023

Add Continuous Time CLV Calculations #76

Add Continuous Time CLV Calculations #76

Comments

ColtAllen commented Nov 11, 2022

CinelliGucci commented Mar 3, 2023