Customer Value Modelling: Model synthesis and extension proposals

The Customer Lifetime Value is a key concept in relationship marketing. This article develops, synthesizes and organizes formulas for computing this Lifetime Value in a progressive manner and continues a research direction started some years ago. Calculations are organized using a double taxonomy: the one of customer relationship behaviour models - retention and migration models - and the one of algebraic and matrix computing methods. The suggested formulas allow for explicit representation of recency censored customer migration processes, flexible integration of desynchronized financial flows and extend customer value optimisation procedures from the retention to the migration model.
Customer Value Modelling: Model synthesis and extension proposals Article to be published in Journal of Measurement and Targeting in Marketing, 11, 2002 Mihai Calciu, Associate professor, Université Lille 1, calciu@iae.univ-lille1.fr Francis Salerno, Professor, Université Lille 1, francis.salerno@iae.univ-lille1.fr

The Customer Lifetime Value is a key concept in relationship marketing. This article develops, synthesizes and organizes formulas for computing this Lifetime Value in a progressive manner and continues a research direction started some years ago. Calculations are organized using a double taxonomy: the one of customer relationship behaviour models - retention and migration models - and the one of algebraic and matrix computing methods. The suggested formulas allow for explicit representation of recency censored customer migration processes, flexible integration of desynchronized financial flows and extend customer value optimisation procedures from the retention to the migration model.

INTRODUCTION

INTRODUCTION
Relationship Marketing and Direct Marketing
Relationship Marketing and Direct Marketing

The development of a "customer oriented" interactive and database marketing brings to forefront, in most contexts, customer value and customer portfolio assessment methods. These methods have either been fine tuned in direct marketing and in catalogue sales or developed in certain sales force effort allocation models. Improvement possibilities for these methods are still numerous but the road towards such progress passes through a systematisation and unification effort.


Profitability of the customer relationship
Profitability of the customer relationship

From a customer attraction and retention perspective, a profitable customer is a customer whose incomes, generated during the commercial relationship, exceed with an acceptable amount costs supported to attract, satisfy and keep him. This amount is called customer value or lifetime value of the customer.


Need for customer value calculations
Need for customer value calculations

Customer value calculations help enterprises solve various fundamental problems like budgeting customer acquisition expenses, selection of recruiting media (the LTV is generally different according to the medias used), or of types of offer or distributing efforts between prospecting and preserving customers. Well conducted, LTV analysis can also help build a competitive strategic advantage (Jackson, 1996).


Dichotomies of the customer's relationship behaviours
Dichotomies of the customer's relationship behaviours

The modelling of customer value depends on the context and on the customer's relationship behaviour (whether the situation is contractual or not). A brief analysis shows that modelling efforts are based on the distinction between customer retention and migration.


Research objective
Research objective

This study of existing models shows also that, in a first set of contributions, the mathematical developments are algebraic and focus on retention models while, in the another set, they are based on matrix approaches and applied mainly to the migration model.

The need to desynchronise financial flows (expenses and gains) in the customer relationship, is also put forward by these contributions.

Based upon results emerging from a broad examination of these models, this article guides towards systematisation and unification. It finds elements that are common and identifies those that are not. It adds components that seem absent from the other approaches. In this way, a set of formulas that complete existing solutions is developed and adapted.


Research Framework
Research Framework

These, existing or new, formulas are grouped in a progressive customer value calculation framework. It is presented schematically in the figure 1 and in detail in the table of the appendix 1.

Figure 1. – Modelling customer value, a research framework

The algebraic approach and the matrix approach are treated apart. Formulations progressively integrate transactional flows expressing probabilities to pass orders, financial flows composed of gains and expenses and customer and prospect value optimisation procedures based upon long-term calculations.

The use of certain representation artefacts (migration trees and diagrams, transition probability matrixes) facilitates the development of some recursive algebraic formulas for LTV calculations.

Several additional aspects concerning dynamics of customer migration, like right and left censored migration processes or mechanics of purging the customer list, are treated explicitly.


Retention, migration and mathematical approaches of customer value

Retention, migration and mathematical approaches of customer value
Some works on customer value
Some works on customer value

Dwyer (1989) presents examples of LTV calculations drawn from the two major contexts that were distinguished by Jackson(1985) in the industrial buying environment: buyers of type "always a share" and buyers of type "lost for good".

On this occasion Dwyer (1989) reveals the more general character of this typology, extends it to the consumer market, and shows that the "always a share" behaviour, that he associates to the "migration model", is generally representative for catalogue buying; while the "lost for good" behaviour, that he associates to the "retention model", is representative for financial services, press subscriptions, etc.


Algebraic approaches
Algebraic approaches

Berger and Nasr (1998) use this distinction between customer retention and migration and present mathematical LTV calculation formulas for four situations implying retention models and for one context requiring a migration model.

Blattberg and Deighton (1996) build a model that computes "customer equity" and finds the optimal balance between customer acquisition (investments to convert prospects into customers) and customer retention (investments to convince active customers to continue to buy) expenses. This model uses an infinite time horizon (long term) and LTV formulas are therefore simple, easy to use and avoid all summing operations that limited time horizon (life time) models suppose. The concept of customer equity is further developed in Blattberg, Getz and Thomas (2000).


Matrix approaches
Matrix approaches

Building upon these works of Berger and Nasr (1998) and of Blattberg and Deighton (1996), Pfeifer and Carraway (2000) apply Markov chains to customer relationship modelling and LTV calculations. They insist on the flexibility of models based on Markov chains and show that all situations modelled by the previous authors can be solved with Markov chain models. While the previous contributions are limited to recency based customer value models, these authors illustrate the link between Markov chains and the RFM (Recency, Frequency and Monetary) modelling framework that dominates industry practices in Direct Marketing and Catalogue sales.

Some developments that are more specific to these sectors can also be found in Birtran and Mondschein (1996, 1997).


Comparison of works
Comparison of works

A comparative analysis of these contributions brings several points to the forefront, which underline the need of a systematisation and unification effort:

  • Berger and Nasr insist on the desynchronisation of financial flows (expenses and gains) during a customer relationship. They adopt an algebraic approach that they apply essentially to the retention model and neglect the migration model to a certain extent.

  • Blattberg and Deighton's developments are also algebraic. They are limited to the retention model, insist on customer value optimisation in a long-term perspective and focus on prospects.

  • The matrix solutions of Pfeifer and Carraway are applied primarily to the migration model. They adapt some long-term actual value calculations from Blattberg and Deighton to the matrix approach, but the optimisation procedures adopted by them are different (specific to Markov decisions processes).


The nature and mechanics of the customer's individual response

The nature and mechanics of the customer's individual response
Dichotomies of the customer relationship behaviour
Dichotomies of the customer relationship behaviour

The two types of temporal behaviour of customers in a relation with a company (that have been called differently according to authors: "lost for good"/"always has share" for Jackson, "retention model"/"migration model" for Dwyer or " contractual" / "non-contractual" for Reinartz and Kumar), display different migration patterns.


Particularities of the retention model
Particularities of the retention model

The retention model considers that a person or a company remains a customer as long as she generates transactions. This means that if at some given moment the customer doesn't renew his contract or doesn't generate any transaction he can be considered as "lost for good" or as an "ex-customer". It also means that if an "ex-customer" buys again he is considered as a new customer and one deals with an acquisition rather than a customer retention issue. The evaluation of customer potential in a relation of this kind will only take into account the customer's probability to remain active from one period to an another or what Blattberg and al. call the customer's survival probability. The customer lifetime corresponds to the number of successive periods during which the customer is and remains active.


Particularities of the migration model
Particularities of the migration model

The migration model considers that customers can re-appear (turn up again) after some periods during which they didn't make transactions and traces their probability to "reactivate". The evaluation of the customer's potential in such a relationship relies on the joint probabilities of remaining and becoming again an active customer after a fixed number of periods or in other words on the " survival " and "reactivation " probabilities. The lifetime of the customer corresponds to the number of successive periods during which the customer, according to the company's estimations, either remains active or reactivates. For a customer to preserve this status after several periods of inactivity his probability to reactivate, as estimated by the company, must to be superior to the prospects' response probability. Alternatively the customer is ticked off the customer database and he becomes an ex-customer or a "purge" if one adopts a certain terminology used by the direct marketing profession.


Distinction between transaction- and financial flows
Distinction between transaction- and financial flows

In order to encourage a progressive assessment of customers' potential and to facilitate the resolution of problems with increasing complexity it is convenient to distinguish transactional and financial flows that were generated by customers during the number of periods considered. The first will help estimate the number of transactions incurred by a customer during the relationship with the company and the others will facilitate profitability calculations for the customer relationship.


Probability to generate transactions and transaction potential
Probability to generate transactions and transaction potential

The status of a customer is given by an initial transaction. In order to asses transactional flows it is appropriate to count the transactions generated during several periods or business cycles by a number of initial transactions t0 or customers. Let's note tj the number of transactions generated by t0 customers or initial transactions after j periods. The customer's response probability after j periods is given by the number of transactions generated after j periods divided by the number of initial transactions or t j when t 0 = 1. The transaction potential of a customer will then be given by the expected number of transactions generated during the lifetime of a customer. This corresponds at each period to the sum of probabilities to generate a transaction: Tj =


The Retention Model

The Retention Model
Introduction to the Retention model
Introduction to the Retention model

The "lost for good" type of behaviour implies that the customer remains active until the moment he leaves the company. Simplified, this means that at each business cycle those who remain customers are going to generate a transaction.


The general case (variable retention probability with time)
The general case (variable retention probability with time)

The management of subscribers to a consumer magazine represents this situation well and raises a generic problem of customer retention and attrition. The customer potential and the customer value come from individuals with whom the relation persists. In an example drawn from the consumer magazine subscription business, Dwyer presents a situation in which the probability to remain a customer by renewing the subscription increases slightly with time and becomes stable after some periods.


Numeric example
Numeric example

Dwyer gives a numeric example that is repeated by many of the other authors mentioned before. We summarize and adapt this example in order to reveal essential mechanisms of the retention model and to allow for evaluation of similar situations in other industries by decision calculus. It deals with a company in the magazine subscription business who evaluates the transactional potential and the value of its customers for an estimated lifetime of five periods. The margin by retained customer is 40$ of which 20$ are incomes from subscriptions and 20$ are advertising revenues. The marketing costs by retained customer are 28$ and the discount rate is 20%.

Table 1 – The magazine subscriptions case (growing retention rate)

Periods
from
acquisition

Retention
rate

Survival
rate

Expected
number
of active
customers

Profit per
customer
and period

Discounted
profit per
customer
and period

Total
discounted
profit per
period

 

 

(P 2)

(n*3)

(m-c)

(5/(1+d)^1)

(4*6)

1

2

3

4

5

6

7

0

1

1

1000

12

12

12000

1

0.71

0.708

707.701

12

10

7077

2

0.79

0.561

560.602

12

8

4485

3

0.83

0.463

463.325

12

7

3243

4

0.84

0.389

389.395

12

6

2336

Lifetime

Transactions

3,121

3121

 

Value

29141


Formalisation of calculations
Formalisation of calculations

The survival probability at each period j is , the product of probabilities to remain a customer during all those periods. It indicates the expected number of transactions per customer on each period (tj).

The cumulated expected number of transactions per customer during the analysed periods is the sum of survival probabilities: Tj = . In order to facilitate customer value calculation we introduce the notion of discounted transactions [1] . The expected and discounted number of transactions during a customer lifetime is given by the discounted sum of his survival probabilities: Tja = , where a is the discount rate and j is the lifetime of a customer expressed number of periods. In the example, the expected number of transactions during the lifetime of a customer is 3.12 (see Figure 1.) and its discounted version is 2,43.

The expected value of the customers (Vj) is the profit of a transaction multiplied by the discounted expected number of transactions: .


The case where retention probabilities remain constant

The case where retention probabilities remain constant
Particularities of models with constant retention rate
Particularities of models with constant retention rate

In mature market situations one can suppose that for a constant marketing effort purchase probabilities remain constant from one period to another. Under these circumstances formulas are simplified and make it easier to estimate long-term evolutions. As a result survival probabilities can be expressed as powers of the constant retention probability and no more as a product of variable probabilities.


Numeric Example
Numeric Example

Dwyer's example adapted to a situation with constant retention probability leads to the following calculations (see table 2).

Table 2 – The magazine subscriptions case (constant retention rate)

Periods
from
acquisition

Retention
rate

Survival
rate

Expected
number
of active
customers

Profit per
customer
and period

Discounted
profit per
customer
and period

Total
discounted
profit per
period

 

 

(2..)

(n*3)

(m-c)

(5/(1+d)^1)

(4*6)

1

2

3

4

5

6

7

0

1

1

1000

12

12

12000

1

0.8

0.8

800

12

10

8000

2

0.8

0.64

640

12

8

5120

3

0.8

0.512

512

12

7

3584

4

0.8

0.41

409.6

12

6

2458

Lifetime
Long term

 

Transactions

3362
5000

 

Value

31162
36000


Formalisation of calculations
Formalisation of calculations

The survival probability after j periods is , the probability to remain customer to the power of the number of periods.

The sum of survival probabilities gives the transaction potential of a customer which is here = 3,36. The discounted transaction potential is then that is =2,6.

The net present value of a customer results from the profit of a transaction multiplied by the discounted transaction potential: that is (40$ - 28$) 2,6 = 31,2 $


"Long-term" customer value

"Long-term" customer value
Use of the long-term analysis
Use of the long-term analysis

When purchase probabilities remain constant from one period to the other, it is convenient to compute the long-term present value of a customer. The long term indicates a hypothetical situation where the customer's lifetime would be endless. Computing the long-term transaction potential and customer value is not of great interest in itself. It helps optimise marketing efforts: avoiding to spend too much in customer retention programs with no budget constraints or finding the optimum retention budget when budget constraints exist.


Transactional Potential and long-term customer value
Transactional Potential and long-term customer value

It can be shown that the long-term transaction potential is 1/(1-p). On the long term, the discounted transaction potential will then be = (1+a)/(1+a-p).

For example if the (constant) retention probability is 0.8 then the long-term transaction potential of a customer is 1/(1-0.8) = 5. This means that if the lifetime of a customer lasts forever that customer would generate 5 transactions.

The discounted transaction potential is (1+0.2)/(1+0.2-0.8)=3 and the customer's present value will be (m-c) (1+a)/(1+a-p) or (40$-28$)*3 = 36$.


Customer's Long-term and Lifetime values
Customer's Long-term and Lifetime values

If one remembers that the discounted transaction potential of a customer was 2,6 transactions and the customer's present value was 31,2$, it becomes clear that these long-term measures must be seen as limits towards which tend the cumulated transactions and the customer value.

These limits offer a fast computing instrument that avoids fastidious summations and produces some indicative values of the transaction potential and of the customer.


Optimisation of the long-term present customer value

Optimisation of the long-term present customer value
Retention probability: as a function of customer oriented expenses
Retention probability: as a function of customer oriented expenses

Up to this point the calculations have used given retention rates. It is yet justified to consider that the company can influence retention rates through marketing actions oriented towards its customers. This supposes to express the customer retention probability as a function of the marketing effort (budget) directed towards the customers during each period. The retention probability expressed as function of the retention budget increases with a growth rate that decreases progressively towards the end when the retention probability reaches its ceiling (see figure 1). A simple exponential function like pr = ceiling * (1-exp(b * R)) can be estimated by decision calculus using subjective estimations from mangers. Numeric example


Finding the budget that maximizes the customer's present value  
Finding the budget that maximizes the customer's present value  

The optimum retention budget for a customer R can be found by maximising the present long-term customer value, that is max(m-R/pr ) (1+a)/(1+a-pr ) where pr = f(R)

Figure 2 - Maximization of the long-term customer Value (without the acquisition stage)


Numeric Example
Numeric Example

In the example with a 23$ retention budget R by customer the company achieves a 0.8 retention rate, this corresponds to 28$ by retained customer and gives a long-term present value of the customer of (40-28)*(1+0.2) / (1+0.2-0.8)=36$. If the response to the retention budget is expressed by the function pr =0.81(1-exp(-0.16*R) one can see that this result is not an optimum one.

In such a situation, where the retention rate is limited upwards to 0.81 and marks an accelerated growth (coefficient of elasticity 0.16) with an effort of only 7$ per customer a retention rate of 0.785 is obtained. This results in 8,9$ per retained customer. The long-term value becomes then (40-8,9)*(1+0.2) / (1+0.2-0.785)=90$ which is optimum.


Finding the balance between customer acquisition and retention

Finding the balance between customer acquisition and retention
Acquisition and Retention Probabilities as functions of prospect/customer oriented expenses
Acquisition and Retention Probabilities as functions of prospect/customer oriented expenses

In order to have the complete picture of the customer relationship profitability, one cannot ignore the customer acquisition stage that transforms prospects into customers. Blattberg and Deighton (1996) call the measure of customer profitability "customer equity". By putting the problem in a prospect's[2] perspective they suggest a model that makes it possible to optimise customer acquisition and retention.

and = p a [m-c a +(m-c r ) *p r /(1+a-p r )]

Following the reasoning previously applied to retention costs, customer acquisition costs ca can be expressed as expenses by prospect divided by the acquisition probability pa, which itself is a function of these expenses. Numeric example.


Numeric Example
Numeric Example

Insuring customer relationships profitability reduces to successive optimisation of the customer acquisition and retention value.

Max[p a m - A + p a (m p r - R)/(1 + a - p r )]

where a = f(A) p and r = f(R)

The optimisation procedure, that has been adapted from Blattberg and Deighton, finds first prospect expenses that maximize the acquisition value (p a m - A) of a customer and then uses the resulting acquisition rate (p a ') in order to compute the retention value (p a ' (m p r - R)/(1+a-p r ) ). The maximum customer value is the sum of the optimum acquisition and retention values calculated in this way. Due to its simplicity and conciseness this model is well adapted to decision calculus.


Numeric Example
Numeric Example

In order to set the Blattberg and Deighton problem in the dual perspective of customer acquisition and retention, it is first necessary to express the acquisition probability as a function of acquisition expenses by prospect, for example pa = 0.4 (1-exp(-0.1 * A)). The ceiling acquisition rate (probability) and its elasticity should be lower then their corresponding parameters in the retention rate function. The optimised values can be found graphically (see in Figure 2).

With a 5$ budget per prospect a 0.16 response probability (acquisition rate) is reached. It gives the best acquisition value for a prospect 0.16*40-5 = 1,53 $ and can be read on the negative scale in figure 2. This acquisition rate enters the calculation of the retention value which becomes maximum when retention expenses by customer are 7$ and result in a retention rate of 0.78. In this way, the maximum retention value that can be achieved per prospect is 9,62$. The total value of a prospect is the sum of his acquisition and retention values 1,53$+9,62$= 11,16$. The customer value can be easily derived here from by dividing the prospect value by the acquisition rate (probability) 11,16$/0.16 = 68,32 $. The result is coherent with the 90$ customer value obtained when acquisition costs were ignored.

Figure 3 - Maximization of the customer value (with the acquisition stage)


Desynchronisation of the financial flows in the retention model

Desynchronisation of the financial flows in the retention model
Synchronized flows
Synchronized flows

In their survey of retention models Berger and Nasr (1998) pay a special attention to the distinction between the moments when input (gains) and output (expenses) flows intervene. In the previous examples we considered that gains (m) and promotional expenses (c) intervened at the same moment during a business cycle where from the formula .


Desynchronisation of flows and commercial cycle amplitudes
Desynchronisation of flows and commercial cycle amplitudes

A more attentive analysis of the alternation of gains and expenses of a company during the life of a customer, indicates the following timing: If acquisition costs that occur in the previous cycle are ignored, the customer brings gains at the beginning of the cycle and the company makes its promotional expenses at the middle of cycle in order to assure repeat purchase. Cycle lengths can be equal, lower or superior to one year. There can be different cycles for gains and for expenses.


Formalisation and numeric example
Formalisation and numeric example

The following formula generalizes and regroups several formulas suggested by Berger and Nasr (1998), in order to deal with various situations:

Vj = -

where cm and cc are the number of cycles per year for gains and for expenses.

Table 3 - Cases that illustrate commercial cycle particularities and the desynchronisation between the financial input (gains) and output (expenses) flows in different  industries

Case

Insurance (1)

Health club (2a)

Car leasing(2b)

Gains (m)

260

125

7000

number of gain cycles per year (nm)

1

2

0,33

Expenses (c)

50

25

95

number of expense cycles per year (nc)

1

2

1

Retention probability(p)

0,75

0,8

0,3

Number of years (j)

10

4

12

Discount rate (a)

0,2

0,2

0,2

Customer value (V)

569

355

8273

Berger et Nasr (1998) give detailed calculations for the three cases summarised here and for two other cases presenting situations where profits per retained customer aren’t constant but variable with time (case 3) and situations where financial flows aren’t discrete but continuous (case 4).


The Migration Model

The Migration Model
Mechanics of retention/reactivation - a series of artefacts
Mechanics of retention/reactivation - a series of artefacts

The "always a share" type of behaviour supposes that the customer doesn't have to be active during all the analysed period and that he can reactivate with a probability that increases with recency of the last transaction[3].

In order to get a better understanding and use of retention/reactivation mechanisms and of stochastic processes that characterize this kind of behaviour, several artefacts like decision trees, migration trees, diagrams and matrixes of transition probabilities are used. They facilitate the development or adaptation of a series of algebraic and matrix formulas.


Purchase profile (pattern) and the decision tree
Purchase profile (pattern) and the decision tree

With time customers have purchase patterns that alternate active and inactive periods. A prolonged inactivity can lead to elimination from the customer list. The progressive shaping of purchase profiles is visible in figure 4:

Figure 4 - Stochastic decision tree and purchase patterns

By regrouping customer purchase patterns at each stage according to recency criteria, by first separating the active customers from those that are not active and by segmenting afterwards the inactive ones according to the recency of their last purchase a customer migration tree is obtained. It is a key representation form for understanding transition mechanisms in this model.


The migration tree and the recurrence of the transition process
The migration tree and the recurrence of the transition process

The migration tree shows particularly well (see Figure 5.) the recurrent character of the transition process to which the customer is submitted. This facilitates the development of powerful and flexible algebraic formulas for computing the transactional and financial flows associated to the migration model.

Figure 5 - Customer migration tree


Formulas for a migration process censored in relation to the recency
Formulas for a migration process censored in relation to the recency

The expected number of transactions generated by a customer after j periods is the sum of reactivation probabilities of transactions initiated by the same customer in the previous periods. It is a recurrent expression that has as a starting point the first transaction, the one that turns a prospect into a customer:

This formulation is more general and more concise than the formulas given by Berger and Nasr (1998, eq. 10. p.25) and Dwyer (1997, p. 12). It integrates a recency limit, noted R that enables right censoring of the migration process and makes this expression and the other ones that have been derived there from comparable to the matrix formulations suggested by Pfeifer and Caraway (2000).

Censorship according to recency is largely used by the industry. When a segment or a customer crosses this recency limit, his response probability becomes lower or equal to the prospects' one. Consequently there are no more reasons to consider that individual as a customer and by convention his reactivation probability (PR) is fixed to zero.


Numeric Example
Numeric Example

Table 4 illustrates these aspects. Calculations are inspired from Dwyer’s example, which has been slightly modified in order to facilitate estimating the reduction of response probability with recency by decision calculus.[4]

A company is forecasting the behaviour of 1000 customers, knowing their purchase probability according to the recency of the last purchase. The marketing margin m is 40$ by transaction and promotion expenses are 4$ by customer. For simplicity reasons these values are kept constant.

Table 4 – The catalogue sales case

Periods
from
acquisition

Response
probability
and recency

Survival
probability

Reactivation
Probability

Purchase
probability

Number of
remaining
customers

Number of
purged
customers

Expected
number of
active
customers

Profit per
customer
and period

Discounted
profit per
customer
and period

Total
discounted
profit per
period

 

 

(2..)

(5-3)

 

(n-7)

 

(5*n)

(m*5-c(1-7))

(9/(1+d)^1)

(6*10)

1

2

3

4

5

6

7

8

9

10

11

0

1

1

0

1

1000

0

1000

36

36

36000

1

0,3

0,3

0

0,3

1000

0

300

8

6,667

6667

2

0,182

0,09

0,127

0,217

1000

0

217

4,69

3,26

3260

3

0,11

0,027

0,14

0,167

1000

0

167

2,66

1,542

1542

4

0,067

0,008

0,123

0,131

524,67

475,33

131

3,13

1,51

792

Lifetime

1,659

1,425

0,39

1,815

4525

475,33

1815

54,5

48,98

48979

The table shows that with a migration model the retention probability is generally low and that companies rely less on the survival probability of a customer than on his reactivation probability in order to boost sales and to increase the customer value.


Comments
Comments

The calculation of the purchase probability in column 5 uses the above-mentioned formula in order to illustrate this recurrent retention and reactivation process: t1 = [1*0.3] = 0.300. t2 = [t1*0.3 + t0*(1-0.3)*0.182] = 0.217 etc. This means that an initial transaction t0=1 generates 0.300 transactions after one period and 0.217 transactions after two periods. In the second period the 0.217 transactions come from the two evoked sources: t1*p1 = 0.9 expresses the retention or the survival and t0*(1-p1)*p2 = 1*(1-0.3) *0.182 = 0.127 expresses reactivation of customers that were inactive during the first period. In the fifth period the whole process rests essentially on the reactivation process, as the survival probability diminishes (0.002); at the same time intervenes the censorship of customers who passed the recency limit (R) because they haven't reactivated since 5 periods (their reactivation probability becomes too small).


Formula for a left censored migration process (the starting recency)
Formula for a left censored migration process (the starting recency)

The notation tj expresses a customer migration process that starts with a number of manifest customers that is customers that have just ordered or in other words customers for which the recency of the last order is one. The starting number of customers or the number of generating transactions can be noted tr,j. In a customers database important numbers of latent customers or temporarily inactive customers are kept whose recency of the last order is greater than one. The transaction potential of these customers needs to be explicitly considered in certain situation and justifies extending lifetime value calculations to customers' recency greater than one.

In this formula transactional flows with recency greater than one (tr,j, r>1) are expressed as a function of recency one transactional flows. Notations tj and t1,j are equivalent. It is a generalization of the previous formula that accepts left censored migration processes.


Numeric example
Numeric example

A customer with recency two, for example, will have a zero purchase probability at the beginning (column 5 in table 4), in period two his probability to reactivate is 0.18, in period three the purchase probability of 0.145, will be composed of the survival (remain active) probability of 0.055=0.18*0.3 and the probability to reactivate with recency three 0.09=0.92*0.11. After five periods (0 to 4) the purchase probability or the expected number of generated transactions t2,4 becomes 0.062 which is naturally less than t1,4 =0.131 for a customer whose migration process started with the active state (recency 1). The total number of generated transactions T2,4 during the 5 periods becomes 0.5 as compared to T1,4 =1,815. Thus, a customer of recency two can still bring a transaction out of two during the five following periods, while an active customer can bring 2 transactions.

This algebraic formulation of transactional flows in a migration model is equivalent with the matrix formulas suggested by Pfeifer and Carraway and makes a direct link to matrix approach presented in the next section.


Financial flows and customer value calculations
Financial flows and customer value calculations

From the number of transactions incurred during each period following the launching of a relationship-marketing program, it is possible to calculate the customer's profitability for the estimated duration of the relationship that is the Lifetime Value. The separation of input (gains) and output (promotion expenses) financial flows in the migration model is needed not only in order to mark the desynchronisation between the two flows (central topic in Berger and Nasr's work) but also in order to reflect the different transactional basis that applies to the two flows.


Customer Value - limits of some formulations
Customer Value - limits of some formulations

By definition, the migration model considers as a customer every individual having ordered already and not having yet exceeded the pre-established recency limit. This implies that all customers defined in this way are eligible for promotional activities conducted by the company during several periods even if they were not active in the period that precedes these actions. The marketing margin comes only from the active customers while promotional costs are incurred by default with all eligible customers and not only with the active ones as seems to suggest the formula developed by Berger and Nasr [5] . By eliminating the desynchronisation aspect in the timing of gains and expenses, their formula can be reduced to . It reflects a very particular case where promotion costs apply only to the active customers, which is a reduction, compared to the calculations presented by Dwyer, which the authors have followed.


Limited use of calculations adapted from the retention model
Limited use of calculations adapted from the retention model

For clarity reasons the discount rate is set to zero. The formula becomes then (m-c) or again (m-c)Tr,k . The customer value that results (40$-4 $) *1,815 = 65,34 $, is largely greater than the value in the general case that is presented in the table 4. As might be observed the Berger and Nasr formulation is limited to a very specific situation. Rare are the industries that can count on the spontaneous reactivation of customers without having to make the marketing efforts to stimulate this process.


Calculations adapted to the standard migration model
Calculations adapted to the standard migration model

Companies spend money in order to maintain the retention and reactivation process described by the migration tree. The simplified formula becomes then (m t k - c) or m Tj - c*j. By applying it to the data given in table 4, the customer value after 5 periods becomes 40 $* 1,815 - 4$*5 = 52,4 $.

Censorship is needed for the economic reasons because, if one continues to engage in promotional actions towards all members of a cohort, costs by active customer increase up to a point where the margin, even if it remains constant, is not enough anymore to cover these costs (see column 9).

With censorship included, the customer value formula is slightly modified to become

[(m tk - c(1-qk ))] or m T k -c R+ [c(1-qk )], because when the period j arrives to the recency limit R=4 customers that didn't reactivate are purged (excluded). The customer value becomes 40$*1,815-4$*4-4$*(1-0.475)=54,5 $.


The customer value formula for the standard migration model
The customer value formula for the standard migration model

The customer value formula suggested here allows for residual value calculations for customers that are inactive at the assessment time (left bound censorship), takes into account the promotional costs that are maintained by default during the lifetime of a customer and integrates customer elimination.

where: = Value of a customer with recency r = expected transactions generated by a customer of recency r in period j; m = net margin on a transaction; c = mailing cost; pl = response probability of a segment of recency l; R = recency limit beyond which a customer is ticked off the data base; a = discount rate; qk = cumulated probability for a customer to be eliminated (purged) after k periods.

This calculation cumulates the present gains produced by transactions that were generated during the projection period and systematically deducts mailing costs. It supposes that mailings are sent to customers only if they are maintained in the database, which is a standard marketing policy in a migration model.


Modelling the customer elimination process
Modelling the customer elimination process

The formula also integrates the modelling of the elimination process for customers that exceed the recency limit. Elimination intervenes when there is a succession of non-response (no-purchase) periods equal to the recency limit.

For an active customer (recency 1) the probability to be eliminated intervenes after R periods and corresponds at each period k to the probability to generate a transaction after k-R+1 periods multiplied with the probability of the non-response succession. In the example in table 4, an active customer, can be eliminated starting with period 4 (the recency limit), with a probability of 0.475, that is the product of the probability to generate a transaction t 1,4-4 =1 and the probability of the no-answer succession (1-0.3)(1-0.18)(1-0.11)(1-0.06)=0.475.

For an inactive customer (r > 1) the probability to be eliminated intervenes in two stages. The first stage intervenes after R-r+1 periods. For a customer of recency 3 the first elimination intervenes after 2 periods (4-3+1). The second stage begins after R+1 periods (here 4+1=5).

The first stage eliminates customers that don't reactivate anymore with a probability that corresponds to the non-response succession between the recency of the customer and the recency limit. A customer of recency 3 has a probability to be eliminated at the first stage of (1-0.11)(1-0.06)=0.83.

The second stage begins with the reactivation of customers from a cohort with given recency and continues up to the elimination of all customers from this cohort.

The probability to be eliminated at each period k is the product of the probability to generate a transaction after k-R+1 periods that multiplies the probability of the non-response succession. A customer of recency 3 will have in period 5 a probability to be eliminated of 0.0524 = 0.11*0.475. It is the product of the probability to generate a transaction t3,5-4 =0.11 and of the probability of the non-response succession 0.475.


Matrix approach

Matrix approach
Introduction
Introduction

The matrix approach suggested by Pfeifer and Carraway (2000) suites both retention and migration models. While for the retention models, the algebraic formulations are simple and straightforward enough, for the migration model the matrix approach represents a flexible and elegant alternative to the algebraic formulas.


Transition probabilities Matrix and its derivatives
Transition probabilities Matrix and its derivatives

Table 5 – Transition probabilities matrix and its transformations

a) After one period

b) After 4 periods

c) Cumulated : periods 0 to 4

d) Cumulated : long term 

 

r1

r2

r3

r4

r5

r1

0.3

0.7

0

0

0

r2

0.182

0

0.818

0

0

r3

0.11

0

0

0.89

0

r4

0.067

0

0

0

0.933

r5

0

0

0

0

1

 

r1

r2

r3

r4

r5

r1

0.131

0.117

0.124

0.153

0.475

r2

0.065

0.081

0.083

0.093

0.679

r3

0.031

0.029

0.053

0.056

0.83

r4

0.011

0.01

0.011

0.034

0.933

r5

0

0

0

0

1

 

r1

r2

r3

r4

r5

r1

1.815

1.179

0.869

0.662

0.475

r2

0.507

1.31

1.005

0.82

1.358

r3

0.276

0.171

1.116

0.946

2.49

r4

0.113

0.071

0.05

1.034

3.732

r5

0

0

0

0

5

 

r1

r2

r3

r4

r1

2,103

1,472

1,204

1,072

r2

0,675

1,472

1,204

1,072

r3

0,357

0,250

1,204

1,072

r4

0,141

0,099

0,081

1,072

P

P4

The transition probabilities matrix P in table 5a summarizes the migration process. It expresses transition probabilities, between two periods, from a given recency state either towards recency one (buying customer) or towards a recency state incremented by one (customer who doesn't buy). According to Markov Chain theory the same matrix P to the power 4 (table 5b) indicates transition probabilities after 4 periods. If after one period transition possibilities from a given recency state are limited to two states: recency one and the given recency incremented by one, after two periods possible transitions include three states and after four periods all the five states become accessible. The sum of the P matrixes to powers incremented from 0 to 4 (table 5c) contains the cumulated probabilities during five periods. It should be noticed that the starting period is considered equal to zero and that matrix P to the power zero corresponds to the identity matrix. It can be shown that on the long-term the sum of the probability matrixes P at incremented powers tends on the long term towards the inverse matrix of the difference between the identity matrix and P (table 5d).


Transactional flow
Transactional flow

In order to obtain a strict equivalence with the algebraic formulas presented before and summarized in the table 6., we multiply the matrixes in table 5 with the vector noted 11, that has the following shape [1, 0. 0. . .]. In this way the expected number of transactions vector is extracted. It consists of first column elements of the transition probabilities matrix and of its transformations.

Table 6 - The vector of the expected number of transactions in the transition probabilities matrix and its derivatives

a) After one period

b) After 4 periods

c) Cumulated : periods 0 to 4

d) Cumulated : long term 

 

t1,1

t2,1

t3,1

t4,1

r1

0,300

0,182

0,110

0,067

 

t1,4

t2,4

T3,4

t4,4

r1

0.131

0.117

0.124

0.153

 

T1,4

T2,4

T3,4

T4,4

r1

1.815

0.507

0.276

0.113

 

T1,n

T2,n

T3,n

T4,n

r1

2,103

0,674

0,357

0,141

T4 =

The vector t1 expresses the expected number of transactions that have been generated after one period by customers of different recency while the vector t4 expresses transactions generated after 4 periods. In the given example a customer of recency two can generate 0.182 transactions after one period and 0.117 after 4 periods. The transaction potential after 4 periods is contained in the vector T4 and the long-term transaction potential is contained in the vector Tn. In the given example a customer of recency two can generate 0.507 transactions during the four following periods and 0.674 transactions on the long-term. The sum of transition probabilities during several periods can be interpreted also as the expected number of periods spent in a state before transiting to the following state, the last state being the absorbing state. The line total of the long-term sum of probabilities matrix is the expected absorption time for the state represented by this line. In the example a customer in recency state one has an expected absorption time in the "ex-customer" state of 5,85 periods.


The customer value
The customer value

The value for the anticipated lifetime of a customer can be easily calculated by multiplying the discounted sum of probability matrixes with the rewards vector R. By considering gains and promotional expenses constant, the vector R looks like this: [m-c, -c, -c,..], meaning that customers of recency one bring gains in addition to promotional costs because they buy, while for customers of superior recency there are only costs.

 à

V1,4

V2,4

V3,4

V4,4

48,974

2,524

-0,714

-1,350


The long-term customer value
The long-term customer value

By applying the customer value calculation formula to Dwyer's example using a 5 periods time horizon (k=0 to 4) that seems to be this industry's estimated customer lifetime, a customer of recency one has a value of 48,97$, a customer of recency two has a value of 2,52$ and customers with a recency superior to two have negative values. The long-term calculations confirm these results.


The vector t1 expresses the expected number of transactions that have been generated after one period by customers of different recency while the vector t4 expresses transactions generated after 4 periods. In the given example a customer of recency two can generate 0.182 transactions after one period and 0.117 after 4 periods. The transaction potential after 4 periods is contained in the vector T4 and the long-term transaction potential is contained in the vector Tn. In the given example a customer of recency two can generate 0.507 transactions during the four following periods and 0.674 transactions on the long-term. The sum of transition probabilities during several periods can be interpreted also as the expected number of periods spent in a state before transiting to the following state, the last state being the absorbing state. The line total of the long-term sum of probabilities matrix is the expected absorption time for the state represented by this line. In the example a customer in recency state one has an expected absorption time in the "ex-customer" state of 5,85 periods.

Matrix approach : Customer value Optimisation

Matrix approach : Customer value Optimisation
Introduction
Introduction

By applying the logic used with retention models it is possible to represent the purchase probability with recency as a function of the marketing effort. To reduce complexity, let's suppose that by controlling the number of mailings sent to customers it is possible to modify the ceiling purchase probability leaving the other parameters unaltered.


Numeric example
Numeric example

In the example the function of purchase probability with recency was 0.3 exp(-0.5*recency-1), where 0.3 is the ceiling (pp) the function reaches when the recency is equal to one and the mailing costs per customer are 4$. If one spends more for mailings this ceiling can grow until a limit of 0.5 and if one spends less it can fall until 0. The ceiling purchase probability expressed as a customer cost function, is pp = 0.3 (1-exp(-0.5 * C)). When the budget is 4$ the ceiling is 0.3 as in Dwyer's example, when the budget is 1$ the ceiling becomes 0.1 and when the budget increases up to 10$ the ceiling approaches the limit of 0.5. Under these circumstances, finding the optimum cost per customer consists of maximising the long-term present value of a recency one customer: max V1 =  11 where R' = [(m-C/pp), - C / pp, - C / pp, - C / pp, 0] and p11 = pp = f(C), and P = f(pp, recency). The Marketing expenses c are C/pp; pp is a function of expenses per customer C and the purchase probability depends on the ceiling pp and the recency.


Optimisation Procedure
Optimisation Procedure

By applying this optimisation procedure to the given example it is optimum to spend 3$ per customer and not 4$. The graphical solution is presented in figure 6. Details of the calculations are given in appendix 2.

Figure 6 - Maximization of the long-term customer value for the migration model

It is easy to extend this reasoning to the acquisition of new customers and to find an optimum balance between customer acquisition and retention budgets. In this way the Blattberg and Deighton (1996) optimisation procedure is adapted to customer migration situations.


Optimisation using Markov decision processes
Optimisation using Markov decision processes

This logic seeks the best trade-off between long-term efforts and gains. For the migration model, other logics with shorter temporal horizons can also be envisaged. Pfeifer and Carraway (2000) present optimisation procedures extracted from the rich literature on Markov decision processes. Birtran and Mondschein (1996) formalize the direct marketing problem (that integrates a particular migration model) as a dynamic programming problem and develop optimisation heuristics adapted to this operations research problem. These approaches use RFM stratification and vary coverage and intensity of mailing campaigns in order to maximize the value of customers.


Optimising mailing policy without budgetary constraints
Optimising mailing policy without budgetary constraints

It can be shown that without budgetary constraints the optimal policy is to send mailings to all customer segments with positive Lifetime Value. As for monotonously decreasing response probabilities the customer value is also decreasing, when margins and mailing costs are constant, a breakeven response rate can be calculated for which the Lifetime Value is equal to zero. Using this property and the breakeven response rate, it is possible to calculate a profitability index from the response rate of each layer/segment.


Conclusions

Conclusions
Systematisation of calculations
Systematisation of calculations

The formulas for computing the economic value of a customer or the customer lifetime value that have been developed in this article are organized in systematic and progressive way according to a double taxonomy: the one of customer relationship behaviour models and the one of calculation methods. The retention/migration model dichotomy is based on strong theoretical foundations in consumer behaviour that have been underlined by the previous studies.

Besides the evoked behavioural characteristics, the comparative analysis of the two categories of customer relationship models reveals their economic substrata. In a migration model the retention probability is usually relatively low and companies rely less on the customer's survival probability than on his reactivation probability in order to increase sales and customer value.


Progressive modelling
Progressive modelling

The two calculation methods, algebraic and matrix based, are stepwise and progressively applied to both relationship behaviour models. On that occasion several stages are distinguished 1) a physical, quantitative level of transaction flows that are stochastically treated, 2) a monetary value level of financial flows, the temporal effects of which are studied, and 3) a decisions level the one of optimisation calculations. Calculations become in this way easier to compare and their formalisation in order to solve problems of increasing complexity becomes easier.


New algebraic formulas
New algebraic formulas

The algebraic formulas developed for the migration model represent an alternative to the matrix formulations. They make it possible to approach in an explicit manner the desynchronisation of financial flows and the censorship of migration processes on recency. Calculations to estimate eliminations from the customer list formalizes right censorship in a migration process; computing the probabilities to generate transactions for inactive customers formalizes left censorship. These aspects are only implicitly treated in the matrix approach of Pfeifer and Carrway (2000) and are absent from the algebraic approach suggested by Berger and Nasr (1998).


A matrix optimisation procedure
A matrix optimisation procedure

The optimisation procedure developed for retention models as well as the Blattberg and Deighton procedure that optimally balances marketing efforts between customer acquisition and retention have also been adapted to the migration model using the properties of long-term transition matrixes.


Application conditions
Application conditions

As shown by Mulhern (1999) customer LTV models cannot be applied to all situations. They require customers to have some persistent relations with enterprises and that financial flows (gains and expenses) can be forecasted with a certain precision at an individual level. When these conditions are not satisfied, the historic analysis of profitability can replace LTV calculations. Yet, under the combined and interdependent impulses from information technology progress and from the adoption of a customer centred marketing, situations for which models of LTV are applicable become dominant. This incites pursuing the customer evaluation and dynamic management models' systematisation and unification efforts.


REFERENCES

REFERENCES
REFERENCES
REFERENCES

Berger P.D. and N.I. Nasr (1998) "Customer Lifetime Value: Marketing Models and Applications." Journal of Interactive Marketing, 12, 1 (Winter), 17-30.

Birtran S, and S. Mondschein, (1996) "Mailing Decision in the Catalogue Industry", Management Science, Vol.42, No.9, September, 1364-1381.

Bitran S. and S. Mondschein (1997) "A Comparative Analyss of Decision Making Procedures in the Catalog Sales Industry," European Management Journal, 15, 2 (April), 105-116.

Blattberg, R.C. and J. Deighton, (1996) "Manage marketing by the customer equity test", Harvard Business Review, 74, 4 (July-August), 136-144.

Blattberg R., G. Getz and J. Thomas (2000) Customer Equity: Building and Managing Relationships As Valuable Assets, Harvard Business School Press, Boston

Dwyer F.R. (1989) "Customer Lifetime Valuation to Support Marketing Decision Making", Journal of Direct Marketing, 9, 1, 79-84.

Jackson, B.B. (1985), Winning and Keeping Industrial Customers: The Dynamics of Customer Relationships. Lexington, MA: Lexington Books.

Jackson, D.R. (1996) "Achieving Strategic Competitive Advantage Through the Application of the Long Term Customer Value Concept", Journal of Database Marketing, 4, 2, 174-186.

Mulhern, F.J. (1999) "Customer Profitability Analysis: Measurement, Concentration, and Research Directions", Journal of Interactive Marketing, 13, 1, 25-40.

Pfeifer Ph. E. et Carraway R.L. (2000) "Modeling customer relationships as Markov Chains", Journal of Interactive Marketing, 14, 2, 43-55.

Reinartz W. J et Kumar V. (2000) "On the Profitability of Long-Life Customers in a Noncontractual Setting: An Empirical Investigation and Implications for Marketing", Journal of Marketing, 64, October, 17-35.


Appendix 2: Matrix calculations for customer long term value optimisation

Appendix 2: Matrix calculations for customer long term value optimisation
Appendix 2: Matrix calculations for customer long term value optimisation
Appendix 2: Matrix calculations for customer long term value optimisation

Cost per customer

Transition probabilities

Rewards

Customer value

1€

 

r1

r2

r3

r4

r5

r1

0,102

0,898

0,000

0,000

0,000

r2

0,062

0,000

0,938

0,000

0,000

r3

0,038

0,000

0,000

0,962

0,000

r4

0,023

0,000

0,000

0,000

0,977

r5

0,000

0,000

0,000

0,000

1,000

 

R

r1

30

r2

-10

r3

-10

r4

-10

r5

0

 

r1

r1

14,872

r2

-22,223

r3

-16,915

r4

-9,489

r5

0,000

2€

 

r1

r2

r3

r4

r5

r1

0,184

0,816

0,000

0,000

0,000

r2

0,111

0,000

0,889

0,000

0,000

r3

0,068

0,000

0,000

0,932

0,000

r4

0,041

0,000

0,000

0,000

0,959

r5

0,000

0,000

0,000

0,000

1,000

 

R

r1

29

r2

-11

r3

-11

r4

-11

r5

0

 

r1

r1

16,127

r2

-22,722

r3

-18,008

r4

-10,335

r5

0,000

3€

 

r1

r2

r3

r4

r5

r1

0,248

0,752

0,000

0,000

0,000

r2

0,151

0,000

0,849

0,000

0,000

r3

0,091

0,000

0,000

0,909

0,000

r4

0,055

0,000

0,000

0,000

0,945

r5

0,000

0,000

0,000

0,000

1,000

 

R

r1

28

r2

-12

r3

-12

r4

-12

r5

0

 

r1

r1

16,479

r2

-23,725

r3

-19,385

r4

-11,313

r5

0,000

4€

 

r1

r2

r3

r4

r5

r1

0,300

0,700

0,000

0,000

0,000

r2

0,182

0,000

0,818

0,000

0,000

r3

0,110

0,000

0,000

0,890

0,000

r4

0,067

0,000

0,000

0,000

0,933

r5

0,000

0,000

0,000

0,000

1,000

 

R

r1

27

r2

-13

r3

-13

r4

-13

r5

0

 

r1

r1

15,853

r2

-25,324

r3

-21,110

r4

-12,452

r5

0,000

5€

 

r1

r2

r3

r4

r5

r1

0,341

0,659

0,000

0,000

0,000

r2

0,207

0,000

0,793

0,000

0,000

r3

0,125

0,000

0,000

0,875

0,000

r4

0,076

0,000

0,000

0,000

0,924

r5

0,000

0,000

0,000

0,000

1,000

 

R

r1

25

r2

-15

r3

-15

r4

-15

r5

0

 

r1

r1

14,242

r2

-27,557

r3

-23,211

r4

-13,765

r5

0,000

6€

 

r1

r2

r3

r4

r5

r1

0,373

0,627

0,000

0,000

0,000

r2

0,227

0,000

0,773

0,000

0,000

r3

0,137

0,000

0,000

0,863

0,000

r4

0,083

0,000

0,000

0,000

0,917

r5

0,000

0,000

0,000

0,000

1,000

 

R

r1

24

r2

-16

r3

-16

r4

-16

r5

0

 

r1

r1

11,687

r2

-30,422

r3

-25,694

r4

-15,255

r5

0,000

7€

 

r1

r2

r3

r4

r5

r1

0,399

0,601

0,000

0,000

0,000

r2

0,242

0,000

0,758

0,000

0,000

r3

0,147

0,000

0,000

0,853

0,000

r4

0,089

0,000

0,000

0,000

0,911

r5

0,000

0,000

0,000

0,000

1,000

 

R

r1

22

r2

-18

r3

-18

r4

-18

r5

0

 

r1

r1

8,260

r2

-33,885

r3

-28,542

r4

-16,915

r5

0,000