Estimating Buyer Willingness-to-Pay and Seller Reserve Prices from

advertisement
Estimating Buyer Willingness-to-Pay and Seller
Reserve Prices from Negotiation Data and the
Implications for Pricing
Robert Phillips
Graduate School of Business, Columbia University, New York, NY 10027 and Nomis Solutions, rp2051@columbia.edu
A. Serdar Şimşek
School of Operations Research and Information Engineering, Cornell University, Ithaca, NY 14853, as2899@cornell.edu
Garrett van Ryzin
Graduate School of Business, Columbia University, New York, NY 10027, gjv1@columbia.edu
We consider a company selling heterogeneous products where the final price is set by negotiation between
sales agents and customers. Negotiation is common in many industries including retail automotive sales,
consumer lending, insurance and many business-to-business markets. We assume that each sales agent has
a minimum reserve price that he/she will accept and each customer has a maximum willingness-to-pay;
however, these values are not directly observable. If the customer’s willingness-to-pay is greater than the
sales agent’s reserve price, then a sale takes places at the Nash bargaining equilibrium price, otherwise
no trade takes place. Given a record of the outcomes of a series of such negotiations, we would like to
estimate the underlying joint distribution of willingness-to-pay and reserve prices. To do so, we propose
and analyze a structural estimation method based on the expectation-maximization (EM) algorithm that
allows for the willingness-to-pay and reserve price distributions to depend on an arbitrary set of covariates
(e.g. characteristics of the customer, product or sales agent) and be potentially correlated. Using a realworld data set, we show, via out-of-sample tests, that our estimation method generates accurate estimates
of customer-price response and the prices of lost deals. We also show how our estimated distributions can
be used to optimize controls on negotiated prices that can significantly increase revenues relative to both
unconstrained negotiations and centrally-optimized fixed prices. These results provide a justification for the
policy commonly observed in practice of providing sales staff limited discretion to change prices relative to
a centrally-established price list.
Key words : negotiation; structural estimation; Nash bargaining solution; limited sales force price
discretion; consumer lending
1
2
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
1.
Introduction
Price negotiations are often associated with markets such as fine art and real estate in which the
items for sale are unique. However, negotiations are also found in many other markets in which the
items sold are identical or similar. Examples include retail automotive sales, insurance, consumer
loans, mortgages, and many business-to-business markets. In these markets, the price paid for
identical or similar items may be quite different depending upon the outcome of the negotiation.
Moreover, the seller and the customer may not be able to agree on a price, resulting in a lost
deal. Whether or not a deal occurs and, if it does, the price that results will depend upon the
willingness-to-pay of the customer, the reserve price of the seller and the negotiating power of each
party.
Understanding the variation of willingness-to-pay among customers is important to most firms
in order to segment customers and intelligently set prices for different products and customer segments. Estimating the willingness-to-pay distribution for a particular product within a particular
customer segment based on historical sales data is always challenging and a variety of approaches
have been developed. (See Breidert et al. 2006, for a good review.) Estimating price response in
cases where negotiation is involved poses two specific challenges. First, price information is typically
unobserved for lost deals, since sales staff may be unwilling or simply unable to accurately estimate the last price under consideration before a customer walked away. Second, when a trade does
take place, the resulting price is correlated with the seller’s reserve price as well as the customer’s
willingness-to-pay.
To address these challenges, we develop a structural estimation method to determine the joint
distribution of customer willingness-to-pay and seller reserve prices in a market with negotiation.
Specifically, we postulate a model of a heterogeneous sellers selling to heterogeneous buyers via
bilateral price negotiations. Each seller has a private reserve price, R, for each deal, and will not
sell at a price lower than R. Similarly, each buyer has a private willingness-to-pay, W , and will not
buy at a price higher than W . Hence, no sale will occur if R > W . If W ≥ R, we assume that the
parties will reach agreement and a sale will occur at the symmetric Nash bargaining equilibrium
price p = (R + W )/2. Given a data set on both completed and lost deals (and the prices at which
deals occur), we address the problem of how to estimate the parameters of the joint distribution of
R and W . We allow for the fact that the distribution of (R, W ) may depend on a set of covariates
describing the product, buyer and seller and that the values R and W may be correlated with each
other.
A novel feature of our model is the assumption of a reserve price distribution among sellers.
This reserve price distribution can reflect sales force heterogeneity; that is, a more skilled sales
agent should be able to extract a higher price from the same customer, which could manifest itself
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
3
as a higher reserve price. Additionally, sales staff might be more or less willing to accept specific
deals at lower prices based on idiosyncratic incentives, e.g. how close they are to meeting a sales
quota. Such differences would also be reflected as differences in reserve prices. If sales staff were
homogeneous in terms of their abilities and incentives to make deals, this would be reflected in
a common reserve price among similar deals – or a reserve price that is largely predictable as a
function of the characteristics of the deal. In our real-world auto lending data, this is not the case:
reserve prices appear to vary significantly among sales representatives for similar deals.
We consider three approaches to estimating the parameters of the joint distribution of R and W
given data on historical deals: direct maximization of the likelihood function and two variations
on the expectation-maximization (EM) algorithm, which is commonly used to estimate model
parameters given censored data. We show that the expectation-maximization algorithm is more
robust and generally provides faster convergence, particularly on large data sets. We therefore use
the EM algorithm to estimate the joint distribution of R and W as a function of customer and
deal characteristics for a large data set of auto lending deals.
Lastly, we illustrate how our estimated joint distribution can be used to improve negotiated
pricing and help compensate for heterogeneity in the sales force. We assume that “headquarters”
wishes to maximize expected profit across all deals and sets the parameters within which local
sales agents must work and compare three policies: centralized deal-specific optimal pricing, negotiated pricing with centrally set deal-specific minimum prices, and optimized seller reserve prices.
Applying these three policies to a real-world auto lending data set we find that optimized seller
reserve prices outperforms centralized deal-specific optimal pricing which outperforms negotiated
pricing with minimum prices. We discuss the implications of these findings in Section 6.
2.
Literature Review
Starting with Weinberg (1975), a number of researchers have considered the question of how much
pricing discretion (if any) should be granted to a sales force. A number of these papers have
assumed that local sales representatives have local market information that they can use to set
better prices than headquarters (Lal 1986). Mishra and Prasad (2004) show that, even when a
customer possesses private information, if the sales agent first observes the private information,
then signs the contract, a company can set prices centrally and capture the full benefits available
from discretionary pricing by using an appropriate contract structure – though compensation may
be a complicated function of quantity and price. They then analyze competitive markets and show
that, under asymmetric information, an equilibrium always exists where all firms use centralized
pricing, regardless of the intensity of competition (Mishra and Prasad 2005).
Joseph (2001) develops a model that explicitly incorporates the sales agent’s superior information
about customer willingness-to-pay and the possibility of sub-optimal trade-offs between price and
4
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
effort (using discounting rather than expending effort on selling). In his model, centralization of
pricing authority is sometimes preferable to full decentralization, depending on the effort cost of
following a high-quality prospecting strategy and the structure of customer segments. Bhardwaj
(2001) reaches a similarly mixed conclusion in the case of competing firms when principals and
agents have the same information. Roth et al. (2006) consider the case of bargaining versus fixed
(or posted) price for customized services and show that, if bargaining costs are low, bargaining
can be a preferred approach for the seller. Kuo et al. (2011) present a model that incorporates the
interactions among dynamic pricing, negotiation, and inventory and show that negotiation is an
effective tool to achieve price discrimination, particularly when the inventory level is high and/or
the remaining selling season is short.
However, all of these models have led to results concluding that either full pricing centralization
or full pricing decentralization is optimal (or showing the equivalence of the two for specified
contract structures). Yet we observe in many markets with decentralized sales forces, that local
sales staff has discretion to negotiate prices for individual deals within pre-set limits. A recent Ernst
& Young Consulting survey showed that about 75% of business-to-business companies surveyed in
the United States allowed their sales staff limited discretion to adjust prices (Williams 2014). One
conclusion of our work is that, in general, neither fully centralized nor fully decentralized pricing
maximizes profits. Fully centralized pricing is not optimal because it does not allow sales staff to
adjust prices toward the willingness-to-pay of individual customers. Fully decentralized pricing is
not optimal because it becomes too dependent on the idiosyncratic reserve prices of sales staff. Sales
staff with very low reservation prices will systematically under-price deals. Rather than relying on
full price centralization or decentralization, our work suggests firms can maximize profits by setting
list prices and giving sales representatives limited discretion to adjust the prices for individual
deals. In this way, our work provides a heretofore missing justification for the common practice of
list pricing with discretion.
In the remainder of the paper, we describe our structural model and how the EM method can be
used to estimate the parameters of the model given deal and lost-deal data. We then use synthetic
data to show that our estimates converge to the underlying true parameters when willingness-topay and reserve prices are drawn from normal distributions. Next, we apply the EM algorithm to
a large data set of negotiated deals and lost deals obtained from a major North American auto
lender and evaluate the predictions produced by our method relative to standard regression-based
predictors. Our method also provides better estimates of the prices associated with lost deals1 .
We then show how the distributions of reserve price and willingness-to-pay can be used to
determine the optimal amounts of pricing discretion to provide to the sales force. Finally, we
1
When a deal is lost, the last price (APR of the loan) offered to the customer is still recorded in this data set.
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
5
discuss the robustness of the model to the specification of the underlying distributions and to the
assumption about the relative bargaining power of sales staff and customers.
3.
Model Description
Consider a setting in which a group of heterogeneous sellers sells a (potentially heterogeneous)
product to a heterogeneous group of buyers. Prices are negotiated bi-laterally between a single
buyer and single seller in each transaction. We assume that in each negotiation, the seller has
a reserve price R and the buyer has a willingness-to-pay W . The values of R and W are both
unobservable, but they both may be correlated with observable attributes of the seller, the buyer
and the product being sold. If the reserve price is greater than the willingness-to-pay, then no
trade takes place. If the willingness-to-pay is greater than the reserve price, then the outcome of
the negotiation is assumed to be the generalized Nash bargaining solution. (See Binmore et al.
1986, Muthoo 1999, for reviews of Nash bargaining.) Nash bargaining is a widely-used model of
negotiations (c.f. Desai and Purohit 2004, Roth et al. 2006, Kuo et al. 2011).
In the Nash solution, both parties divide the total surplus from trade (the difference between
the sum of their utilities before and after the bargaining) according to their relative bargaining
powers. Let s1 and s2 be the utilities to players 1 and 2 derived from the bargaining outcome
and let s01 and s02 be, respectively, the corresponding disagreement utilities (utilities to each party
when no trade takes place). Then the generalized Nash bargaining solution outcome maximizes
(s1 − s01 )α (s2 − s02 )(1−α) , where α is a measure of the relative bargaining power of the buyer and
seller (see Binmore et al. 1986).
We assume that both the seller’s and buyer’s utilities will be zero if no trade takes place, i.e.,
s01
= s02 = 0. If trade takes place at a negotiated price p, then the seller’s utility will be p − R and
the buyer’s utility will be W − p. Therefore, when W ≥ R the generalized Nash bargaining solution
solves:
max
p
(p − R)α (W − p)(1−α)
The solution is
peq = αR + (1 − α)W
which gives the equilibrium price of the bargaining outcome2 . In our model, we assume that the
seller and buyer have equal bargaining powers, i.e., α = 0.5. Hence, if a trade takes place, it does
so at the price determined by the symmetric Nash bargaining equilibrium peq = (R + W )/2. We
show in Section 7.2 that α is in fact not identifiable from the data and moreover the estimation
results are robust to different choices of α. Lastly, in the case where no trade takes place, then we
only know that W < R.
2
Note that we are assuming an efficient bargaining outcome in a full information setting. Modeling an imperfect
bargaining situation in an asymmetric (or private) information setting would require additional assumptions (see, for
example, Chatterjee and Samuelson 1983).
6
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
4.
Structural Estimation Problem
Assume we have a data set D of the outcomes of N negotiations indexed by i, where the values
(Ri , Wi ) are IID draws from a (unknown) bi-variate normal joint distribution F (R, W ). For each
negotiation i in the data set, we observe only i) whether trade takes place or not, ii) if trade takes
place, the outcome price pi of the negotiation, and iii) a set of covariates describing the seller,
buyer and product involved in the negotiation. Given these assumptions and inputs, we want to
estimate the joint distribution F .
Define
• Ri = reserve price of seller in negotiation i (unobservable random variable).
• Wi = willingness-to-pay of buyer in negotiation i (unobservable random variable).
• pi = final price of negotiation i when trade takes place (data).
• Si = vector of seller characteristics for negotiation i (data).
• Bi = vector of buyer characteristics for negotiation i (data).
• Pi = vector of product characteristics for negotiation i (data).
• N = total number of negotiations (data).
We assume that Ri and Wi are joint normally distributed random variables, defined as
Ri = XiT βr + i
Wi = YiT βw + δi
where Xi = (Si , Pi ), Yi = (Bi , Pi ) are the vector of seller and buyer covariates, respectively, βr and
βw are the vectors of coefficients on these covariates, and (i , δi ), i = 1, . . . , N are IID error terms
which are bivariate normal with mean 0 and covariance matrix
σr2 σrw
Σ=
σrw σw2
with corresponding correlation coefficient ρ = σrw /(σr σw ). Note because we assume the error terms
have zero mean, E[Ri ] = XiT βr and E[Wi ] = YiT βw ; hence, the covariates influence only the means
of the willingness-to-pay and reserve price distribution. Without loss of generality, assume that
Xi and Yi have the same dimension M and let Xij and Yij denote the j-th component of Xi
and Yi , respectively. For notational convenience, also define R = (R1 , . . . , RN ), W = (W1 , . . . , WN ),
X = (X1 , . . . , XN ) and Y = (Y1 , . . . , YN ). We denote the marginal cdf of and δ by F and Fδ
with density functions f and fδ , respectively. We denote the marginal cdf of R by Fr (r, θr ) and
the associated marginal pdf fr (r, θr ). Similarly, the marginal cdf of W is denoted Fw (w, θw ) with
associated pdf fw (w, θw ).
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
7
Finally, we let θ = (βr , σr , βw , σw , ρ) denote the vector of parameters to be estimated from the
data set D and define the sets
T = {i : trade takes place}
T = {i : no trade takes place}
We seek to estimate θ from the data D.
4.1.
Non-Quasi-Concavity of the Log-Likelihood Function
Consider the simplest case where there are no covariates and R and W are independent. In this
case, both the reserve price and willingness-to-pay follow independent normal distributions with
θr = (µr , σr ) and θw = (µw , σw ), then θ = (µr , σr , µw , σw ). In this case, the log-likelihood function is
L(D, θ) =
X
ln {P (R + W = 2pi , R ≤ pi ≤ W )} +
i∈T
X
ln {P (W < R)}
i∈T
X
|T |
|T |
1
ln (2π) −
ln (σr2 + σw2 ) −
(2pi − µr − µw )2
2
2
2(σr2 + σw2 ) i∈T
!
!
X
(pi − µr )σw2 − (pi − µw )σr2
µr − µw
p
ln Φ
+
+ |T | ln Φ p
σr σw σr2 + σw2
σr2 + σw2
i∈T
=−
(1)
Proposition 1. The function L(D, θ) in equation (1) is not quasi-concave in general.
Proof:
We prove the proposition by providing a counterexample. Consider a case where there
are nine negotiations and trade happened in the first five of them at prices (60, 70, 75, 80, 88), hence
D = (60, 70, 75, 80, 88, N T , N T , N T , N T , N T ), where N T indicates that no trade took place.
Take the following two points: θ1 = (60, 10, 60, 36) and θ2 = (90, 35, 90, 10). Then, L(D, θ1 ) = −29.36
and L(D, θ2 ) = −29.40. If we take α = 0.5, we get an intermediate point θα = αθ1 + (1 − α)θ2 =
(75, 22.5, 75, 23), with L(D, θα ) = −29.74, which is smaller than the previous two. Therefore, quasiconcavity is violated. Q.E.D.
4.2.
An EM Estimation Method
The expectation-maximization (EM) method is a standard approach to parameter estimation with
censored data. Introduced by Dempster et al. (1977), the EM method is useful when the incomplete
data log-likelihood function is difficult to maximize directly but the complete data log-likelihood
function has a simple form, as we show is the case in our model. The book by McLachlan and
Krishnan (1996) is a good resource for details of the method and its extensions. The EM method is
used widely in the empirical operations management literature for estimation problems in choicebased revenue management (Vulcano et al. 2010), censored sales (Stefanescu 2012), substitutable
8
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
products with stock-outs (Anupindi et al. 1998), and selling with incomplete product availability
(Conlon and Mortimer 2007). Monte Carlo implementations of the EM algorithm have been developed for more complex likelihood functions (see, for example, Levine and Casella 2001, Chan and
Ledolter 1995).
Consider an ideal data set which records every pair (Ri , Wi ) for each negotiation i = 1, . . . , N .
Given these exact observations, as we show below, the log-likelihood function to estimate F is
straightforward. We can consider our data set D to be a partial observation of these complete
data. The idea of the EM method is to work with the conditional expected value of the simpler
complete-data likelihood rather than the more complex incomplete-data likelihood. Specifically,
the EM algorithm starts with initial estimates of the parameters θ (an initial ”guess”) and then
computes the expected value of the complete data log-likelihood function conditional on θ and
D. This is the ”E” (expectation) step. The resulting expected log-likelihood is of the same simple
form as the complete-data log-likelihood function and can be easily maximized to find updated
parameter estimates θ. This is the ”M” (maximization) step. The process is then repeated until
the estimates θ converge. The generalized EM algorithm improves the complete data problem’s
conditional expectation of the log-likelihood rather than maximizing it at every iteration.
In summary, the steps in the algorithm are:
• Initialization: Set θ = θ(0)
On the (k + 1)th iteration:
• E-step: Calculate Q(θ, θ(k) ) = Eθ(k) [Lc |D, θ(k) ]
• M-step: Choose θ(k+1) such that
Q(θ(k+1) , θ(k) ) ≥ Q(θ, θ(k) )
∀θ ∈ Ω
Repeat these E and M steps until convergence (e.g. until θ(k+1) ≈ θ(k) ).
4.2.1.
The EM Method Applied to the Structural Bargaining Model Suppose that
all of the values of Ri and Wi in our negotiating model are known for i = 1, 2, . . . , N . Then, the
maximum likelihood estimates of the model parameters are the solution to the normal equations
for the bivariate normal distribution:
(XT X)β̂r = XT R
(2)
(YT Y)β̂w = YT W
N
1 X
2
σ̂r =
(Ri − XiT β̂r )2
N i=1
(3)
σ̂w2 =
N
1 X
(Wi − YiT β̂w )2
N i=1
(4)
(5)
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
N
1 X
σ̂rw =
(Ri − XiT β̂r )(Wi − YiT β̂w )
N i=1
9
(6)
Consider the E-step on the (k + 1)th iteration of the EM algorithm, where θ(k) =
(βr(k) , σr(k) , βw(k) , σw(k) , ρ(k) ) denotes the incumbent estimate of the parameter values θ after the k th
EM iteration. The conditional expectation of the complete-data log-likelihood is then
1
1
1
Eθ(k) (R − XiT βr(k) )2 |R + W = 2pi , R ≤ pi
− ln(2π) − ln((σr(k) )2 ) −
(k)
2
2
2(σr )2
i∈T
X 1
1
1
Eθ(k) (R − XiT βr(k) )2 |W < R
+
− ln(2π) − ln((σr(k) )2 ) −
(k)
2
2
2(σr )2
i∈T
X 1
1
1
T (k) 2
+
− ln(2π) − ln((σw(k) )2 ) −
E
(k) (W − Yi βw ) |R + W = 2pi , W ≥ pi
θ
(k)
2
2
2(σw )2
i∈T
X 1
1
1
T (k) 2
− ln(2π) − ln((σw(k) )2 ) −
(7)
E
+
(k) (W − Yi βw ) |W < R
θ
(k)
2
2
2(σw )2
Eθ(k) [Lc |D, θ(k) ] =
X
i∈T
The M-step on the (k + 1)th iteration is implemented by replacing Ri , Wi , (Ri − XiT β̂r )2 and
(Wi − YiT β̂w )2 by their current conditional expectations in (2)-(6). Hence, the (k + 1)th estimates
of βr and βw , βr(k+1) and βw(k+1) , are specified by solving the following modified normal equations:
For all j = 1, 2, . . . , M
N X
M
X
(k+1)
Xij Xi` βr`
=
i=1 `=1
N X
M
X
X
Xij Eθ(k) [R|R + W = 2pi , R ≤ pi ] +
i∈T
(k+1)
Yij Yi` βw`
=
i=1 `=1
X
X
Xij Eθ(k) [R|W < R]
i∈T
Yij Eθ(k) [W |R + W = 2pi , W ≥ pi ] +
i∈T
X
Yij Eθ(k) [W |W < R]
i∈T
(k+1)
And, the (k + 1)th estimates of σr , σw , and σrw , σr(k+1) , σw(k+1) and σrw
are
(σr(k+1) )2 =
1 X
{
Eθ(k) (R − XiT βr(k) )2 |R + W = 2pi , R ≤ pi
N i∈T
X
Eθ(k) (R − XiT βr(k) )2 |W < R }
+
i∈T
(σw(k+1) )2 =
1 X
{
Eθ(k) (W − YiT βw(k) )2 |R + W = 2pi , W ≥ pi
N i∈T
X
+
Eθ(k) (W − YiT βw(k) )2 |W < R }
i∈T
(
h
i
X
1
(k+1)
σrw
=
Eθ(k) (R − XiT β̂r(k) )(W − YiT βw(k) )|R + W = 2pi , R ≤ pi
N i∈T

h
i
X
+
Eθ(k) (R − XiT βr(k) )(W − YiT β̂w(k) )|W < R

i∈T
10
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
4.2.2.
Convergence and Theoretical Properties All limit points of the EM algorithm for
our model are provably stationary points of the incomplete log-likelihood function because the
expected log-likelihood function (7) is continuous in βr(k) , σr(k) , βw(k) , σw(k) , and ρ(k) (see Wu 1983).
While the EM algorithm may not converge for a particular instance, if it does converge, it converges
to a stationary point. Moreover, one can easily detect during the course of the algorithm whether
or not it is converging. In practice, the EM algorithm has proved to be a robust and efficient way to
compute maximum likelihood estimates for incomplete data problems. Also, since the EM estimates
are maximum likelihood estimates, the statistical properties of maximum likelihood estimators are
also valid for them, i.e., they are consistent, asymptotically unbiased and asymptotically efficient.
4.3.
Numerical Tests on Synthetic Data
We conducted a series of numeric tests on synthetic data using direct maximization of the maximum
likelihood function in (1) as well as the EM method with and without using Monte Carlo simulation
to estimate the conditional expected values required in the E-Step. Our purpose in performing
these experiments was two-fold: first, to determine whether or not the EM method converged to
reasonable estimates of the underlying parameters and, second to compare the performance and
robustness of the EM method to direct maximization.
We constructed 24 examples by combining three different problem dimensions:
• Number of observations: 100; 1,000; 10,000 and 100,000,
• Percentage of negotiations that resulted in a deal: ∼ 40% and ∼ 80%,
• Number of covariates: 2, 5, and 10.
The mean willingness-to-pay and mean reserve price for each deal were linear in attributes. The
errors were assumed to be independent with zero means and known standard deviations.
We estimated the parameters for each of the 24 instances using the EM method with two different
approaches to calculating the conditional expected values required in the E-Step. In the first
approach (EM), we calculated the conditional expected value numerically. In the second approach
(Monte Carlo EM), we used Monte Carlo simulation to estimate conditional expectation. We
also estimated the parameters by directly maximizing the incomplete log-likelihood function in
MATLAB using its fmincon function. In all cases, we initiated the methods using 10 as the initial
intercept value and 0 as the initial estimates of the other coefficients. We ran the EM algorithm
until either the difference of two consecutive log-likelihood function was less than 10−2 or the
number of iterations exceeded 100. We used the default stopping criteria of the fmincon function.
In all cases, we terminated operations after 20,000 seconds (about 5 and 1/2 hours). The fmincon
function and the Monte Carlo EM method did not converge in some cases in which cases, we
restarted the algorithm using a different randomly-chosen initial point. Table 1 reports the total
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
11
amount of computation time for each method applied to each instance (including multiple starting
points). The times are for a laptop with Intel Core i7-3537U CPU with 2.00 GHz and 2.50 GHz
processor, 8 GB RAM and 64-bit Windows operating system.
Number of
Observations
100
1,000
10,000
100,000
Table 1
Instance
Number
1
2
3
4
5
6
Average
1
2
3
4
5
6
Average
1
2
3
4
5
6
Average
1
2
3
4
5
6
Average
Runtime (sec)
EM Monte Carlo EM Direct Max.
3.15
3.48
1.48
2.33
4.50
0.37
1.18
0.78
0.46
3.08
1.51
0.14
5.34
4.39
51.95
7.31
3.64
0.70
3.73
3.05
9.18
46.38
65.37
0.75
28.09
16.65
9.45
10.12
7.25
0.17
24.92
7.70
0.16
157.79
32.38
1.47
86.01
48.85
73.46
58.89
29.70
14.24
430.89
570.43
123.23
1,102.12
155.71
1,034.20
110.94
83.90
118.62
341.30
92.05
19,659.12
1,548.98
318.14
377.65
1,159.15
540.31
>20,000
782.24
293.42 >6,885.47
4,596.66
5,190.19
>20,000
11,012.59
2,447.16
18.03
2,125.79
768.79
2,076.20
4,194.65
845.05
5.02
15,480.73
4,158.28
>20,000
11,535.06
5,358.41
>20,000
8,150.08
3,127.98 >10,349.88
Computation Time Comparison of the Three Estimation Methods: EM, Monte Carlo EM, and Direct
Maximization of the Log-Likelihood Function
For all problem instances, the Monte Carlo EM algorithm converged to a stationary point in
under 100 iterations. The EM algorithm converged in under 100 iterations for 11 instances, but it
was very close to convergence when we stopped the procedure at 100 iterations for the other seven
instances. For three instances with 100,000 data points and one instance with 10,000 observations,
the direct maximization procedure could not find a solution after 20,000 seconds. When the methods
converged (or stopped in the EM method), they always converged to parameter estimates with
very similar likelihoods. The average deviation from the highest log-likelihood was 0.1% and the
maximum deviation of any method on any instance was 0.79%. Hence, we did not observe a
significant advantage of any procedure in terms of the quality of the parameter estimates. However,
12
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
both implementations of the EM algorithm were more reliable in converging to a solution and, on
average, converged more quickly for the larger instances.
Tables 2, 3, and 4 show the true and estimated parameters of the willingness-to-pay and reserve
price values for 100, 1,000, and 10,000 observations, respectively. They report the EM algorithm
results for 100 and 1,000 observations (since the algorithm converged in a reasonable time) and
the Monte Carlo EM algorithm results for the 10,000 observation cases (since the EM algorithm
required significantly longer computation times). The average mean squared errors between the true
and estimated parameters were 10.4991, 2.8293, and 0.4085 for 100, 1,000, and 10,000 observations,
respectively. For these cases, the EM methods appear to be converging to the true parameter
values.
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
38%
βr,0
5
2.81
βw,0
7
4.71
βr,1
0.5
0.62
βw,1
1
1.13
βr,2
1.5
1.60
βw,2
0.4
0.59
σr
3
1.84
σw
6
7.73
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
40%
βr,0
30
27.75
βw,0
10
10.75
βr,1
5
7.80
βw,1
1.5
6.51
βr,2
-0.1
-0.14
βw,2
-0.05
-0.08
βr,3
-3
-5.08
βw,3
2
-2.28
βr,4
5
5.72
βw,4
4
4.39
βr,5
1
1.36
βw,5
3
3.37
σr
8
10.83
σw
15
10.20
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
35%
βr,0
30
38.51
βw,0
15
15.99
72%
βr,0
12
21.45
βw,0
15
13.82
βr,1
5
4.68
βw,1
1.5
-4.63
βr,2
3
4.31
βw,2
4
8.49
βr,3
-0.1
-0.10
βw,3
-0.05
-0.07
βr,4
-3
-4.39
βw,4
2
0.17
βr,5
5
5.12
βw,5
4
2.67
βr,6
1
0.11
βw,6
3
5.51
βr,7
4
1.21
βw,7
6
9.62
βr,1
6
6.52
βw,1
1.5
1.49
βr,2
3
6.76
βw,2
4
2.21
βr,3
-0.1
-0.12
βw,3
-0.05
-0.05
βr,4
-3
-5.56
βw,4
2
4.19
βr,5
5
5.00
βw,5
4
4.04
βr,6
1
0.27
βw,6
3
2.80
βr,7
4
5.87
βw,7
6
6.64
Table 2
5.
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
78%
βr,0
5
1.78
βw,0
8
15.77
βr,1
0.3
0.21
βw,1
1.2
0.91
βr,2
1
1.65
βw,2
0.4
-0.13
σr
3
5.81
σw
5
1.81
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
82%
βr,0
30
33.28
βw,0
10
15.31
βr,1
5
18.57
βw,1
1
-4.03
βr,2
-0.1
-0.13
βw,2
-0.05
-0.04
βr,3
-3
-1.04
βw,3
2
-1.84
βr,8
-1
-1.29
βw,8
3
2.32
βr,9
10
12.03
βw,9
7
11.18
βr,10
3
2.56
βw,10
1
-0.34
σr
5
3.49
σw
10
12.47
βr,8
-1
-2.75
βw,8
3
4.46
βr,9
10
5.12
βw,9
7
11.41
βr,10
3
3.68
βw,10
1
0.58
σr
5
5.61
σw
10
7.63
βr,4
3
3.85
βw,4
5
4.31
βr,5
1
-0.87
βw,5
3
4.27
σr
8
15.59
σw
15
7.42
True and Estimated Parameters of Willingness-to-Pay (W ) and Reserve Price (R) for 100 Observations
Application to Auto Lending Data
We applied our structural EM method to a data set supplied by a major auto lender operating in
the United States. This data set is also used in Phillips et al. (2014). The lender sells to its end
customers through dealers who are located throughout the country. Although there is a centralized
annual percentage rate (APR) set for each loan, in most cases the dealer has discretion to negotiate
the final interest rate with customers3 . We excluded about 55.5% of the deals in the original data
base from the analysis because they were offered at promotional rates (so-called subvened rates)
over which the dealers had little or no authority to negotiate. Our structural EM method provides
3
We are using the word “negotiation” in a broad sense, since it is possible that some customers do not realize that
the rate is negotiable and may accept (or reject) the lender’s first offer without any “back and forth discussion”.
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
35%
βr,0
5
4.81
βw,0
7
8.31
βr,1
0.5
0.49
βw,1
1
0.88
βr,2
1.5
1.70
βw,2
0.4
0.54
σr
3
3.61
σw
6
5.11
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
38%
βr,0
30
34.40
βw,0
10
11.13
βr,1
5
4.73
βw,1
1.5
0.65
βr,2
-0.1
-0.10
βw,2
-0.05
-0.05
βr,3
-3
-2.50
βw,3
2
1.78
βr,4
5
5.22
βw,4
4
4.04
βr,5
1
0.51
βw,5
3
3.32
σr
8
9.76
σw
15
13.60
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
34%
βr,0
30
26.65
βw,0
15
3.32
73%
βr,0
12
12.52
βw,0
15
12.31
βr,1
5
5.25
βw,1
1.5
1.32
βr,2
3
3.19
βw,2
4
5.04
βr,3
-0.1
-0.10
βw,3
-0.05
-0.04
βr,4
-3
-2.22
βw,4
2
2.39
βr,5
5
5.05
βw,5
4
3.63
βr,6
1
0.98
βw,6
3
4.18
βr,7
4
5.01
βw,7
6
7.06
βr,1
6
6.30
βw,1
1.5
1.38
βr,2
3
3.04
βw,2
4
2.97
βr,3
-0.1
-0.10
βw,3
-0.05
-0.05
βr,4
-3
-2.18
βw,4
2
1.95
βr,5
5
5.05
βw,5
4
3.93
βr,6
1
0.69
βw,6
3
3.32
βr,7
4
4.64
βw,7
6
6.53
Table 3
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
76%
βr,0
5
7.15
βw,0
8
10.21
βr,1
0.3
0.19
βw,1
1.2
0.92
βr,2
1
1.02
βw,2
0.4
0.55
σr
3
5.61
σw
5
2.66
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
79%
βr,0
30
34.15
βw,0
10
7.60
βr,1
5
4.96
βw,1
1
1.01
βr,2
-0.1
-0.10
βw,2
-0.05
-0.04
βr,3
-3
-3.79
βw,3
2
2.94
βr,8
-1
-0.95
βw,8
3
4.03
βr,9
10
10.22
βw,9
7
7.24
βr,10
3
2.88
βw,10
1
0.22
σr
5
4.03
σw
10
12.29
βr,8
-1
-0.76
βw,8
3
3.01
βr,9
10
8.78
βw,9
7
8.44
βr,10
3
3.12
βw,10
1
1.04
σr
5
4.78
σw
10
9.64
βr,4
3
3.04
βw,4
5
4.67
βr,5
1
0.68
βw,5
3
3.33
13
σr
8
9.65
σw
15
12.66
True and Estimated Parameters of Willingness-to-Pay (W ) and Reserve Price (R) for 1,000
Observations
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
38%
βr,0
5
6.96
βw,0
7
7.95
βr,1
0.5
0.41
βw,1
1
0.83
βr,2
1.5
1.62
βw,2
0.4
0.77
σr
3
4.13
σw
6
3.72
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
40%
βr,0
30
29.56
βw,0
10
8.54
βr,1
5
5.08
βw,1
1.5
1.07
βr,2
-0.1
-0.10
βw,2
-0.05
-0.04
βr,3
-3
-2.42
βw,3
2
1.45
βr,4
5
4.97
βw,4
4
3.86
βr,5
1
0.95
βw,5
3
3.09
σr
8
7.61
σw
15
15.88
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
35%
βr,0
30
30.87
βw,0
15
16.86
72%
βr,0
12
12.12
βw,0
15
16.10
βr,1
5
4.96
βw,1
1.5
1.51
βr,2
3
3.08
βw,2
4
4.16
βr,3
-0.1
-0.10
βw,3
-0.05
-0.06
βr,4
-3
-2.87
βw,4
2
2.46
βr,5
5
4.94
βw,5
4
3.95
βr,6
1
0.91
βw,6
3
2.91
βr,7
4
4.63
βw,7
6
6.23
βr,1
6
6.00
βw,1
1.5
1.44
βr,2
3
3.21
βw,2
4
3.97
βr,3
-0.1
-0.10
βw,3
-0.05
-0.05
βr,4
-3
-2.61
βw,4
2
1.94
βr,5
5
5.05
βw,5
4
3.90
βr,6
1
1.00
βw,6
3
3.00
βr,7
4
3.94
βw,7
6
5.76
Table 4
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
78%
βr,0
5
6.60
βw,0
8
8.84
βr,1
0.3
0.17
βw,1
1.2
1.05
βr,2
1
1.05
βw,2
0.4
0.54
σr
3
4.75
σw
5
2.96
Trade pc:
R
true prms
est. prms
W
true prms
est. prms
82%
βr,0
30
28.94
βw,0
10
11.58
βr,1
5
4.81
βw,1
1
1.55
βr,2
-0.1
-0.10
βw,2
-0.05
-0.05
βr,3
-3
-3.20
βw,3
2
1.72
βr,8
-1
-1.10
βw,8
3
2.93
βr,9
10
10.49
βw,9
7
6.92
βr,10
3
2.99
βw,10
1
0.97
σr
5
5.11
σw
10
9.76
βr,8
-1
-1.08
βw,8
3
3.04
βr,9
10
9.47
βw,9
7
6.48
βr,10
3
2.94
βw,10
1
0.99
σr
5
4.97
σw
10
9.99
βr,4
3
2.96
βw,4
5
5.02
βr,5
1
1.18
βw,5
3
2.78
σr
8
7.94
σw
15
14.89
True and Estimated Parameters of Willingness-to-Pay (W ) and Reserve Price (R) for 10,000
Observations
us estimates of the distributions of willingness-to-pay and reserve price as well as estimates of the
last prices (APRs) for lost deals – that is, failed transactions. We evaluate the results of using
the EM approach in two ways. First, we compare the take-up rates and APRs of both successful
deals and lost deals with a hold-out sample. Our benchmark is the set of APRs predicted using
linear regression on deal characteristics. We also compare the take up rates predicted by the EM
method with the take-up rates predicted by standard probit regression. Second, we compute the
price elasticities implied by our estimates and compare these to the endogeneity-corrected estimates
14
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
in Phillips et al. (2014) for the same auto lender data. In both cases, the EM method compares
favorably with the alternatives.
5.1.
Data Set and Model
Our data set includes information on the outcome of all approved applications for auto loans
received by a major North American indirect auto lender data during a multi-year period starting
in January 20094 . As noted previously, we considered only the non-subvened deals – that is those in
which rates were negotiable. There are 950,985 approvals in this data set of which 719,835 (75.69%)
were taken up. All of these loans were offered through the finance and insurance (F&I) departments
of local dealerships and, in each case, the F&I manager had the authority to negotiate the APR
of the loan with the customer. Each record includes the APR at which the loan was offered, the
final disposition of the loan (accepted by the customer or not accepted) and information about the
customer (FICO score), information about the product (make and model of car, size of loan, new
or used car, term of loan, etc.), and the identity and location of the dealer.
The variables in the data and their statistics for the sample that we used are shown in Tables 5
and 6. FICO score is an industry-standard measure of individual default risk that ranges from 300
to 850 with higher values representing lower risk. The term and amount of the loan and whether
or not the car being purchased is new or used are characteristics of the loan being requested.
The customer rate is the final rate charged to the customer in the case of a successful negotiation
and the last rate offered in the case of a lost deal. LIBOR is the interbank lending rate and is
closely correlated with the lender’s cost of funds and with the primer rate. The dealer offered cash
incentives for some deals. The amount of this incentive – if offered – is also recorded in the data.
If the incentive was not offered, this value is 0. The lender applied a proprietary methodology
to classify potential customers into five risk tiers with risk tier 1 being the least risky and risk
tier 5 the most risky. Risk tier is correlated with FICO score but is not fully determined by it
because the lender used additional information to classify customers into risk tiers. To create an
individual risk measure that is not correlated with risk tier, we created a normalized FICO score
for each customer, F\
ICO, defined by F\
ICO = Actual FICO Score - Mean FICO Score for risk
tier.5 . F\
ICO is uncorrelated with risk tier.
Applying the EM method to the entire data set would require an excessive amount of computational time. More specifically, estimating the parameters using the full data set would require 57
hours of CPU time on a laptop with Intel(R) Core(TM) i7-3537U CPU @ 2.00 GHz and 2.50 GHz
processor, 8GB RAM and 64-bit Windows operating system. (We ran the EM algorithm up to
4
The lender has requested anonymity and certain details about the data have been suppressed to conceal its identity.
5
This variable is defined in the same way in Phillips et al. (2014) as well.
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
15
Variable
Mean Std. Dev. Min
Max
FICO Score
740.66
79.35 371
900
F\
ICO
4.6
54.87 -344
231
Term (Months)
59.76
11.40
36
72
Amount ($1,000)
21.21
10.88 1.00 212.77
Customer Rate (%)
8.49
3.95 0.00 33.39
LIBOR (%)
0.27
0.07 0.19
0.56
Cash Incentive ($1,000)
1.32
1.28 0.00
9.00
Table 5
Summary Statistics for Continuous Variables
Variable
Category Frequency Percent
1
114,874
57.44
2
38,146
19.07
Tier
3
29,819
14.91
4
7,163
3.58
5
9,998
5.00
New
135,145
67.57
Vehicle Type
Used
64,855
32.43
Not Take
48,894
24.45
Customer’s Decision
Take
151,106
75.55
Table 6
Summary Statistics for Categorical Variables
1,000 iterations to ensure convergence and computation time for each iteration was 206 seconds.)
Therefore, we randomly selected a sample of 200,000 observed negotiation outcomes from the original data (21.03% of the “non-subvened” deals) for computations (this required 11 hours of CPU
time on the same platform – 1,000 iterations with 40 sec/iteration). We randomly split the data
set into estimation and holdout samples of equal size. As a first step, we estimated a single model
with separate coefficients for each of the variables listed in Tables 5 and 6 allowing R and W to be
correlated using the algorithm described in Section 4.2. To estimate the standard errors, we used
the bootstrapping approach described in Chapter 4.6 of McLachlan and Krishnan (1996). Table 7
shows the results of the estimation.
βr
Variable Est. SE t-stat Est.
Intercept 1.87 0.30
6.27 18.78
Tier2 1.73 0.04 42.02 1.48
Tier3 3.58 0.05 75.60 3.29
Tier4 5.32 0.12 45.92 4.03
Tier5 7.46 0.14 53.20 5.65
Used 5.11 0.08 63.08 -1.17
LIBOR 1.19 0.02 54.61 1.06
Covariance (σRW )
Correlation Coeff. for R and W
Log-Likelihood (for holdout)
Table 7
βw
SE t-stat
Variable Est.
0.33 57.35 Term Class 2 0.23
0.05 32.45 Term Class 3 0.47
0.05 69.04 Term Class 4 0.67
0.09 43.39 F\
ICO (/100) -1.3
0.17 33.48
Log Amount -0.17
0.08 -14.47
Cash Incen. -0.93
0.02 55.34 Std. Dev. (σ) 2.82
0.01 (SE: 0.07; t-stat: 0.14)
0.0014 (SE: 0.01; t-stat: 0.14)
-28,392.9
βr
βw
SE t-stat Est. SE t-stat
0.06
3.96 0.72 0.06 11.33
0.05
9.99 0.37 0.05
7.18
0.05 12.77 0.62 0.06 10.47
0.03 -49.59 -1.25 0.03 -45.16
0.03 -5.53 -1.39 0.03 -44.24
0.04 -22.48 0.98 0.04 22.53
0.03 88.77 2.57 0.03 77.03
Coefficient Estimates for the Structural EM Method. All values are significant at the .01 level.
16
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
The values in Table 7 generally accord with intuition. The coefficient βw increases with risk tier
and decreases with F\
ICO, indicating that customer willingness-to-pay increases with risk. This
is consistent with previous findings that more risky customers are less sensitive to the price of
loans than less risky borrowers (Phillips and Raffard 2011). Average willingness-to-pay decreases
with Log Amount, consistent with the intuition that customers are often more likely to shop for
alternatives if the amount of the loan – and hence the monthly payment – is higher. Term has no
consistent effect on average willingness-to-pay. On the other hand, the reserve price consistently
increases in term. This may be due to the fact that customers preferring longer-term loans tend to
be higher risk than those preferring shorter-term loans. Average seller reserve price increases with
risk tier. This is intuitive because risk represents a cost to the seller and increased cost needs to
be covered by an increased price. Seller reserve price decreases in Log Amount – this is consistent
with the common expectation that sellers should be more willing to discount larger deals. It may
also reflect a revenue-bias on the part of the sales force.
The results in Table 7 indicate that reserve price and willingness-to-pay are not highly correlated
with each other – the correlation coefficient between R and W is .0014 and the covariance is
.01. This suggests an alternative model in which we estimate a different model for each risk tier
assuming that the covariance between R and W is zero. The results for this second approach are
shown in Table 8, which also shows the empirical means and standard deviations of R and W for
each risk tier for the hold-out sample (numbers in parentheses show the bootstrapped standard
errors – all the variables except Term Class 3/4 and Used of Tier 5 are statistically significant at
the .01 significance level). As expected, the mean values of both R and W increase by risk tier.
Henceforth, we use this model as our standard model for comparison.
The results in Table 8 show some of the same patterns as Table 7. Notably, both reserve price
and willingness-to-pay increase with risk. This is evident from the fact that both the intercepts
and the mean values increase with tier and the coefficients for F\
ICO are less than zero for both
R and W . There is no longer any consistent pattern evident in the coefficients for term class for
either R or W suggesting that the pattern in the previous model was indeed due to the correlation
between risk and term. The coefficients for Log Amount are negative for all categories except Tier
1 of R. The coefficients for Used show a striking pattern in that they are all positive for R and
all negative for W . We conjecture that the positive coefficients for R reflect the fact that used-car
APR’s are rarely subvened in the marketplace and hence lenders feel they can extract a higher
APR than for new cars. On the other hand, the negative coefficient for W may reflect the fact
that used car buyers are, in general, more price-sensitive than new car buyers. The pattern for
Cash Incentive is also interesting. Offering a cash incentive increases the willingness of borrowers to
accept a higher APR for the loan, which is intuitive. It is more difficult to explain the consistently
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
Tier 1
Intercept
-1.92
(0.53)
Term Class 2
0.43
(0.09)
Term Class 3
0.73
(0.07)
1.61
Term Class 4
(0.08)
Used
5.15
(0.17)
F\
ICO (/100)
-0.54
(0.04)
Log Amount
0.16
(0.05)
Cash Incentive
-0.96
(0.09)
LIBOR
1.33
(0.04)
Std. Dev.
2.46
(0.06)
Emp. Mean
2.91
Emp. Std. Dev.
2.73
Table 8
Tier 2
14.94
(0.76)
-1.19
(0.19)
-1.76
(0.16)
-1.72
(0.17)
3.6
(0.16)
-1.53
(0.06)
-0.99
(0.07)
-0.81
(0.07)
0.94
(0.04)
2.61
(0.09)
7.05
3.34
R
Tier 3
18.52
(0.9)
-1.35
(0.25)
-1.92
(0.24)
-2.49
(0.23)
3.43
(0.19)
-2.57
(0.07)
-0.98
(0.09)
-0.76
(0.07)
0.68
(0.04)
2.92
(0.11)
10.23
3.46
Tier 4
18.4
(2.44)
-2.03
(0.66)
-3.16
(0.64)
-4.59
(0.69)
4.44
(0.39)
-2.72
(0.18)
-0.75
(0.23)
-1.00
(0.17)
1.01
(0.11)
3.62
(0.21)
12.61
4.58
Tier 5
33.22
(2.52)
-3.58
(0.65)
-3.66
(0.64)
-4.04
(0.68)
5.9
(0.43)
-2.66
(0.2)
-2.23
(0.24)
-1.22
(0.19)
1.58
(0.14)
4.80
(0.23)
15.61
5.82
Tier 1
17.11
(0.48)
0.38
(0.08)
0.39
(0.06)
0.89
(0.08)
-1.61
(0.13)
-0.49
(0.04)
-1.27
(0.05)
1.00
(0.09)
1.18
(0.03)
2.05
(0.06)
9.86
1.85
Tier 2
30.19
(0.84)
-0.95
(0.2)
-2.5
(0.17)
-1.78
(0.17)
-2.03
(0.18)
-1.95
(0.07)
-2.08
(0.08)
0.97
(0.08)
0.84
(0.04)
3.02
(0.10)
9.74
2.61
W
Tier 3
31.69
(0.99)
-0.68
(0.22)
-2.07
(0.2)
-1.65
(0.21)
-1.93
(0.18)
-2.63
(0.07)
-2.03
(0.1)
0.74
(0.08)
0.82
(0.04)
3.11
(0.11)
11.18
2.55
Tier 4
31.46
(2.88)
-0.93
(0.72)
-2.28
(0.7)
-2.56
(0.72)
-1.76
(0.39)
-2.49
(0.15)
-1.85
(0.28)
1.05
(0.16)
0.78
(0.11)
3.75
(0.22)
12.25
2.76
17
Tier 5
40.75
(3.26)
2.29
(0.83)
0.65
(0.71)
-0.56
(0.71)
-0.58
(0.48)
-2.83
(0.14)
-3.04
(0.33)
1.78
(0.19)
1.12
(0.14)
4.85
(0.31)
14.47
3.55
Coefficient Estimates for the Structural EM Method: Each Tier Separately
negative coefficients for R. The coefficients for LIBOR are all positive and are generally close to 1,
reflecting that it is a cost that needs to be reflected in the APR.
5.2.
Quality of Predictions
One of the advantages of our structural estimates is that they predict both take-up and APR.
Hence, the estimates can be used not only to estimate take-up and take-up sensitivity to price, but
also to impute APRs for lost deals (the last APR offered to a loan that a customer rejected) as well
as predict the APRs for future deals. Below, we compare the quality of these predictions to standard
regression-based approaches applied to the same data. In particular, we compare APRs predicted
by our structural EM method to the actual APRs of the deals and to the APRs predicted by linear
regression. We also compare the take-up predictions of our method to the take-up predictions of
a two-stage endogeneity-corrected probit model, as well as to the actual realized value recorded in
the data set.
5.2.1.
APR Predictions Table 9 compares the APR’s predicted for deals in the test data
generated by our method to the actual APRs and to the APRs predicted using linear regression
applied using APR as the target variable and all of the variables listed in Tables 5 and 6 (except
APR) as independent variables. This is the same approach used to generate the first-stage estimates
of APR’s described in more detail in Phillips et al. (2014). The structural EM method more
18
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
Tier
1
2
3
4
5
Total
Table 9
N
57,399
19,000
14,967
3,673
4,937
99,976
All Deals
Won Deals
Lost Deals
Avg.
LR
EM
Avg.
LR
EM
Avg.
LR
EM
APR (%) RMSE RMSE APR (%) RMSE RMSE APR (%) RMSE RMSE
6.36
1.82
1.76
6.23
1.79
1.72
7.45
2.09
2.05
8.52
2.19
2.07
8.22
2.22
2.09
9.07
2.13
2.04
10.80
2.51
2.30
10.44
2.50
2.24
11.21
2.52
2.38
13.37
3.53
3.31
11.49
3.12
2.78
14.76
3.80
3.65
16.90
5.33
4.74
13.82
4.23
3.64
19.22
6.03
5.42
8.21
2.37
2.21
7.33
2.09
1.95
10.94
3.08
2.88
Root Mean Squared Error (RMSE) of the APR’s Predicted for All Deals, Won Deals, and Lost Deals
for Linear Regression (LR) and Expectation-Maximization (EM) Methods
accurately forecasts APRs for all risk tiers for both won and lost deals than linear regression. The
amount of improvement generally increases with risk tier. This probably reflects the fact that the
range of APRs offered for different deals increases in risk tier, providing more room for improvement
in RMSE.
5.2.2.
Take-Up Predictions Table 10 shows the take-up rates estimated by our method
compared to the observed take-up and the take-up forecast by two different probit models. In the
first probit model (Probit A), we assume that prices are not available for lost deals and we use
the prices imputed by linear regression. In the second probit regression (Probit B), we utilized the
prices for all recorded negotiations whether they resulted in transactions or lost deals. A lender who
had access to rates for “lost deals” would typically utilize that information in modeling take-up
and would utilize an approach similar to the second probit model. A lender who did not have rates
for “lost deals” would typically use a method similar to the first probit model. Hence, these serve
as benchmarks for two different possible informational states of a lender.
Table 10 shows the take-up prediction performance of the various methods on the hold-out
sample. Note the two probit approaches did not differ significantly in terms of concordance for risk
tiers 1 - 3, nor for the test data set as a whole. Probit B improved the concordances for tiers 4
and 5 compared to Probit A and had a slightly higher log-likelihood for the sample as a whole.
Our structural EM estimates had higher concordance in total (.928) than either of the alternatives
(.921 and .922 for Probit A and B, respectively). The EM algorithm actually performed worse (in
terms of both concordance and log-likelihood.) in risk tiers 4 and 5 compared to Probit B, but this
was outweighed by its superior performance in the much larger higher risk tiers 1 - 3.
5.3.
Comparison of Elasticities
We can use the coefficients of our structural model to compute customer and seller price-elasticities.
Let FW and FR denote the marginal cumulative distribution functions and fw and fR the marginal
density functions for W and R respectively and let F̄W = 1 − FW . Then, the customer price elasticity
at price p is W (p) = −pfW (p)/F̄W (p). This corresponds to the standard definition of price elasticity;
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
Tier
1
2
3
4
5
Total
Probit A
Probit B
Actual
Pred.
Pred.
Take-up Take-up Concor.
LL
Take-up Concor.
89.73
89.92
.959
-7,748
89.95
.960
64.18
64.57
.862
-7,877
64.57
.864
54.13
53.29
.784
-7,968
53.27
.780
42.42
44.80
.774
-1,925
44.98
.812
42.92
42.21
.759
-2,713
42.04
.815
75.50
75.60
.921
-28,231 75.62
.922
Table 10
EM
Pred.
LL
Take-up Concor.
LL
-7,848
89.57
.961
-7,500
-7,878
64.64
.866
-7,850
-8,002
53.62
.788
-7,839
-1,841
44.85
.781
-1,871
-2,548
42.32
.789
-2,598
-28,117 75.47
.928
-27,659
Take-Up Prediction Performance for Probit A, Probit B and the Structural EM Method (EM)
Tier
Tier
Tier
Tier
Tier
Table 11
19
1
2
3
4
5
Customer
Seller
Price Elasticity Price Elasticity
Probit
EM
EM
-1.10
-1.81
0.16
-0.96
-1.69
1.39
-0.85
-0.89
2.80
-0.80
-0.53
2.68
-0.54
-0.24
2.49
Customer and Seller Price Elasticities by Risk Tier for Structural EM and Two-Stage Probit Estimates
it is the percentage change per unit percentage change in price. By analogy, we can also calculate
a seller price elasticity as R (p) = pfR (p)/FR (p). We computed the elasticity values for both R
and W using the empirical mean and standard deviations of R and W reported in Table 8 and
the mean customer rate reported in Table 5. Table 11 shows both the customer and seller price
elasticities for each risk tier. For comparison, Table 11 also shows the customer price elasticities
estimated via the method in Phillips et al. (2014) using an endogeneity-corrected probit model for
the same auto lender data.
The price-elasticities from Phillips et al. (2014) are not directly comparable because they were
calculated using the entire data set while our elasticities were calculated from the data set excluding
subvened deals. However, our estimates follow the expected pattern of decreasing in magnitude for
larger risk tiers – again reflecting the fact that higher-risk customers are less price-sensitive than
lower-risk customers. Seller price-elasticity increases in tier as well. This would indicate that sellers
are, in general, less-willing to negotiate with higher risk customers, all else being equal.
6.
Implications for Pricing Policy
We next consider how estimated willingness-to-pay and reserve price distributions can be used to
improve profits. We show –first theoretically and then empirically– that, it can be profitable for
headquarters to impose a minimum price by deal category on its sales force. Properly set, these
limits can generate significant profit improvements relative to unconstrained negotiation and even
relative to data-driven optimized fixed prices.
20
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
6.1.
Theoretical Analysis
When the sales force is given full pricing discretion, the expected revenue6 from a negotiation is
R+W
E(Revenue) = E
|W ≥ R P{W ≥ R}
2
We assume that “headquarters” can impose a minimum price for each transaction on the sales
force. That is, for a given negotiation, the sales agent cannot agree to a price less than pmin .
We note that pmin can be a function of observable covariates of a deal. The reserve price of each
individual sales agent is independent of the minimum price imposed by headquarters. In this case,
headquarters wishes to derive the p∗min that will maximize expected revenue. We note that there
are four cases given W ≥ R:
• Case 1: R ≤ W ≤ pmin . In this case no trade will occur.
• Case 2: R ≤ peq ≤ pmin ≤ W . Trade will occur at pmin
• Case 3: R ≤ pmin ≤ peq ≤ W . Trade will occur at peq
• Case 4: pmin ≤ R ≤ W . Trade will occur at peq
Figure 1 illustrates these cases. For cases 3 and 4, the imposition of a minimum price does not
change revenue while revenue may decrease for case 1 and increase for case 2.
R
2pmin
Case 4
pmin
Case 3
pmin
Case 1
Figure 1
2pmin
W
Case 2
Revenue Change Zones Based on the Value or R and W
The expected revenue from a transaction under a minimum price policy (MPP) can be written
as
6
In this section we assume w.l.o.g. that incremental costs are zero so that maximizing revenue is the same as
maximizing profit.
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
21
E(RevenueM P P ) = 0 ∗ P{R ≤ W ≤ pmin }
+ pmin ∗ P{R ≤ peq ≤ pmin ≤ W }
R+W
eq
+E
|R ≤ pmin ≤ p ≤ W ∗ P{R ≤ pmin ≤ peq ≤ W }
2
R+W
+E
|pmin ≤ R ≤ W ∗ P{pmin ≤ R ≤ W }
2
With a little bit of algebra, we can write this expression as
Z pmin
E(RevenueM P P ) = pmin ∗
Fr (x)fw (2pmin − x)dx
−∞
1
+
2
Z
1
−
2
Z
∞
Z
∞
xfw (x)Fr (x)dx −
pmin
xfw (x)Fr (2pmin − x)dx
pmin
∞
Z
pmin
xfr (x)Fw (2pmin − x)dx +
xfr (x)Fw (x)dx +
pmin
−∞
µr
2
(8)
Lemma 1. If Fr and Fw have support on (0, ∞) and if fw is non-increasing in an interval (a, 2a),
then:
∂E(RevenueM P P ) ≤0
∂pmin
pmin =a
Proof:
The derivative of (8) is
Z pmin
∂E(RevenueM P P )
=
Fr (x)fw (2pmin − x)dx − pmin fw (pmin )Fr (pmin )
∂pmin
−∞
Z ∞
=
Fr (2pmin − x)fw (x)dx − pmin fw (pmin )Fr (pmin )
pmin
Because Fr and Fw have support only on (0, ∞),
Z pmin
∂E(RevenueM P P )
=
Fr (x)fw (2pmin − x)dx − pmin fw (pmin )Fr (pmin )
∂pmin
0
By the Mean Value Theorem, there must be some p̂ ∈ (0, pmin ) such that:
Z pmin
Fr (x)fw (2pmin − x)dx = pmin Fr (p̂)fw (2pmin − p̂)
0
So,
∂E(RevenueM P P )
= pmin [Fr (p̂)fw (2pmin − p̂) − fw (pmin )Fr (pmin )]
∂pmin
Since Fr (p̂) ≤ Fr (pmin ) and fw (2pmin − p̂) ≤ fw (pmin ), the partial derivative is less than or equal to
0. Q.E.D.
22
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
Corollary 1. If W is distributed exponentially, then p∗min = 0. If R ∼ U nif orm(ar , br ) and
W ∼ U nif orm(aw , bw ) with some ar , aw ≥ 0, then:
• If aw ≤ ar , then p∗min = 0
• If aw > ar , then p∗min = aw
Proof:
The first part follows immediately from Proposition 1. The second part follows since
MP P )
≤ 0 by Proposition 1 and if pmin < aw then
in case aw > ar , if pmin > aw , then ∂E(Revenue
∂pmin
R
p
∂E(RevenueM P P )
= 0 min Fr (x)fw (2pmin − x)dx ≥ 0 and non-decreasing (zero if pmin < a2w and increas∂pmin
ing if
aw
2
≤ pmin ≤ aw ) in pmin . Finally, in case aw ≤ ar , if pmin > aw , then
Proposition 1 and if pmin < aw then
∂E(RevenueM P P )
∂pmin
∂E(RevenueM P P )
∂pmin
≤ 0 by
= 0. Q.E.D.
Corollary 2. If W follows a normal distribution, then p∗min ≤ µW .
When R and W are both normally distributed, there is no closed-form solution for p∗min but it
can be calculated numerically. For the auto lending data, Figure 2 shows the graph of expected
customer rate from a negotiation given by (8) as a function of pmin when R and W are normally
distributed with the estimated parameters given previously in Table 87 . Note that revenue in each
tier could be increased by imposing a minimum price. The increase is most significant for Tier 1
customers due to the fact that the mean reserve price is lowest for this tier. Note that, as required
by Corollary 2, p∗min ≤ µW in each tier.
6.2.
Policy Comparisons
Using our estimates of the willingness-to-pay and reserve price distribution, we can estimate the
counter-factual change in expected profit that would have occurred from alternative policies. For
a given deal j, we calculate the expected profit at price p by:
Πj (p) = P OPj ∗ (T otalP aymentj (p) − CapitalCostj ) − (1 − P OPj ) ∗ LGD ∗ CapitalCostj
(9)
where POP denotes the Probability of Payment which is the probability that the borrower will repay
the loan in full as estimated by the lender for each prospective deal (this variable was calculated
by the auto lender and recorded in the data), and LGD denotes the loss-given-default ratio – that
is, the expected fraction of the capital cost of the loan that will need to be written off if the
borrower defaults. We set LGD = .5, which is the value used by the lender. The T otalP aymentj
is calculated by multiplying the monthly payments (calculated using the standard equation p =
P r(1 + r)n /[(1 + r)n − 1], where p, P , r, and n are monthly payment, initial principal, monthly
APR, and term in months, respectively) by the term of the loan and CapitalCostj is obtained by
calculating the total payment at the LIBOR, which is an indicator of the lender’s capital cost.
7
In this example, “revenue” corresponds to the customer rate.
23
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
Tier1 (r=2.91%,  r=2.73%, w =9.86%,  w =1.85%)
Tier2 (r=7.05%,  r=3.34%, w =9.74%,  w =2.61%)
7
Tier3 (r=10.23%,  r=3.46%, w =11.18%,  w =2.55%)
6.2
6.1
6
6
6
5.5
Expected Customer Rate (%)
5.8
Expected Customer Rate (%)
Expected Customer Rate (%)
6.5
5.6
5.4
5.2
5
5.9
5.8
5.7
4.8
5
5.6
4.6
4.5
0
1
2
3
4
5
6
7
Minimum Price -- pmin (%)
8
9
10
4.4
0
1
2
3
4
5
6
7
Minimum Price -- pmin (%)
8
Tier4 (r=12.61%,  r=4.58%, w =12.25%,  w =2.76%)
5.92
5.36
5.915
Expected Customer Rate (%)
Expected Customer Rate (%)
5.34
5.32
5.3
5.28
5.26
5.24
5.5
0
1
2
3
4
5
6
7
Minimum Price -- pmin (%)
8
9
10
5.91
5.905
5.9
5.895
5.89
5.885
5.22
Figure 2
10
Tier5 (r=15.61%,  r=5.82%, w =14.47%,  w =3.55%)
5.38
5.2
0
9
1
2
3
4
5
6
7
Minimum Price -- pmin (%)
8
9
10
5.88
0
1
2
3
4
5
6
7
Minimum Price -- pmin (%)
8
9
10
Expected Customer Rate as a Function of the Minimum Price Threshold (pmin ) by Risk Tier
For all three of the policies that we consider, we assume that headquarters wants to maximize
total expected profit – that is, the sum of Πj (p) over all deals – and will set centralized prices and
associated parameters that achieve that goal. Field sales will negotiate individual deals within the
parameters specified by headquarters. We consider the following three policies:
1. Centralized Fixed Pricing (CFP). In this policy, we assume the headquarters does not
allow negotiation and sets a deal-specific rate based on observable customer and product characteristics. The price for each deal is given by:
P
pCF
= arg max
j
Πj (pj ) ∗ P{Wj ≥ pj }.
(10)
pj
2. Negotiated Rate with Floor (NRF). In this policy, headquarters imposes a lower bound
(or floor) on the final rate. Agents negotiate individual deals but cannot commit to a final price
lower than the minimum negotiated rate. The headquarters calculates the optimal lower bound for
each deal, pmin
, to maximize expected profit from each deal:
j
pmin
= arg max 0 ∗ P{R ≤ W ≤ p} + Πj (p) ∗ P{R ≤ peq ≤ p ≤ W }
j
(11)
p
+ E(Πj (peq )|R ≤ p ≤ peq ≤ W )) ∗ P{R ≤ p ≤ peq ≤ W }
+ E(Πj (peq )|p ≤ R ≤ W )) ∗ P{p ≤ R ≤ W },
N RF
where peq
. As before,
j = (R + W )/2. We denote the price for deal j under the NRF policy as pj
RF
min
if R > W , there will be no deal, otherwise, pN
= max[peq
].
j , pj
j
24
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
3. Imposed Reserve Price (IRP) Policy. All of our models so far have assumed that the sales
agent reserve prices are exogenous. Under the IRP policy, we consider the case where headquarters
sets the reserve price for each deal. To the extent that the reserve price incorporates elements
of sales agent negotiating skill, this may not be feasible. However, it is certainly feasible that
headquarters could influence the reserve price by, for example, setting deal-specific commission
structures (e.g., headquarters could provide a bonus to sales agents based on the spread between
the negotiated rate and the imposed reserve price). In this case, headquarters sets an optimal
reserve price `∗j as:
`∗j = arg max
E[Πj ((`j + Wj )/2)|Wj ≥ `j ] ∗ P{Wj ≥ `j }
(12)
`j
The expected price for each deal under the IRP policy is then pIRP
= E[(`∗j + Wj )/2|Wj ≥ `∗j ].
j
For the CFP and NRF policies, we compute the expected counter-factual profit for each deal by
e {Wj ≥ pi }, where pi denotes the price for deal j under policy i where i ∈
E[Πj (pij )] = Πj (pij ) ∗ P
j
j
e is the conditional take-up probability,
{CF P, N RF }, Πj (p) is calculated as in equation (9), and P
e {Wj ≥ pi } = P{Wj ≥ pi | customer’s decision at rate = Customer Rate}. For the IRP policy,
i.e., P
j
j
e {Wj ≥ `∗ }.
the expected counter-factual profit is computed by E[Πj ((`∗ + Wj )/2)|Wj ≥ `∗ ] ∗ P
j
j
j
The results of applying the three policies to the prospective deals in the test data set are shown
in Table 12. “Mean Final Price” is the average APR offered across all prospective deals, the “Mean
Take-up Probability” is the expected fraction of the prospective deals that would be accepted under
each policy, the average profit per negotiation is the average profit realized over all negotiations
(both successful and unsuccessful). All three policies showed significant increases in profit relative
to what was actually achieved. We did not have full information about the actual costs per deal so
some of this increase may be artificial. However, this limitation does not affect the relative results
from the three alternative policies considered.
Mean
Mean
Average
Final
Take-Up
Profit per
Increase
Policy
Price (APR) Probability Negotiation From Actual
Centralized Fixed Pricing (CFP)
9.89%
.76
$3,288
18.10%
Negotiated Rate with Floor (NRF)
9.64%
.70
$3,216
15.52%
Imposed Reserve Price (IRP)
8.82%
.87
$3,973
42.71%
Actual
8.49%
.76
$2,784
–
Table 12
Final prices, take-up probabilities and average profits for the three policies compared with the actual.
The Imposed Reserve Price policy achieved significantly higher revenue than the other two
policies (and the status quo) by a large margin. It was able to both achieve higher average margins
for funded deals as well as increase the expected number of total deals by lowering rates for deals
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
25
that were not accepted by customers. For deals that were actually booked, the mean customer
rate was 7.6%. For the same deals, the mean rate offered under the IRP policy was 9.03% with a
corresponding take-up rate of .96. For these deals, the mean profit under IRP was $4,446 compared
with the actual mean profit of $3,688. IRP also lowered prices on average for deals that were lost
and hence increased the acceptance rate from those deals. IRP offered an average rate of 8.17%
for these deals (the actual mean rate of them was 11.22%) and converted 57.9% of them leading
to additional expected profit of $2,516 for these deals, which contributed no profit in actuality.
The Centralized Fixed Pricing policy performed slightly better than the Negotiated Rate with
Floor policy. This would suggest that, in this case, the benefits of centralized pricing control
outweigh the benefits of allowing sales staff to negotiate prices with a simple lower bound. Still,
it is worth noting that imposing an optimized lower bound on negotiations provides a significant
improvement in profits (15.52%) over the status quo. This shows that the simple change of imposing
a price limit on negotiations can provide significant pricing improvement, provided the negotiating
limits are intelligently computed.
7.
Robustness
The EM method applied to the structural negotiation model described in this paper results in
estimated parameters for the distributions of W and R that meet intuition and lead to APR and
take-up predictions as good or better than competing models. However, the estimation approach
is based on several important structural assumptions. For one, it requires ex ante specification
of the form of the distributions of reserve price and willingness-to-pay. If these distributions are
misspecified, this could lead to incorrect results. Second, it specifies that, when a transaction
takes place, it takes place at the symmetric Nash bargaining equilibrium price – halfway between
the willingness-to-pay and the reserve price. While the symmetric Nash bargaining equilibrium
price has appealing theoretical qualities, it is conceivable that individual deals take place at prices
that are closer to either the reserve price or the customer willingness-to-pay depending upon the
respective bargaining skills of the customer and seller and possibly other factors. In this section
we investigate the robustness of our approach to these assumptions.
7.1.
Misspecification of the Joint Distribution
Our analysis of the auto lender data assumed that R and W were jointly normally distributed.
To assess the robustness of our results with respect to this assumption, we generated synthetic
data from different distributions with known parameters, and applied our EM algorithm to estimate the willingness-to-pay and reserve price distribution parameters, assuming (incorrectly) that
the data was normally distributed. Specifically, we drew 1,000 willingness-to-pay and reserve price
26
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
Dist. of R
Normal
Exp.
Logistic
Normal
Exp.
Exp.
Logistic
Normal
Logistic
Dist. of W
Normal
Exp.
Logistic
Exp.
Normal
Logistic
Exp.
Logistic
Normal
Actual Values
µp
σp Tr. %
94.05 50.93 68.60
87.45 63.12 66.00
89.78 51.44 64.90
88.80 69.95 65.10
93.25 44.58 69.30
88.39 45.80 67.20
88.10 67.20 64.50
89.56 52.21 65.10
94.45 47.74 68.40
Actual Values:
Table 13
EM Parameter Estimations
µˆr
σˆr
µˆw
σˆw
47.48 48.82 102.94
103.51
64.77 93.34 116.66
84.84
37.59 40.69 85.14
118.7
66.61 104.84 120.64
92.52
31.91 17.54 94.57
121.78
24.54 14.11 83.08
127.8
67.19 99.87 117.01
89.88
41.00 46.11 88.22
113.33
47.86 42.72 100.73
101.90
50
50
100
100
Predicted Values
µ̂p
σ̂p Tr. %
94.07 50.94 68.55
87.26 62.89 65.92
89.73 50.94 64.80
88.61 69.57 65.04
92.93 44.74 69.45
87.22 45.34 67.51
88.16 66.98 64.48
89.60 51.94 64.96
94.45 47.71 68.43
EM Method Robustness Check
observations from normal, exponential, and logistic distributions, and, for each distribution combination, estimated the willingness-to-pay and reserve price distribution parameters assuming both
the willingness-to-pay and reserve price are distributed normally. We evaluated the nine instances
shown in Table 13. We fixed the mean and standard deviation of the willingness-to-pay to 100, and
mean and standard deviation of the reserve price to 50, and generated data from the following ran√
dom variables: W1 ∼ N (100, 100), W2 ∼ Exp(λ = 1/100), and W3 ∼ Logistic(µ = 100, s = 100 3/π)
where µ is the location and s is the scale parameter; similarly, R1 ∼ N (50, 50), R2 ∼ Exp(λ = 1/50),
√
and R3 ∼ Logistic(µ = 50, s = 50 3/π). We used the same criteria for the stopping conditions of
the EM algorithm as before.
Table 13 shows the results. In Table 13, µp and σp correspond to the mean and standard deviation of the prices realized among the 1,000 observations (for the deals where the trade occurs), the
column labeled “Tr. %” shows the corresponding trade percentage among these 1,000 observations.
p
µ̂p = E(p|W ≥ R) and σ̂p = V ar(p|W ≥ R) and the last column corresponds to the predicted
trade probabilities when R and W are both normally distributed with mean and standard deviations that are taken as the estimated parameters in columns 6-9. In all cases, the parameters
estimated by the EM method generated price distributions and trade percentages that closely
matched those in the holdout sample. Estimation of the means and standard deviations of the
underlying distributions were less robust to the forms of the distributions. When the actual distributions were normal, the errors in estimating the means of the reserve price and willingness-to-pay
distributions were 5.04% and 2.94% respectively. In the worst cases, the corresponding errors were
50.92% (Exponential/Logistic) and 20.64% (Normal/Exponential) respectively. This degradation
in performance is likely due to the very different form of the exponential distribution relative to
the normal.
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
7.2.
27
Asymmetric Bargaining
In our model, we assumed that the status quo utilities (i.e. the utility obtained if one decides not
to bargain with the other player) of the seller and the customer are equal, so that for cases where
the trade takes place, the Nash bargaining solution is symmetric, i.e.,
R≤W
⇒
p = 0.5R + 0.5W
The generalized Nash bargaining solution (Binmore et al. 1986) specifies that the price for a
particular deal will be given by p = αR + (1 − α)W for some α ∈ (0, 1). If α < 0.5, it can be
interpreted as the sellers’ having systematically more bargaining power than the customers and, if
α > 0.5, the customers would have more bargaining power. Ideally, we would like to estimate the
value of α from the data. Unfortunately, the following proposition shows that α is not identifiable
from data and therefore must be assumed.
Proposition 2. Assume that the true values of the model primitives are R1 , R2 , . . . , RN and
W1 , W2 , . . . , WN and α and that a trade occurs for observations 1, 2, . . . , n at rate pi = (1 − α)Ri +
αWi with Wi ≥ Ri , and no trade occurs for the remaining observations implying that Wi < Ri for
i = n + 1, . . . , N . Then, the model primitives are not jointly identifiable.
Proof:
Fix the W ’s and assume that we consider some α̂ 6= α for some 0 ≤ α̂ ≤ 1. Define
R̂i =
(α − α̂)Wi + (1 − α)Ri
1 − α̂
We will show that the underlying variables R̂1 , R̂2 , . . . , RˆN and W1 , W2 , . . . , WN and α̂ will produce
exactly the same set of observations (prices and no-trade outcomes) as the “real” values. For
i = 1, 2, . . . , n:
(1 − α̂)R̂i + α̂Wi
(α − α̂)Wi + (1 − α)Ri
=(1 − α̂)
+ α̂Wi
1 − α̂
=αWi − α̂Wi + Ri − αRi + α̂Wi
=αWi + (1 − α)Ri
=pi
And for i = n + 1, . . . , N , let γ =
α−α̂
.
1−α̂
Then, γ ≤ 1 and:
R̂i = γWi + (1 − γ)Ri
= Ri + γ(Wi − Ri )
≥ Wi since Wi − Ri < 0 and γ ≤ 1
Q.E.D.
28
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
This raises the question of whether or not an incorrectly specified value of α would lead to
significant misestimation of the parameters of the underlying distribution. To address this question,
we generated synthetic data for three values of α = 0.2, 0.5, and 0.8 and normally distributed values
for R and W . We then estimated the model parameters µr , µw , σr , and σw using the EM algorithm
and assumed values of α̂ from 0.1 to 0.9 at increments of 0.1. The results are shown in Table 14
for true values (µr , σr , µw , σw ) = (58, 6, 55, 10) and (µr , σr , µw , σw ) = (50, 5, 60, 10). The former set
of parameter values corresponds to 40% trade probability and the latter set corresponds to 80%
trade probability.
True Parameters
µr
σr
µw
σw
58
6
55
10
α = 0.2
EM Estimations
α̂
µˆr
σˆr
µˆw
σˆw
0.1
56.95 4.99 54.22 9.27
0.2
56.99 4.81 54.17 10.07
0.3
56.28 4.25 53.21 12.07
0.4
56.46 4.23 52.74 13.71
0.5
56.19 4.04 51.49 17.06
0.6
56.27 4.28 50.97 20.37
0.7
56.21 4.02 49.54 26.34
0.8
56.80 4.47 47.01 36.46
0.9
58.95 5.50 48.02 42.58
α = 0.5
EM Estimations
α̂
µˆr
σˆr
µˆw
σˆw
0.1
57.63 6.46 55.22 6.44
0.2
56.55 5.10 54.14 7.67
0.3
55.55 4.33 52.81 9.57
0.4
55.29 4.08 52.31 11.14
0.5
55.79 4.31 52.53 12.13
0.6
55.48 4.22 51.42 15.31
0.7
55.61 4.20 50.62 19.26
0.8
55.44 3.98 47.98 28.48
0.9
57.57 4.88 49.80 30.40
α = 0.8
EM Estimations
α̂
µˆr
σˆr
µˆw
σˆw
0.1
55.92 7.06 53.69 5.81
0.2
54.49 5.23 52.24 7.15
0.3
54.09 4.80 51.70 8.24
0.4
54.20 4.63 51.62 9.05
0.5
53.69 4.28 50.57 11.35
0.6
54.42 4.52 50.84 12.22
0.7
53.76 4.21 49.35 17.02
0.8
53.91 4.27 47.82 23.47
0.9
54.87 4.57 47.00 31.46
*LL: Log-Likelihood
Table 14
LL*
-6672.10
-6718.01
-6774.54
-6897.76
-7071.26
-7304.71
-7501.00
-7931.36
-8294.12
LL*
-6565.73
-6503.83
-6561.17
-6654.65
-6794.94
-7006.49
-7230.15
-7568.02
-7837.32
LL*
-6550.62
-6459.69
-6515.71
-6573.37
-6721.09
-6850.56
-7110.46
-7445.59
-7806.27
True Parameters
µr
σr
µw
σw
50
5
60
10
EM Estimations
µˆr
σˆr
µˆw
σˆw
40.89 23.70 62.26 7.11
46.92 17.87 63.74 7.33
50.32 15.12 64.95 7.50
52.72 11.60 65.24 8.42
50.32 4.08 64.66 16.07
50.46 3.86 67.99 19.85
50.49 3.79 73.69 26.21
50.83 3.93 84.31 38.03
54.70 6.06 94.44 46.06
EM Estimations
µˆr
σˆr
µˆw
σˆw
36.59 25.08 58.56 4.90
43.81 18.05 59.93 4.70
48.88 11.12 59.53 5.36
50.65 8.00 59.43 6.27
50.26 5.06 59.50 9.20
50.29 4.53 61.32 11.71
50.52 4.34 64.25 15.29
50.88 4.29 69.73 21.24
51.97 4.47 81.56 33.66
EM Estimations
µˆr
σˆr
µˆw
σˆw
30.71 27.48 54.90 3.82
39.96 18.17 55.93 3.54
44.34 13.32 56.42 3.40
47.33 9.40 56.06 3.99
47.45 4.84 54.91 7.12
47.92 4.58 56.14 8.24
47.55 3.93 58.60 11.88
48.02 3.84 62.51 15.95
48.68 3.84 71.96 26.11
Asymmetric Bargaining Synthetic Data Test
LL*
-7964.78
-7712.79
-7568.83
-7419.24
-7021.68
-7175.71
-7437.48
-7845.28
-8468.77
LL*
-7648.84
-7278.44
-6925.84
-6753.32
-6679.13
-6808.68
-7032.34
-7351.11
-7850.62
LL*
-7490.88
-7002.48
-6650.49
-6461.98
-6377.11
-6469.82
-6681.62
-6952.94
-7444.50
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
29
In these cases, assuming α = 0.5 does a reasonable job of estimating the underlying parameters
even when the true value of α is 0.2 or 0.8. In fact, for some cases in which the actual value of α is
not 0.5, using a value of α = 0.5 in the estimation actually results in estimates of parameters that
are closer to their true values than using the true value of α. Additionally, the log-likelihood is not
always the highest for the estimations using the true values of α, reflecting the fact that α is not
identifiable from the data. It also notable that, for each set of estimated parameters, the predicted
p
values of µ̂p = E(p|W ≥ R), σ̂p = V ar(p|W ≥ R), and trade probabilities are very close to their
true values.
8.
Discussion and Conclusions
In this paper, we proposed a structural EM method to estimate the joint distribution of customer
willingness-to-pay and seller reserve prices from a series of bi-lateral price negotiations. The resulting distributions can be used to predict both the take-up probability and the price of negotiations.
We showed that our method successfully uncovers the parameters of willingness-to-pay and reserve
price distributions from synthetic data given a sufficient number of observations. When applied to
a real-world auto-lending data set, our method generally predicted take-up and APRs on a test
data set better than standard regression-based approaches. Furthermore, our estimates provide
interesting insights into the reserve price distribution among sales representatives and how it varied
as a function of deal characteristics.
While our structural EM method has its strengths, the EM algorithm is notoriously computationally intensive and can be slow to converge. Computational considerations limited us to using
only 200,000 observations out of 950,985 available in the estimation process. In this case, the results
using the smaller sample were still an improvement over direct maximization of the likelihood, but
the computational intensity of the approach could be a limitation in other settings. In addition,
our approach requires a priori specification of the bargaining parameter α and the underlying form
of the distributions of R and W . While robustness tests suggest that the results are reasonably
insensitive to the form of the joint distribution and the value of α, severe misspecification (e.g. normal for exponential) can lead to misleading results. Both the development of improved algorithms
and techniques for choosing appropriate distributions and values of the bargaining parameter are
useful directions for future research.
Lastly, we used our estimation method to perform a counter-factual analysis of pricing policies on
a real-world data set of auto loans. The analysis showed if local sales staff are given pricing discretion, significant profit improvements can be achieved by providing an optimized lower bound on the
negotiated price. The lower-bound prevents sales staff with very low reservation prices from dropping the final price too low. These results also provide an explanation for the commonly-observed
30
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
practice of providing sales staff limited discretion to change prices relative to a centrally-established
price list. Yet even with optimized lower bounds, deal-specific optimized fixed prices generated
slightly higher profits than negotiated prices. However, the highest level of profits observed in our
counterfactual analysis occurred when headquarters was allowed to set optimized reserve prices for
each deal and individual sales staff then negotiated deals given those optimized reserve prices. This
last result suggests that a combination of data-driven pricing controls and sales agent negotiation
can produce higher profits than either approach alone. How to achieve such results in practice
through appropriate training, controls, and sales force incentives is an open topic for further study.
References
Anupindi, R., M. Dada, S. Gupta. 1998. Estimation of consumer demand with stock-out based substitution:
An application to vending machine products. Marketing Sci. 17 406 – 423.
Bhardwaj, P. 2001. Delegating pricing decisions. Marketing Science 20(2) 143 – 169.
Binmore, K., A. Rubinstein, A. Wolinsky. 1986. The nash bargaining solution in economic modeling. RAND
Journal of Economics 17(2) 176 188.
Breidert, C., M. Hahsler, T. Reutterer. 2006. A review of methods for measuring willingness-to-pay. Innovative Marketing 2(4).
Chan, K. S., J. Ledolter. 1995. Monte carlo em estimation for time series models involving counts. Journal
of the American Statistical Association 90(429) 242 252.
Chatterjee, K., W. Samuelson. 1983. Bargaining under incomplete information. Operations Research 31(5)
835 – 851.
Conlon, C. T., J. H. Mortimer. 2007. Demand estimation under incomplete product availability. Working
paper. Harvard University.
Dempster, A. P., N. M. Laird, D. B. Rubin. 1977. Maximum likelihood from incomplete data via the em
algorithm. J. Roy. Statist. Soc. B 39(1) 1 38.
Desai, P. S., D. Purohit. 2004. Let me talk to my manager: Haggling in a competitive environment. Marketing
Sci. 23(2) 219 233.
Joseph, K. 2001. On the optimality of delegating pricing authority to the sales force. Journal of Marketing
65(1) 62 – 70.
Kuo, C., H. Ahn, G. Aydin. 2011. Dynamic pricing of limited inventories when customers negotiate. Oper.
Res. 59 882 – 897.
Lal, R. 1986. Delegating pricing responsibility to the salesforce. Marketing Science 5(2) 159 – 168.
Levine, R. A., G. Casella. 2001. Implementations of the monte carlo em algorithm. Journal of Computational
and Graphical Statistics 10(3) 422 439.
31
Phillips, Şimşek, and van Ryzin: Estimating Buyer WTP and Seller RP from Negotiation Data and the Implications
McLachlan, G., T. Krishnan. 1996. The EM Algorithm and Extensions. John Wiley & Sons, New York, NY.
Mishra, B. K., A. Prasad. 2004. Centralized pricing versus delegating pricing to the salesforce under information asymmetry. Marketing Science 23(1) 21 – 27.
Mishra, B. K., A. Prasad. 2005. Delegating pricing decisions in competitive markets with symmetric and
asymmetric information. Marketing Sci. 24(3) 490 – 497.
Muthoo, A. 1999. Bargaining Theory with Applications. Cambridge University Press, Cambridge, UK.
Phillips, R. L., A. S. Şimşek, G. van Ryzin. 2014. The effectiveness of field price discretion: Empirical
evidence from auto lending. Forthcoming in Management Science.
Phillips, R. L., R. Raffard. 2011.
Price-driven adverse selection in consumer lending.
Work-
ing Paper. Center for Pricing and Revenue Management, Columbia University. Available
on-line at: http://www7.gsb.columbia.edu/cprm/sites/default/files/files/Working%20Papers/2011-3Price-Driven-Adverse-Selection.pdf.
Roth, S., H. Woratschek, S. Pastowski. 2006. Negotiating prices for customized services. Journal of Service
Research 8(4) 316 – 329.
Stefanescu, C. 2012. Multivariate demand: Modeling and estimation from censored sales. Working Paper.
Management Science and Operations, London Business School, London, UK. Available on-line at:
http://faculty.london.edu/cstefanescu/Stefanescu09.pdf.
Vulcano, G., G. van Ryzin, W. Chaar. 2010. Choice-based revenue management: An empirical study of
estimation and optimization. Manufacturing Service Oper. Management 12(3) 371392.
Weinberg, C. B. 1975. An optimal commission plan for salesmen’s control over price. Management Sci.
21(April) 937 – 943.
Williams, L. 2014. Pricing mediated by humans: the role of sales rep discretion. Presented at POMS-College
of Behavioral Operations Management. Atlanta, GA. May 8, 2014.
Wu, C. F. 1983. On the convergence properties of the em algorithm. The Annals of Statistics (11) 95 – 103.
Download