Recommender Systems and Consumer Product Search (full paper, word count: 7773) Vidyanand Choudhary Zhe (James) Zhang University of California, Irvine VeeCee@uci.edu University of Texas, Dallas zxz145430@utdallas.edu Abstract Recommender systems are popular tools used by online retail websites to suggest products to consumers, such as those featuring music, movies, and other online content. An online retailer can use this technology to entice consumers to buy the recommended products, and consumers can avoid searching for alternative products if they accept the recommended products. In this paper, we develop an analytical framework to examine the optimal recommender system strategy of an online multi-product retailer and the impact of the recommender system on society. We show that the retailer’s optimal recommender system strategy is driven by consumers’ search cost and misfit cost, as well as the probability that a consumer continues product search if she rejects the recommendation. We find that when the recommender system is relatively less accurate, the firm cannot mislead all consumers with the recommended products. Although these recommended products are a perfect match for some consumers, other consumers may reject the recommendation. When the firm improves the accuracy of the recommender system, the net of aggregate consumer mismatch cost and search cost decrease. This means that the firm’s use of recommender system generates externality that benefits consumers. Moreover, the firm’s revenue increases, and thus, social welfare increases. We also find that when the recommender system is relatively accurate, the firm deliberately misleads all consumers by not giving them their ideal products. Nonetheless, all consumers take the recommended products, and the retailer’s revenue increases. This leads to a mismatch cost that is borne by the consumers, and aggregate consumer surplus reduces; but interestingly, this does not lower social welfare. 1 1. Introduction Product recommender systems are personalized sale assistance tools to make product search easier for consumers before making their purchase decisions. In general, online retailers or intermediaries use recommender systems to provide personalized product or content recommendations that individual consumers may be interested in. These systems are used in cases where there are a large number of differentiated products and become extremely popular in recent years in a variety of applications, like books, movies, and music. Figure 1 illustrates the personalized product recommendation provided by Amazon’s recommender system. A consumer sees this set of recommended products at the home page immediately after her logging in, and she can end product search if she finds what she wants among these products. From consumers’ perspective, recommender systems make their product search easier and save product search time. The question is that how does a recommender system affect consumers’ product search process; and how does an online retailer strategically manage the recommender system to maximize revenue. Figure 1: Amazon product recommendation The value proposition of recommender systems is that they provide consumption experience that is personalized to consumers’ tastes (Hosanagar, Fleder, Lee and Buja, 2014). Giving individual consumers what they want reduces their product search cost in a context where consumers face many choices. In Figure 1, Amazon identifies this consumer as someone who is looking for books about business intelligence or data mining, and thus, recommends to her three books, Principles of Data Mining, Python in a Nutshell, and Introductory Statistics with R. She would buy one of the recommended 2 books if it was the book she was looking for and finish the product search; otherwise, she might start searching for other books using the in-store search engine. As a result, a consumer has a better shopping experience with Amazon if the recommended products are the ones she is interested in. Thus, from online retailers’ perspective, these tools are used to build loyalty and turn browsers into buyers. Google News reports that use of recommendation increases articles viewed by 38% (Das et al. 2007); Amazon reports that 35% of sales originate from recommendations (Lamere and Green 2008); and Netflix reports that 60% of their movie rentals originate from recommendations (Thompson 2008). Thus recommender systems are highly relevant and becoming increasingly powerful tool for retailers. On the other hand, online retailers can manipulate the product search process of consumers by providing information about only a selection of products or contents. Online advertising-supported intermediaries, like online news sites and search engines, expose advertisements to viewers in more prominent places than the content that is of interest to consumers. This is referred to as search diversion by Hagiu and Jullien (2014). One recent account is that Google allegedly manipulated search results—promoting its own services and suppressing competitors’1 2. In addition, Amazon deliberately removed pages promoting books by Hachette in midst of a dispute over e-book pricing3. Similarly, an online retailer can deliberately direct consumers’ attention to specific products in the form of personalized recommendation, In this paper, we examine a context where consumers first get recommendation from the recommender system, and then they may use a product search engine to find alternative products before making their final purchase decision. The research questions are: 1) What is the effect of product recommendation on consumers’ product search and purchasing decision? 2) How does an online retailer strategically deploy a recommender system? 3) What is the effect of product and consumer characteristics on the online retailer’s optimal product recommendation strategy? 4) What is the impact of a recommender system on consumer surplus and social welfare? 1 2 3 http://www.telegraph.co.uk/technology/google/9281639/Google-warned-by-EUs-antitrust-inquiry-of-search-manipulation-concerns.html http://www.bidnessetc.com/22603-yelp-doesnt-like-googles-likely-antitrust-settlement-with-the-eu/ http://bits.blogs.nytimes.com/2014/05/23/amazon-escalates-its-battle-against-hachette/?_php=true&_type=blogs&_r=0 3 Understanding consumer information search and acquisition has been an important theme in economics and marketing, and information systems. In Bakos (1997), the electronic market system provides product search function that lowers buyer search costs and improves market efficiency. In recent years, online retailers have not only improved product search functions, but also provided recommender systems to consumers to improve their product search experience. Due to its popularity, recommender systems have become a major theme of research (Murthi and Sarkar 2003, Dellarocas 2009, Hosanagar et al. 2014), and its impact on consumer product search and market structure is being studied (Kim, Albuquerque, and Bronnenberg, 2010). Prior research has typically treated recommender systems as a tool or an add-on service that provide accurate match between buyers and products (Häubl & Trifts 2000, Tam & Ho 2006, Fong 2012). In this study, we develop a framework in a context where products are differentiated and an online multi-product monopolist can skew recommendations in favor of products with higher profit margin. Consumers accept or reject the product recommendation depending on their preferences. If consumers reject the recommendation, then they may continue to search for alternative products, thus incurring search costs. We find that the firm can benefit from strategically recommending products with higher profit margin. We also show that the firm’s decision about product recommendation depends on product profit margin, consumer search cost and misfit cost. In our analysis of consumer surplus and social welfare, we find that when the recommender system is relatively less accurate, the firm cannot mislead all consumers with recommended products. When the firm improves the accuracy of the recommender system, both the consumer surplus and the firm’s revenue increase, thus, social welfare increases. On the other hand, when the recommender system is relatively accurate, the retailer deliberately misleads all consumers by not giving them their ideal product. This leads to a mismatch cost that is borne by the consumers. Thus, when the firm improves the accuracy of the recommender system, consumer surplus falls. Surprisingly, this does not lead to any decrease in social welfare. 4 1.1 Literature Review A simplified taxonomy of recommender systems divides them into two types: contentbased and collaborative filtering-based systems. Content-based systems use product information (e.g., genre, mood, author) as criteria to recommend items similar to those a user rated highly. On the other hand, collaborative filtering-based systems recommend products that other consumers with similar tastes and preferences bought in the past. There is a large body of work on designing recommender systems, and Adomavicius and Tuzhilin (2005) provide an extensive review in the information systems literature. The economic implications of recommender systems and how they affect society are not well understood. There is a growing stream of research focusing on the economic implications of electronic commerce in online market. Studies have examined the impact of use of recommender systems on sales (Ansari et al. 2000, Das et al. 2007, Bodapati 2008) and future sales (Johar et al. 2014), the influence of recommender systems on consumer search behavior (Kim et al. 2010), “long tail” effect of information goods (Fleder and Hosanagar 2009, Oestreicher-Singer and Sundararajan 2012), and personalization service (Murthi and Sarkar 2003, Hosanagar et al. 2014). Another relevant stream of literature examines the impact of the reduction of consumer search cost on firms’ profit and social welfare. Bakos (1997) studies the implication of consumer search cost on online sellers’ pricing and product strategies, and the resulting reduction on market inefficiency. Branco, Sun and Villas-Boas (2012) develop a framework to study the role between graduate learning, search cost, and consumers’ purchase decision. They discuss the impact of consumer search on profits and social welfare, and how the seller chooses its price to strategically influence the extent of consumers’ search. We consider recommender systems as an integral part of the consumer product search process, and consumers use recommender systems in combination with other product search features, such as search engines, before making a purchase decision. We consider the case where recommender systems can be used strategically to favor higher margin products. The firm needs to balance the tradeoff between profit margin of the 5 recommended product and the likelihood of consumers’ acceptance of the recommendation. In our framework, product recommendation is a substitute for the use of search engines as consumers can learn about the product recommendation without incurring search cost. Therefore, this paper also contributes to the literature on consumer search. We describe the model setup in §2 and the results in §3. We conclude with a discussion and directions for future research in §4. 2. Model Setup 2.1 Consumers In a world with differentiated product offerings, we consider a market of consumers with heterogeneous preferences and one monopolist firm selling products that are spatially differentiated along a single dimension. In our horizontal differentiation model, consumer preference or location is denoted by, xi and it is distributed on the interval [0,1] . The monopolist’s products are differentiated along the same dimension so that product locations are evenly distributed on [0,1] . A consumer incurs a “misfit” cost t per unit distance between her ideal location and product location. This cost represents a consumer’s loss due to obtaining a product at a location that is different from her ideal location per unit distance. All consumers have identical unit demand, subject to a reservation utility V , and they are risk neutral. Utility of a consumer at xi from buying a product at x is: U ( xi , x) V | xi x | t (1) All products seem ex ante identical to consumers. This implies that consumers have no prior knowledge of the location of any product. Each consumer can learn a product’s location by searching and evaluating the product. The time and effort to search and analyze the product is the search cost, represented by parameter c . 6 A consumer sequentially searches and examines one product at a time. She decides whether to purchase the examined products or to continue the search. She will stop searching and accept a product if the expected gain in utility from an additional search is less or equal to the cost of another search. It follows that a consumer will purchase a product when it is within a certain distance to her ideal location. We refer to this distance as the acceptance range, and we formally define this range in the next section. Figure 2: Consumer decision tree There are two types of consumers: a proportion of consumers, , who are willing to search and the remaining consumers, 1 , who do not search. Consumers of the first type will search if they reject the recommendation. They will continue searching till they find a product xs in their acceptance range. Consumers of the second type will never search for products and will not buy any product if they reject the recommendation. This may occur for several reasons for example some consumers may have an outside option in the form of an existing or substitute product. We assume that the acceptance range is identical for both types of consumer. Figure 2 illustrates the flow of consumer decisions. 2.2 Firm and product profit margin The monopolist firm sells a continuum of horizontally differentiated products at the same price. There are several examples that fit this setting in the online movie or music subscription models (Netflix and Spotify), where online users pay monthly subscription fee and do not pay for individual movie or music titles. This setting is also applicable to the revenue models where online intermediaries such as search engines, newspapers that charge little or no fee from users, and instead obtain revenue from 7 advertisers. The firm gains heterogeneous profit margin from sales of different products. This can happen due to revenue and cost factors. For example, in the case of Netflix and Spotify, the quality of digital media (non-HD and HD) impacts the price that retailer can charge as well as the retailer’s marginal costs. In the case of the Google search engine, the revenue from keyword advertising varies across different keywords. In general, the profit margin can vary across products because they are provided by different suppliers with different contract terms. To simplify our model, we assume that products have heterogeneous profit margins and all products have the same exogenously determined price. We assume a mapping from product location to profit margin for each product as v( x) mx , where x is product location and m is the highest profit margin. The firm knows the distribution of consumer locations, but not the exact location of a consumer. The firm can invest time and effort to gather information about consumers and such analysis can better inform the firm about the consumer’s location. In our model, the firm is able to narrow a consumer’s possible location to a window. We refer to this window of possible consumer locations as a bucket. The firm knows that the consumer’s true location is within the bucket so that the probability of the bucket containing the consumer’s location is 1. The range of a bucket is denoted by r . In this way, instead of a consumer’s exact location, the firm identifies a bucket that the consumer belongs to, which spans from a j to aj r . Figure 1 illustrates that the firm observes the bucket [a j , a j r ] associated with a consumer without knowing her ideal location, xi . We assume that the consumer’s location is equally likely to be at any point within the bucket and this is known to the firm. This implies that the firm has better information about the location of consumers when the range of the bucket ( r ) is small. At the extreme values of r , when r 1 the firm has no information about consumers, and when r 0 , the firm has precise information about consumers’ locations. 8 Figure 3: Consumer bucket [a j , a j r ] for a consumer at xi When a consumer arrives at the retailer, the firm does not observe the true location of the consumer. Instead, he observes a bucket. In aggregate, the firm observes buckets whose left border ( a j ) are uniformly distributed on the interval [0,1 r] . When the firm recommends a product at xr to a consumer associated with a bucket [a j , a j r] , there are three possible outcomes: 1) the consumer buys the recommended product and the firm gets the profit margin xr m ; 2) the consumer rejects the recommendation, and searches on her own, to purchase a product at xs , resulting in expected profit of E[ xs m] to the firm; and 3) the consumer rejects the recommendation, and does not buy any product, resulting in the firm getting zero profit margin. The probabilities associated with these three outcomes depend on the location of the product recommendation ( xr ), the left border of the bucket ( a j ), and the range of bucket ( r ). The firm’s expected revenue from a bucket whose left border is at a j is E[ Ra ( xr | r )] . Thus, the firm’s total expected revenue j consists of the expected revenue from all buckets and it is: E[ R ] 1 r 0 E[ Ra j ( xr | r )]da (2) In this paper, we make two assumptions. 1) We assume the price of all products is zero so that consumers’ purchasing decisions depend on their baseline utility, misfit and search costs. 2) We assume that for consumers, whose locations are close to the ends of the line (0 or 1), they can always find the same number of products located on the left and on the right of their individual locations. 3. Analysis A consumer sequentially searches for product information on the retailer firm’s website. In a sequential search, a consumer decides to stop or continue a search each time after 9 having searched a product. The theory of optimal sequential search states that a consumer continues a search only if the marginal benefit of doing so outweighs the marginal cost. This leads to a stopping criterion that we refer to as the Acceptance Range of a consumer. 3.1 Acceptance range A consumer incurs cost c per search when she looks for a product and learns its location. Moreover, a consumer incurs misfit cost t per unit distance between her ideal location and the location of purchased product. A consumer will stop searching and purchase the currently examined product if the gain in the expected utility of an additional search is less than or equal to the search cost c . We define that a consumer is indifferent between buying a product located at x distance away from her ideal location xi and continuing searching. This critical distance xi is obtained from the following equation: 2 xi x xi ( x xi )tdx c (3) The distribution of firm’s products is common knowledge, thus, a consumer knows there is a product with probability dx along any differentiated part of the unit line. On the left-hand side of the equation (3) is the expected gain of utility due to finding a product that has a distance less than x to the consumer’ location, and on the right-hand side of the equation is the search cost. Solving the equation (3), we obtain the acceptance range x c t . This implies that a consumer will stop searching for products and buy the currently examined product if it is closer than or equal to x c t distance from her ideal location. PROPOSITION 1: When search costs increase dx dx 0 or misfit costs decrease 0 dc dt then the expected distance between the consumer’s ideal location and the location of the purchased product will increase. Proof is in the Appendix 3.A. 10 Proposition 1 establishes the stopping rule of an individual consumer who strategically balances the trade-offs between her misfit cost and search cost. When her search cost is high, having examined a product, the cost of finding a fit product (closer to her ideal location) is likely to be high. Thus, she will stop searching when she finds a product even if the product is distant from her ideal location when she has high search cost. Figure 4 illustrates the concept of this stopping rule for a consumer, whose location is at xi . She will buy a product if it is within the acceptance range [ xi c t , xi c t ] , otherwise, she may continue to search for alternative products. Figure 4: Consumer's acceptance range 3.2 In the absence of recommender system Some consumers will search for products when the firm does not have a recommender system. If a consumer at xi searches, she incurs cost c for each search, and she stops searching till she finds a product that is within her acceptance range [ xi c t , xi c t ] . Hence, the probability of finding products outside the acceptance range on the first k 1 searches and finding a product within the acceptance range on the k th search is P(k ) p(1 p)k 1 , where p is the probability of a random draw of a product that falls in the acceptance range. Therefore, the number of searches till a product within the acceptance range is discovered by a consumer follows a geometric distribution, and the expected number of searches is 1/ p . For a consumer at xi , the probability of a random draw of products being within the acceptance range is p 2 c t as she would buy any product within [ xi c t , xi c t ] . Hence, the expected number of searches is E[k ] 1/ p t c / 2 , and the expected cost of searches is the product of the expected number of searches and the cost per search: E[kc] ct / 2 . 11 A consumer will continue to search for products till she discovers a product xs that is within the acceptance range [ xi c t , xi c t ] . Since the products are evenly distributed, the expected value of product xs is E[ xs ] xi c t xi c t xi . 2 Moreover, proportion of consumers will search and 1 proportion of consumers will not search. This implies that the firm’s expected profit from a consumer at xi in the absence of a recommender system is: E[ xs m] xi m (4) 3.3 Product recommendation The firm can recommend a product to each individual consumer when she arrives at the website. Depending on the bucket [a j , a j r] associated with the consumer xi , the firm recommends a product at xr (a j ) . This consumer will accept the recommended product xr if it is within her acceptance range [ xi c t , xi c t ] as we have shown in §3.1. If so, then the firm has profit margin xr m . The consumer at xi will reject the recommended product xr if it is outside the acceptance range [ xi c t , xi c t ] . The consumer will continue to search with a probability till she finds an acceptable product, and thus the firm’s expected profit margin is xi m as shown in (4). The firm prefers to recommend products with larger x , as the profit margin is v( x) mx , which implies larger profit margin for larger x . Figure 5 illustrates the process of the firm’s recommending product. The 1st step is that the firm understands the true location of the consumer can be at any location within the range of the bucket [a j , a j r] with equal probability. The second step is that the firm decides to recommend a product at xr (a j ) , and expects three outcomes which depend on the relative position of the left border of the bucket ( a j ), the range of a bucket ( r ), and the product recommendation ( xr (a j ) ). When the product recommendation is within the range of a j c t xr (a j ) a j r c t and xr (a j ) a j r c t , the consumer will accept the 12 recommendation xr if xr (a j ) c / t xi a j r . The likelihood of a consumer accepting the recommendation xr is: Laccept ( xr ) a j r ( xr c / t ) r . If a j xi xr (a j ) c / t , then she does not accept the recommendation. If she rejects the recommendation, then with a probability , she will search, and with a probability 1 , she will leave the retailer’s website. Though range of the buckets seems to be greater than the acceptance range ( r c / t ) in Figure 5, the concept is applicable to the cases where r c / t . Figure 5: The firm’s product recommendation and consumer’s decision when a j c t xr (a j ) a j r c t and xr (a j ) a j r c t When a j c t xr (a j ) a j r c t and xr (a j ) a j r c t , the firm’s expected revenue of recommending product at xr is: E[ Ra j ( xr )] a j r xr c / t xr m / rdxi xr c / t aj xi m / rdxi (5) The first term is the expected profit margin if the recommendation was within the consumer’s acceptance range ( xi [ xr c / t , a j r ] ). The second term is the expected profit margin if the recommendation was outside her acceptance range ( xi [a j , xr c / t ) ). When xr a j r c t , the firm gets zero from recommendation. When xr a j c / t , the second term is zero; while when xr a j r c / t , the consumer may reject the recommendation and give up product search if the consumer location is greater than xr c / t xi a j r . Thus, the firm’s maximization problem is: 13 max E[ Ra j ( xr )] xr subject to xr a j c t xr a j r c t xr a j r c t We solve the m( c (1 ) t (a j r (2 ) xr )) r t x1 c / t 1 a j r 2 2 maximization 0 as FOC. The problem value and of obtain recommendation satisfies the FOC. Since 0 1 , the optimal product recommendation x1 a j r c / t , the constraints on the optimal product recommendation are only xr a j r c t and xr a j c t . When xr a j c / t , the firm can increase the value of xr to xr a j c t , which leads to higher expected profit margin from recommendation (greater value of the first term in (5) and the second term remains as 0), and thus, the greater expected revenue. Note that when r 2 c t , a j r c t a j c / t , thus, xr a j c / t is the only boundary solution. When r2 c t , aj r c t aj c / t , it is possible that a j c t xr (a j ) a j r c t which means that a consumer, whose location is either extremely large or small within the bucket, may reject the recommended product xr . In this case, the firm’s expected revenue of recommending a product at xr is: E[ Ra j ( xr )] xr c / t xr c / t xr m / rdxi xr c / t aj xi m / rdxi The first derivative of the expected revenue in (6) is a j r xr c / t xi m / rdxi dE[ Ra j ( xr )] dxr c/t (6) 2(1 )m 0. r This implies that the optimal solution is the highest possible value xr a j r c / t . Thus, when r 2 c / t , the constraint is xr a j r c t , and xr a j r c / t is a boundary solution. 14 We further assume that the search cost is low compared to the misfit cost, that is t t c . The optimal product recommendation strategy is summarized in Lemma 1. 9 4 LEMMA 1: 1) When 0 r c / t , the optimal product recommendation is xr a j c / t for buckets a j [0,1 c / t ] , and xr 1 for buckets a j (1 c / t ,1 r] ; 2) When c / t r r1 , the optimal product recommendation is xr x1 for buckets a j [0, a1 ] , and xr a j c / t for buckets a j (a1 ,1 r ] ; 3) When r1 r 1 , the optimal product recommendation is xr x1 for all buckets a j [0,1 r ] ; where x1 c / t a1 1 a j r , 2 2 r1 1 c / t and 2 r c/t . 1 It is obvious to see that it is more like that a consumer will accept the recommended product if the range of a bucket ( r ) decreases. Moreover, when every consumer, who rejects recommended product, continues to search for alternative products ( 1 ), the firm recommends products that do not match with consumers’ preferences. The firm recommends products outside the bucket if the range of buckets is relatively small, or recommends the product at the most right border of the bucket when the range is large. When consumers may give up product search all together with a probability, the firm recommends products that are inside the bucket so that the recommendation may be a perfect match for the consumer. PROPOSITION 2: The location of the product recommended by the recommender system xr is independent of the profit margin parameter m . The firm’s strategic recommendation strategy is driven by heterogeneity among products’ profit margin xm and the profit margin increase as either x or m increases. Therefore, one may think that the firm’s product recommendation strategy will shift to the right when profit margin m is higher. However, counter-intuitively, Proposition 2 says that the product recommendation does not depend on m . 3.4 Firm’s total expected revenue 15 Lemma 1 says that the firm’s recommendation strategy for each consumer bucket depends on the range of a bucket ( r ), the consumer’s search cost ( c ), misfit cost ( t ), and the probability that consumer will continue product search after rejecting the recommendation ( ). Thus, the firm’s expected revenue of a bucket over [a j , a j r ] also depends on these factors, E[ Ra ( xa* | r, , c, t,)] . Without considering the cost of j j recommender system, we show that the firm’s expected total revenue is: E[ R( r )] 1 r 0 E[ Ra j ( xr | r )]da j , and this decreases as the bucket range increases (Figure 6). Figure 6: Impact of accuracy of recommender system on the firm's expected revenue where m 1 , c 0.2 , t 1 , and 0.6 PROPOSITION 3: The firm’s expected total revenue increases as the range of consumer buckets decreases (lower r ), dE[ R( r )] 0. dr Decrease in the range of buckets means the firm can more accurately identify each consumer, and thus it is more likely for a consumer accepting a recommended product that has a higher profit margin. Thus, the firm’s total revenue increases as r decreases. PROPOSITION 4: When the range of buckets is relatively small, r c / t , the firm’s optimal product recommendation is outside the bucket xr* [a j , a j r ] . 16 When the optimal range of bucket is r* c / t (Lemma 2), therefore, the firm’s optimal product recommendation is xr* a j c / t . Proposition 4 states that the firm will never provide recommendation that perfectly matches the consumer’s preference if the recommender system is sufficiently accurate ( r c / t ). This implies that the consumers will accept the recommendation, but they never get a perfect match to their ideal locations. This is intuitive as the firm will try to sell products that are profitable and acceptable to consumers. When the cost of improving accuracy of the recommender system is relatively low, the firm will invest to build a more accurate recommender system. This allows the firm to sell the product that is acceptable to consumers but he chooses to recommend products that are outside the consumers’ buckets. Such a recommendation strategy yields higher profit margin to the retailer. Proposition 5 explores the firm’s product recommendation strategy when the cost of improving the accuracy of the recommender system is relatively high. PROPOSITION 5: When the range of buckets is relatively large, r* c / t , (1) the firm’s optimal product recommendation is inside the bucket xr* [a j , a j r ] ; and (2) a consumer is more likely to accept the recommendation if is smaller When dLaccept ( xr ) d 0. c / t r r1 , the firm’s optimal product recommendation is xr x1 for buckets start from a j [0, a1 ] , or xr a j c / t for buckets start from a j (a1 ,1 r ] ; or when r1 r 1 , the firm’s optimal product recommendation is xr x1 for all buckets (Lemma 1). In either case, the product recommendation is inside the range of buckets xr* [a j , a j r ] , which implies that a consumer may get a perfect match with the product recommendation. However, in this case a consumer may reject the recommendation if her location is xi [a j , xr c / t ) , since the optimal range of bucket is relatively large, r* c / t . Proposition 5 states that when the cost of improving accuracy of the recommender system is relatively high, the firm will invest less and thus the recommender system is less accurate, r* c / t . Therefore, a consumer may reject the recommendation, but she may get a perfect match with the product recommendation. 17 Moreover, it is more likely that she accepts the recommendation when it is less likely for consumers to continue to search if they reject the product recommendation (smaller ), or the cost of is lower, or the acceptance range is greater. The intuition is that when there is a probability that a consumer will give up product search and do not buy any product if she rejects recommendation ( 0 1 ), the firm needs to sacrifice the expected profit margin from recommending product (by recommending a lower xr ) as he may lose a consumer if she rejects the recommendation. Therefore, the chance that a consumer will accept the product recommendation is higher, if there is a higher chance if the firm loses a consumer. \ 3.5 Realized consumer surplus The expected misfit cost for a consumer at xi who search products till she finds an acceptable product is the expected loss from products that are within the range [ xi c / t , xi c / t ] : 2 xi c / t xi ( x xi )tdx ct / 2 The expected number of attempts for a consumer who searches products till she finds an acceptable product is t c / 2 , and the expected search cost is ct / 2 from §3.2. Therefore, consumer surplus of a consumer who rejects project recommendation, and continues to search for alternative product till an acceptable product is the baseline utility minus the expected search cost and misfit costs: CSis V cs / 2 cs / 2 V cs The consumer surplus of a consumer at xi who accepts the product recommendation xr is: CSir V | xr xi | t In order to examine the impact of the firm’s recommender system strategy on consumer surplus, we need to assume a mapping from consumer location to consumer 18 bucket, a j xi / (1 r) . In this subsection, we assume consumer locations xi are distributed uniformly on the interval [0,1] , and the left borders of buckets a j are distributed uniformly on the interval [0,1 r] . Lemma 1 states that when the range of buckets is relatively small, r c / t , the firm’s product recommendation is xr* a j c / t for all buckets. Since all consumers accept the recommendation and none searches for products, consumers only incur misfit cost (Proposition 4). For consumers whose locations are close to 1, xi (1 c / t ) / (1 r) , the product recommendation is xr* 1 as it is the most profitable product; while for xi (1 c / t ) / (1 r) , the product recommendation is consumers whose location are xr* a j c / t . The aggregate consumer surplus is: CS (1 c / t )/(1 r ) 0 CSir ( xr a j c / t )dxi 1 (1 c / t )/(1 r ) CSir ( xr 1)dxi (7) When the range of buckets is medium, c / t r r1 , the firm’s optimal product recommendation is xr* x1 c / t 1 a j r for buckets a j [0, a1 ] , or xr* a j c / t for 2 2 buckets a j (a1 ,1 r ] . Some consumers ( xi x1 c / t ) reject the product recommendation because it is outside their acceptance ranges, and they continue to search and buy products that are acceptable. Their expected misfit cost is ct / 2 and search cost is ct / 2 . Other consumers ( xi x1 c / t ) accept the product recommendation, and they do not search for other products. Their search cost is zero. Moreover, for consumers xi [ x1 c / t , a1 / (1 r)] , whose corresponding buckets are a j [0, a1 ] , the product recommendation is xr* x1 ; while for consumers xi [a1 / (1 r),1] , the product recommendation is xr* a j c / t . Thus, the aggregate consumer surplus is: CS x1 c / t 0 CSis dxi a1 /(1 r ) x1 c / t CSir ( xr x1 )dxi 1 a1 /(1 r ) CSir ( xr a j c / t )dxi (8) Furthermore, when the range of buckets is relatively large, r1 r 1 , the optimal product recommendation is xr x1 for all consumers. Similar to the case where 19 c / t r r1 , some consumers ( xi x1 c / t ) reject the product recommendation, while other consumers ( xi x1 c / t ) accept the product recommendation. For consumers who reject the product recommendation, they may continue to search for products, and they incur search and misfit costs; while for consumers who accept the product recommendation, they only incur misfit cost. The aggregate consumer surplus is: CS x1 c / t 0 CSis dxi 1 x1 c / t CSir ( xr x1 )dxi We further assume that the unit misfit cost is t 1 , and thus, (9) 1 1 c , following 9 4 the assumption in §3.2. PROPOSITION 6: (1) The aggregate consumer surplus increases as the range of buckets increases, dCS 0 , when the range of bucket is relatively small r c / t ; (2) dr aggregate consumer surplus decreases as the range of buckets decreases, dCS 0 , when dr the range of bucket is relatively large r c / t . Proposition 6 states that when the accuracy of recommender system is not accurate ( r c / t ), consumer surplus increases if the range of buckets decreases. This implies that consumers benefit as the recommender system becomes more accurate in this range. On the other hand, when the recommender system is already accurate ( r c / t ), consumer surplus decreases if the range of buckets decreases. This means that consumers become worse off if recommender system becomes more accurate (Figure 7). 20 Figure 7: Impact of bucket range on the realized consumer surplus Proposition 6 argues that improvement of recommender system (shorter r ) by the firm has heterogeneous impacts consumer surplus. The range of consumer buckets also has heterogeneous impacts on social welfare. Figure 8 illustrates that social welfare increases as the accuracy of the recommender system improves (shorter r ) when the range of consumer buckets is relatively large r ( c / t ,1) , but is constant the range of consumer buckets is relatively small r (0, c / t ] . Combined with what we have shown about aggregate consumer surplus and the firm’s revenue, we show that when the recommender system is relatively not accurate, improving its accuracy is beneficial to both the firm and its customers. On the other hand, decrease in the range of consumer buckets does not entail any increase in social welfare when the range is already relatively small. The loss in the aggregate consumer surplus is the firm’s gain in revenue when the range of consumer buckets decreases if the range is already relatively low ( r c / t ). from accurately identifying individual consumer’s preference and recommending products with higher margin. 21 Figure 8: Impact of bucket range on the realized social welfare PROPOSITION 7: When the range of buckets decreases, (1) social surplus does not change as the range of buckets increases, if the range of bucket is relatively small r c / t ; (2) social surplus decreases if the range of bucket is relatively large r c / t . When the range of consumer buckets is relatively large ( r c / t ), since the aggregate consumer surplus (Proposition 6) and the firm’s revenue increases, social welfare increases. This implies that when the firm does not have adequate information about individual consumers’ preferences, improving accuracy of the recommender system creates externality to consumers. At the same time, the firm’s revenue increases, since more consumers accept the recommended products with higher profit margin when the recommender system becomes more accurate. On the other hand, when the recommender system is relatively accurate, the retailer deliberately misleads all consumers by not giving them their ideal products. When the firm improves the accuracy of the recommender system, all consumers are given the recommended products that are further away from their preferences, but are within their acceptance range. This leads to higher mismatch cost that is borne by the consumers. Interestingly, even when the recommender system is accurate and misleads all consumers, social welfare does not decrease. This happens because the firm’s revenue increases at the same rate when the recommender system becomes more accurate. 22 4. Conclusion In this paper, we develop a framework to examine the optimal recommender system strategy of an online monopolist multi-product retailer. In the context of online products, we consider consumers consider the personalized recommended products of the recommender system before they decide to search for alternative products. The use of recommender system reduces consumers’ search cost as they avoid product search by accepting the product recommendation. We show that the firm’s expected revenue increases when the accuracy of the recommender system improves. We also find that the firm’s optimal recommender system strategy is driven by consumers’ search cost and misfit cost, and the probability that a consumer does not buy any product if she rejects recommendation. We find that when the recommender system gets more accurate, the aggregate consumer surplus decreases if the recommender system is relatively accurate, and increases if the recommender system is relatively less accurate. This is surprising, as one may think the recommender system would have a monotonic impact on consumer surplus. More interestingly, we find that when the recommender systems gets more accurate, social welfare increases if the recommender system is relatively less accurate, but does not change if the recommender system is relatively accurate. This implies that both the firm and consumers benefit from improving accuracy of the recommender system when it is less accurate. On the other hand, the firm’s gain in revenue is exactly the loss in consumer surplus, and thus, the social welfare does not change, if the recommender system is relatively accurate. Our finding suggests that when the firm does not have enough information about consumers’ preferences, product recommendation are not accurate and only some customers accept the recommended products. We refer to this stage as the “explore” stage of the recommender system. When the firm gets more information, the recommendations become more accurate, and thus more customers accept the recommended products and skip the search for alternative products. In other words, the firm’s effort of improving accuracy of the recommender system creates an externality that benefits consumers. On the other hand, when the firm gets enough information about consumers, the 23 recommendation is accurate. This is the “exploit” stage of the recommender system. The firm can target consumers with more “accurate” product recommendations that have higher profit margin but are further away from consumers’ references. The firm’s revenue increases on the expense of consumers—their mismatch cost increases. However, society as a whole never gets worse off when the recommender system gets more accurate. Our results provide several insights about the use of recommender systems in the context of online retail business. First, an online retailer should adopt a recommender system and use it to maximize revenue by skewing towards products with higher profit margin. Second, the recommender system has non-monotonic effects on consumers, depending on the different stages of the recommender system — understanding of individual consumers’ preferences by the retailer. Third, it is never worse off from the societal point of view when the recommender system gets more accurate. This provides counter-argument to voices of draw-backs and perils of personalized or target marketing in the electronic marketplace (Zhang 2011). In this paper, we assume a uniform price across all products, and homogenous mismatch cost per distance between individual preference and a product among consumers. In future study, we want to generalize these assumptions to accommodate the firm’s product pricing decision and consumer heterogeneity in mismatch cost and search cost. References Adomavicius, G., & Tuzhilin, A. (2005). Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. Knowledge and Data Engineering, IEEE Transactions on, 17(6), 734-749. Ansari, A., Essegaier, S., & Kohli, R. (2000). Internet recommendation systems. Journal of Marketing research, 37(3), 363-375. Bakos, J. Y. (1997). Reducing buyer search costs: Implications for electronic marketplaces. Management science, 43(12), 1676-1692. Bodapati, A. V. (2008). Recommendation systems with purchase data. Journal of Marketing Research, 45(1), 77-93. 24 Branco, F., Sun, M., & Villas-Boas, J. M. (2012). Optimal search for product information. Management Science, 58(11), 2037-2056. Das, A. S., Datar, M., Garg, A., & Rajaram, S. (2007). Google news personalization: scalable online collaborative filtering. In Proceedings of the 16th international conference on World Wide Web (pp. 271-280). ACM. Fleder, D., & Hosanagar, K. (2009). Blockbuster culture's next rise or fall: The impact of recommender systems on sales diversity. Management science, 55(5), 697-712. Fong, N. M. (2012). Targeted Marketing and Customer Search. working paper. Available at SSRN 2097495. Hosanagar, K., Fleder, D., Lee, D., & Buja, A. (2013). Will the Global Village Fracture Into Tribes? Recommender Systems and Their Effects on Consumer Fragmentation. Management Science, 60(4), 805-823. Häubl, G., & Trifts, V. (2000). Consumer decision making in online shopping environments: The effects of interactive decision aids. Marketing science, 19(1), 4-21. Johar, M., Mookerjee, V., & Sarkar, S. (2014). Selling vs. Profiling: Optimizing the Offer Set in Web-Based Personalization. Information Systems Research, 25(2), 285-306. Kim, J. B., Albuquerque, P., & Bronnenberg, B. J. (2010). Online demand under limited consumer search. Marketing Science, 29(6), 1001-1023. Lamere, P., & Green, S. (2008). Project aura-recommendation for the rest of us. In Presentation at Sun JavaOne Conference. Slides last accessed (Vol. 25). Murthi, B. P. S., & Sarkar, S. (2003). The role of the management sciences in research on personalization. Management Science, 49(10), 1344-1362. Oestreicher-Singer, G., & Sundararajan, A. (2012). Recommendation networks and the long tail of electronic commerce. MIS Quarterly, 36(1), 65-83. Tam, K. Y., & Ho, S. Y. (2006). Understanding the impact of web personalization on user information processing and decision outcomes. Mis Quarterly, 865-890. Thompson, C. (2008). If you liked this, you’re sure to love that. The New York Times, 21. http://www.nytimes.com/2008/11/23/magazine/23Netflix-t.html?pagewantedDall& _rD0. Zhang, J. (2011). The perils of behavior-based personalization. Marketing Science, 30(1), 170-186. 25 Appendix A: Proof PROPOSITION 1: When search costs increase dx dx 0 or misfit costs decrease 0 dc dt then the expected distance between the consumer’s ideal location and the location of the purchased product will increase. Proof of Proposition 1: Since x c / t , dx dx dx 1 dx 1 0 and 0. , and . Since c 0 and t 0 , dc 2 ct dt dc dt 2 ct Thus, we prove Proposition 1. ▪ LEMMA 1: 1) When 0 r c / t , the optimal product recommendation is xr a j c / t for buckets start from a j [0,1 c / t ] , or xr 1 for buckets start from a j (1 c / t ,1 r] ; 2) When c / t r r1 , the optimal product recommendation is xr x1 for buckets start from a j [0, a1 ] , or xr a j c / t for buckets start from a j (a1 ,1 r ] ; optimal product recommendation is xr x1 for all buckets x1 c / t 3) When r1 r 1 , the a j [0,1 r ] ; where 1 c / t r c/t 1 a j r , r1 and a1 . 2 1 2 2 Proof of Lemma 1: When 0 r c / t , then x1 c / t aj c / t 1 a j r a j 2 c / t c / t (r c / t ) is lower than 2 2 2 a j (2 ) 2 c / t c / t 2 , thus, the optimal product recommendation is the boundary solution xr a j c / t . 26 When a1 c / t r , then solve the value of a1 where a1 c / t x1 (a a1 ) , and we obtain r c/t r c/t 1 r1 , we . Since the maximum value of a j is 1 r , then solving a1 1 1 1 get r1 1 c / t . 2 This implies that when c / t r r1 , if a j a1 , then x1 a1 c / t , thus the optimal product recommendation is xr x1 ; and if a j a1 , then x1 a1 c / t , thus the optimal product recommendation is xr a j c / t . When r1 r 1 , x1 a1 c / t , thus, the optimal product recommendation is xr x1 . Thus, we prove Lemma 1. ▪ PROPOSITION 2: The firm’s product recommendation xr is independent of profit margin m . Proof of Proposition 2: Given the accuracy of the recommender system, it is obvious to see that the product recommendation, xr a j c / t or x1 c / t 1 a j r , is independent of m . 2 2 Thus, we prove Proposition 2. ▪ PROPOSITION 3: The firm’s expected total revenue increases as the range of consumer buckets decreases (lower r ), dE[ R( r )] 0. dr Proof of Proposition 3: 27 From Lemma 1, we show the firm’s product recommendation strategy depends on the range of buckets. And there are three regions for the range of buckets: 1) 0 r c / t , 2) c / t r r1 , and 3) r1 r 1 . 1) When 0 r c / t , the product recommendation is xr a j c / t for buckets a j [0,1 c / t ] , or xr 1 for buckets a j (1 c / t ,1 r ] . Thus, the optimal expected total revenue is E[ R( r )] 1 c / t 0 E[ Ra j ( xr a j c / t )]da j 1 r 1 c / t m( c / t r ) E[ Ra j ( xr 1)]da j m( t c ) 2t . We take the first order derivative of the optimal expected total revenue with respect to the range of buckets, and we obtain: 2) When dE[ R( r )] m 0 . dr c / t r r1 , the optimal product recommendation is xr x1 for buckets start from a j [0, a1 ] , or xr a j c / t for buckets start from a j (a1 ,1 r ] . Thus, the optimal 1 r a1 expected total revenue is E[ R(r)] 0 E[ Ra ( xr x1 )]da j a E[ Ra ( xr a j c / t )]da j . j 1 j We take the first order derivative of the optimal expected total revenue with respect to the range of buckets, and we obtain: dE[ R( r )] m(c3/2 3 c (5 6 2 2 )r 2t 2r 2 (9 (1 r ) 3 2 (1 r ) 7r 6)t 3/2 ) . dr 6(2 )(1 )r 2t 3/2 We let A c3/2 3 c (5 6 2 2 )r 2t 2r 2 (9(1 r) 3 2 (1 r) 7r 6)t 3/2 , so the numerator is mA . We then take the derivative of A with respect to , and we find that dA 6(3 2 )r 2 ( c (1 r ) t )t , d and d2A 12r 2 ( c (1 r ) t )t 0 . This implies that A is at d 2 the maximum when 2 / 3 . We substitute 2 / 3 into A , and we have 1 A( 2 / 3) (3c3/2 17 cr 2t 2r 2 (4 7r )t 3/2 ) 3 . Since 28 3 1 c / t r r1 ( 2 / 3) c / t r ( c / t ) 4 3 A( 2 / 3) 0 . Since , and 2 c / t 1 3 c / t , we show that This implies that A 0 for all (0,1) . dE[ R( r )] mA dE[ R( r )] 0 for , and 6(2 )(1 )r 2t 3/2 0 , we show that 2 3/2 dr 6(2 )(1 )r t dr all (0,1) . 3) When r1 r 1 , the optimal product recommendation is xr x1 for all buckets 1 r Thus, the optimal expected total revenue is E[ R(r )] 0 E[ Ra ( xr x1 )]da j . a j [0,1 r ] . j We take the first order derivative of the optimal expected total revenue with respect to the range of buckets, and we obtain: dE[ R( r )] m(3c 3 c (1 )(1 r 2 ) t ((1 )2 3(2 ) r 2 2(1 ) 2 r 3 )t ) . dr 6(2 )r 2t Since 0 1 , the numerator is positive, but the denominator is negative, and we have dE[ R( r )] 0. dr Thus, we show that dE[ R( r )] 0. ▪ dr PROPOSITION 4: When the range of buckets is relatively small, r c / t , the firm’s optimal product recommendation is outside the bucket xr* [a j , a j r ] . PROPOSITION 5: When the range of buckets is relatively large, r* c / t , (1) the firm’s optimal product recommendation is inside the bucket xr* [a j , a j r ] ; and (2) a consumer is more likely to accept the recommendation if is smaller dLaccept ( xr ) d 0. Proof of Propositions 4 and 5: When the range of buckets is relatively small, r c / t , xr a j c / t a j r . When the range of buckets is relatively large, r c / t , xr x1 a j r , and the likelihood of a consume may accept the recommendation is: 29 Laccept ( xr ) dLaccept ( xr ) d a j r ( x1 c / t ) r aj r c / t r (1 ) c / t (a j r ) (2 )r , which decreases in , so 0. Thus, we prove Proposition 4 and 5. ▪ PROPOSITION 6: (1) The aggregate consumer surplus increases as the range of buckets increases, dCS 0 , when the range of bucket is relatively small r c ; (2) dr aggregate consumer surplus decreases as the range of buckets decreases, the range of bucket is relatively large r dCS 0 , when dr 1 c . 2 Proof of Proposition 6: When r c / t , the product recommendation is xr a j c / t for buckets a j [0,1 c / t ] , or xr 1 for buckets a j (1 c / t ,1 r] . The aggregate consumer surplus from equation (7) is CS CS (1 c / t )/(1 r ) (V (a j c / t xi )t )dxi 1 (1 c / t )/(1 r ) 0 c 2 ct tr V 2(1 r ) (V (1 xi )t )dxi . We get . Take the derivative with respect to r , and we obtain dCS ( c t )2 0. dr 2(1 r )2 When c / t r r1 , the optimal product recommendation is xr x1 for buckets start from a j [0, a1 ] , or xr a j c / t for buckets start from a j (a1 ,1 r ] . We first solve for as1 that satisfies as1 / (1 r) as1 c / t . If r implies when c/t r c1/4 (c1/4 c1/4 (c1/4 c 2 4(1 ) t ) 2 t c 2 4(1 ) t ) 2 t , then a1 as1 . This , consumers a1 / (1 r) xi as1 / (1 r) , will be given the product recommendation that is smaller than the true location xr a j c / t xi ; and consumers xi as1 / (1 r ) , will be given the product 30 recommendation larger than the true location xr a j c / t xi . Thus, the aggregate consumer surplus from equation (8) is: CS x1 c / t 0 If (V ct )dxi a1 /(1 r ) x1 c / t c1/4 (c1/4 (V ( x1 xi )t )dxi c 2 4(1 ) t ) 2 t as 2 / (1 r) x1 (a j as 2 ) . as1 /(1 r ) a1 /(1 r ) r (V (a j c / t xi )t )dxi 1 as1 /(1 r ) (V ( xi a j c / t )t )dxi , then as1 a1 . We first solve for as 2 that satisfies When c1/4 (c1/ 4 c 2 4(1 ) t ) 2 t r r1 , consumers as 2 / (1 r) xi a1 / (1 r) , will be given the product recommendation that is greater than the true location xr x1 xi ; and consumers a1 / (1 r) xi , will be given the product recommendation that is greater than the true location xr a j c / t xi . Thus, the aggregate consumer surplus from equation (8) is: CS x1 c / t 0 as 2 /(1 r ) (V ct )dxi x1 c / t (V ( x1 xi )t )dxi a1 /(1 r ) as 2 /(1 r ) (V ( xi x1 )t )dxi 1 a1 /(1 r ) (V ( xi a j c / t )t )dxi When r1 r 1 , the optimal product recommendation is xr x1 for all buckets a j [0,1 r ] . Thus, the aggregate consumer surplus from equation (9) is CS x1 c / t 0 (V ct )dxi as 2 /(1 r ) x1 c / t (V ( x1 xi )t )dxi 1 as 2 /(1 r ) (V ( xi x1 )t )dxi . Since we assume t 1 , we take the derivative with respect to r , and we obtain dCS c(5 10 4 2 ) 2 c (1 )(1 2 (1 V ) 2V ) (1 ) 2 (1 2(2 )V ) dr 2(2 )(1 r )2 1/ 9 c 1/ 4 , 0 1 . Also, since and V c , the numerator of the derivative is negative and the denominator is positive, thus, dCS 0. dr Thus, we prove Proposition 6. ▪ 31