Exploring the reasons behind customer purchase decisions

Crowdsourcing reveals sources of information that could improve product discovery algorithms.

By Jie Yang, Rongting Zhang, Vanessa Murdock

March 10, 2020

One way to make Amazon’s product discovery algorithms more satisfying is to ensure that we match high-quality products to customer queries. Recently, we conducted a study in which we used crowdsourcing to identify product data that could help predict human judgments of quality; we were scheduled to report the results of that study next week at the ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR), but unfortunately, the conference has been canceled.

In the study, we presented crowd workers with images of pairs of related products — together with public product information supplied by both sellers and customers — and asked them which products were of higher quality. Then we asked them which terms extracted from the product information best explained their judgments.

Product comparison screen.png — Our custom interface for crowdsourcing product quality comparisons. Beneath the seller’s public description of each product *(C)* are expandable lists of public product-related information provided by customers *(D and E)*. To the right are phrases extracted from both seller and customer information *(I)*, which crowd workers used to characterize the criteria for their quality judgments.

Our first finding was that perceived quality is not a good predictor of customers’ purchase decisions. We suspect, however, that this is related to our second finding: perceived quality is highly correlated with price. Customers may choose lower-quality products if they’re correspondingly lower priced.

If this hypothesis is right, then a good product discovery algorithm should factor in indications of quality that are not correlated with price. Where should it look for them?

Customer vs. seller texts.png — Proportions of selected attributes from catalogue and customer-generated content

We found that most of the time, the terms that best described the crowd workers’ judgment criteria came from the public customer-supplied information, not the seller information. The customer-supplied information had two sources: customer reviews and question-and-answer sequences in which customers answered other customers’ product-related questions.

In our study, each product pair included one product that was actually purchased and one that was clicked but not purchased during the same query. Products also had to share the most fine-grained classification available in the amazon.com product classification hierarchy. The terms presented to the crowd workers were chosen based on how frequently they appeared in texts associated with those fine-grained categories.

Existing research on product recommendation has mainly focused on modeling purchases directly, without attempting to find the reasons behind customer decisions. We believe that understanding the processes that underlie customers' purchasing decisions will help us make better product recommendations. This work represents one of several steps we’re taking in that direction.

About the Author

Jie Yang

Jie Yang is an applied scientist in the Amazon Alexa Shopping organization.

Rongting Zhang

Rongting Zhang is an applied scientist in the Amazon Alexa Shopping organization.

Vanessa Murdock

Vanessa Murdock is an applied science manager in the Amazon Alexa Shopping organization.

Exploring the reasons behind customer purchase decisions

Crowdsourcing reveals sources of information that could improve product discovery algorithms.

Related content

Work with us