Summary: Feed scoring system for comparison-shopping engine structured data feeds.
The Feed Scoring System gave merchants and the teams that supported them, more insight into the quality of a Yahoo! Shopping data feed. This data feed quality information could be used for a variety of activities including: feed reviews, optimization suggestions, and competitive benchmarking.
I initially created this scoring system with another developer as a hack day project, then as a minimum viable product, and finally as a fully functioning system with incremental updates on a standard two-week release cycle. Like many innovations, the feed scoring system was created on top of existing technology and within set parameters. The goals were manifold: to answer common questions, provide insights, and list prioritized optimization suggestions for merchants. The resulting technology was not available anywhere else within the company or by competing comparison-shopping engines and was highlighted by the sales and account management teams as a clear product differentiator.
Situation:
The feed scoring system improved data feeds by adjusting two levers: merchant insights and improved data quality.
Merchant Insights: Merchants participating in Yahoo! Shopping would submit data feeds filled with meta-data describing the products they sold. Using the Yahoo! Shopping front-end, consumers would find a merchant’s product, click on the product, and would be sent to a product page on a merchant’s site to complete the purchase. This transaction would be considered a lead. Merchants would be charged per lead on a cost per click (CPC) basis, according to a rate card with CPCs listed by product category.
The Yahoo! Shopping sales and account management teams would manually review data feeds, compare the feeds against a feed specification document and deliver feed optimization suggestions to merchants during in-person meeting. As part of supporting top-tier merchants (top 15% of revenue), account managers would select merchants as review candidates. Due to the lengthy review process, a limited number of merchants could receive feed reviews each quarter. In the past, these feed reviews would take up to four hours of work to complete.
Improved data quality: While the sales and account management teams were focused on answering merchants’ questions, the Yahoo! Shopping product management team was focused on improving the quality of the product catalog. This product catalog powers the front-end display and allows consumers who use Yahoo! Shopping to more easily find products and make more informed purchase decisions. Consumers find products on Yahoo! Shopping using three main functionalities: search, attribute narrowing, and comparison grids. The improved data quality in-turn improved the performance of each of these front-end functionalities.
Additionally, the product management team also kept track of the overall product catalog breadth and depth, and compared results against other competing shopping-comparison engines. If consumers couldn’t find a product on Yahoo! Shopping, they may try finding the product on a competing site. It’s possible that lack of product catalog breath and depth could result in not just losing one sale, but losing a customer for life.
Goals / Tasks:
To build the feed scoring system, I grouped the goals into two types: merchant insights and improved product catalog data quality.
Merchant insight goals - The merchant insight goals focused on answering the following merchant questions – Am I submitting enough data? What can I do to improve my feed? How does my feed compare against other merchants? Have changes to my feed improved my performance on Yahoo! Shopping? How should I prioritize feed improvement work with my development team?
Product catalog goals - The improved product catalog goals included: increased number of product attributes extracted from the data feed, increased number of matches to comparison grids, increased search terms extracted, increased merchant retention, and increased count of total product catalog.
Actions:
The feed scoring system was built in three phases: initial prototype, minimum viable product, and full release with incremental updates. In between each phase or release, I would gather feedback from external customers (Yahoo! Shopping merchants) and internal stakeholders (sales teams, account management teams, developers, business development teams and product management teams). Feedback from each group was incorporated to develop the prioritized feature roadmap. I then assembled a cross-functional team of developers, designers and product managers to build the required features. A selection of key features from each phase of development is listed below:
Prototype:
Initial field coverage data import
Prototype UI, UED plan and user stories
Basic System Architecture
Meta-data level weighting algorithm
Minimum Viable Product:
Integration of attribute extraction data
Scoring tool dashboard
Prioritized optimization suggestions
TSV report download functionality
Incremental improvements:
Algorithm weighting updates
Score comparisons
Score history snapshots
Score history graphs
Advanced UI improvements
Scores incorporated into other tools
Program-wide competitive benchmarking
Results:
The results of implementing the feed scoring system ranged from objective, quantifiable results to systematic and operational improvements. A selection of findings are listed below:
Merchant insights
Increased count of reviews available per quarter from XX to X,XXX
Reduced time for creating feed reviews from 4+ hours to seconds.
Increased average data quality scores by XXX%
Increased number of CPC leads by XX%
Provided data and insights for 5 key questions from merchants, that were previously unanswerable with existing toolset.
Product catalog
Increased depth of product catalog in vertical search by XX%
Increased number of extracted product attribute meta-data by XX%
Increased number of matches to comparison grids by XX%
Increased insight into program-wide data quality for current snapshots, historical trends, and seasonality.
Provided program-wide data and insights that were previously unavailable with existing toolset.