-
In LLM alignment and many other ML applications, one often faces the MultiObjective Fine-Tuning (MOFT) problem, i.e. fine-tuning an existing model with datasets labeled w.r.t. different objectives simultaneously. To address the challenge, we propose the HyperDPO framework, a hypernetwork-based approach that extends the Direct Preference Optimization (DPO) technique, originally developed for efficient LLM
-
2024The fashion industry is one of the leading domains in the global e-commerce sector, prompting major online retailers to employ recommendation systems for product suggestions and customer convenience. While recommendation systems have been widely studied, most are designed for general e-commerce problems and struggle with the unique challenges of the fashion domain. To address these issues, we propose a
-
2024The rapid introduction of new brand names into everyday language poses a unique challenge for e-commerce spelling correction services, which must distinguish genuine misspellings from novel brand names that use unconventional spelling. We seek to address this challenge via Retrieval Augmented Generation (RAG). On this approach, product names are retrieved from a catalog and incorporated into the context
-
2024Long-form question answering (LFQA) aims at generating in-depth answers to end-user questions, providing relevant information beyond the direct answer. However, existing retrievers are typically optimized towards information that directly targets the question, missing out on such contextual information. Furthermore, there is a lack of training data for relevant context. To this end, we propose and compare
-
2024Query Auto-Complete (QAC) is an essential search feature that suggests users with a list of potential search keyword completions as they type, enabling them to complete their queries faster. While the QAC systems in eCommerce stores generally use the Learning to Rank (LTR) approach optimized based on customer feedback, it struggles to provide diverse suggestions, leading to repetitive queries and limited
Related content
-
October 04, 2024Rufus leverages AWS chips Trainium and Inferentia, AWS’s elasticity and scalability, and a custom-built large language model to quickly answer shoppers’ questions.
-
July 03, 2024Gradient-boosted decision trees aggregate model outputs, and Shapley values help identify the most useful models for the ensemble.
-
May 10, 2024Using large language models to discern commonsense relationships can improve performance on downstream tasks by as much as 60%.
-
October 06, 2023Leveraging a large vision-language foundation model enables state-of-the-art performance in remote-object grounding.
-
September 26, 2023Time series forecasting enables up-to-the-minute trend recognition, while novel two-step training process improves forecast accuracy.
-
September 14, 2023In a keynote address, the Amazon International vice president will discuss recommendations in directed graphs, training models whose target labels change, and using prediction uncertainty to improve model performance.