Customer-obsessed science
Research areas
-
June 8, 20267 min readFour approaches can dramatically improve the performance and trustworthiness of AI agents in operational environments.
-
-
-
May 26, 20265 min read
-
Featured news
-
ACL 2026 Workshop on Advances in Language and Vision Research2026Visual grounding in graphical user interface (GUI) requires accurate localization of UI elements from natural language instructions. Conventional coordinate generation approaches face inherent limitations, including sensitivity to resolution variations and lack of interpretability. Recently, coordinate-free attention-based methods have emerged as a promising alternative, but these methods primarily rely
-
VLDB 20262026Compilation-based query execution produces optimized machine code per query but introduces a cold-start problem: when the compiled code is not cached, the query stalls during compilation, delaying data processing by up to orders of magnitude relative to the query's execution time. This overhead dominates short-running queries and creates latency variability for both interactive analytics and ETL pipelines
-
EACL 2026 Industry Track2026Personalized shopping agents must adapt their decisions to different user personas, balancing efficiency, preference alignment, and goal success. Building upon the WebShop dataset and τ2-Bench environment, ShopperBench introduces a persona-guided benchmark for evaluating such adaptive behaviors. ShopperBench augments shopping trajectories with persona-conditioned goals, reasoning rationales, and preference
-
2026We introduce SWAN (Semantic Watermarking with Abstract Meaning Representation)1 , a novel framework that embeds watermark signatures into the semantic structure of a sentence using Abstract Meaning Representation (AMR). In contrast to existing watermarking methods, which typically encode signatures by adjusting token selection preferences during text generation, SWAN embeds the signature directly in the
-
KDD 20262026Individual treatment effect (ITE) estimation from observational data becomes unreliable when three challenges co-occur: extreme class imbalance (0.4% treatment rate), outcome sparsity (97.6% zeros), and pervasive cold-start (99.2% incomplete profiles). These conditions violate identifying assumptions—propensity scores collapse toward boundary values, and outcome predictions degrade for subjects with sparse
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all