Customer-obsessed science
Research areas
-
December 5, 20256 min readA multiagent architecture separates data perception, tool knowledge, execution history, and code generation, enabling ML automation that works with messy, real-world inputs.
-
-
-
November 20, 20254 min read
-
Featured news
-
KDD 2025 Workshop on Structured Knowledge for Large Language Models2025Despite advances in large language model (LLM)-based natural language interfaces for databases, scaling to enterprise-level data catalogs remains an under-explored challenge. Prior works addressing this challenge rely on domain-specific fine-tuning—complicating deployment—and fail to leverage important semantic context contained within database metadata. To address these limitations, we introduce a component-based
-
KDD 2025 Workshop on Machine Learning in Finance (MLF)2025Financial accounting systems rely heavily on subledgers to track detailed transaction records. However, modern systems often evolve into complex architectures where different components use inconsistent labeling conventions, making it difficult to understand and utilize important relationships within subledger data. This paper presents a novel framework LLM-STARS (LLM-Enhanced Standardization of Time-series
-
KDD 2025 Workshop on Prompt Optimization2025Despite advances in the multilingual capabilities of Large Language Models (LLMs), their performance varies substantially across different languages and tasks. In multilingual retrieval-augmented generation (RAG)-based systems, knowledge bases (KB) are often shared from high-resource languages (such as English) to lowresource ones, resulting in retrieved information from the KB being in a different language
-
2025We introduce Griffin, the first foundation model attemptation designed specifically for Relational Databases (RDBs). Unlike previous smaller models focused on single RDB tasks, Griffin unifies the data encoder and task decoder to handle diverse tasks. Additionally, we enhance the architecture by incorporating a cross-attention module and a novel aggregator. Griffin utilizes pretraining on both single-table
-
KDD 2025 Workshop on Talent and Management Computing2025Having a unified, coherent taxonomy is essential for effective knowledge representation in domain-specific applications as diverse terminologies need to be mapped to underlying concepts. Traditional manual approaches to taxonomy alignment rely on expert review of concept pairs, but this becomes prohibitively expensive and time-consuming at scale, while subjective interpretations often lead to expert disagreements
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all