Michael Kearns and Aaron Roth seated at a table in front of a large chalk board.
Michael Kearns, left, and Aaron Roth, right, are the co-authors of The Ethical Algorithm: The Science of Socially Aware Algorithm Design. Kearns and Roth are leading researchers in machine learning, University of Pennsylvania computer science professors, and Amazon Scholars.
University of Pennsylvania

Amazon Scholars Michael Kearns and Aaron Roth discuss the ethics of machine learning

Two of the world’s leading experts on algorithmic bias look back at the events of the past year and reflect on what we’ve learned, what we’re still grappling with, and how far we have to go.

In November of 2019, University of Pennsylvania computer science professors Michael Kearns and Aaron Roth released The Ethical Algorithm: The Science of Socially Aware Algorithm Design. Kearns is the founding director of the Warren Center for Network and Data Sciences, and the faculty founder and former director of Penn Engineering’s Networked and Social Systems Engineering program. Roth is the co-director of Penn’s program in Networked and Social Systems Engineering and co-authored The Algorithmic Foundations of Differential Privacy with Cynthia Dwork. Kearns and Roth are leading researchers in machine learning, focusing on both the design and real-world application of algorithms.

Their book’s central thesis, which involves “the science of designing algorithms that embed social norms such as fairness and privacy into their code,” was already pertinent when the book was released. Fast forward one year, and the book’s themes have taken on even greater significance.

Amazon Science sat down with Kearns and Roth, both of whom recently became Amazon Scholars, to find out whether the events of the past year have influenced their outlook. We talked about what it means to define and pursue fairness, how differential privacy is being applied in the real world and what it can achieve, the challenges faced by regulators, what advice the two University of Pennsylvania professors would give to students studying artificial intelligence and machine learning, and much more.

Q. How has the narrative around designing socially aware algorithms evolved in the past year, and have the events of the past year altered your outlooks in any way?

Aaron Roth: The main thesis of our book, which is that in any particular problem you have to start by thinking carefully about what you want in terms of fairness or privacy or some other social desideratum, and then how you relatively value things like that compared to other things you might care about, such as accuracy—that fundamental thesis hasn't really changed.

Now with the coronavirus pandemic, what we have seen are application areas where how you want to manage the trade-off between accuracy and privacy, for example, is more extreme than we usually see. So, for example, in the midst of an outbreak, contact tracing might be really important, even though you can't really do contact tracing while protecting individual privacy. Because of the urgency of the situation, you might decide to trade off privacy for accuracy. But because the message of our book really is about thinking things through on a case-by-case basis, the thesis itself hasn't changed.

Michael Kearns: The events of the last year, in particular coronavirus, the resulting restrictions on society and the tensions around these restrictions, and all of the recent social upheaval in the United States, clearly have made the topics of our book much more relevant. The book has focused a lot of attention on the use of algorithms for both good and bad purposes, including things like contact tracing or releasing statistics about people's movements or health data, as well as the use of machine learning, AI, and algorithms more generally for applications like surveillance.

Since our book, at a high level, is about the tensions that arise when there's a battle between social norms like equality or privacy and the use of algorithms for optimizing things like performance or error, I don't think anything in the last year has changed our thinking about the technical aspects of these problems. It's clear that society has been forced to face these problems in a very direct way because of the events of the last year, in a way that we really haven't before. In that sense, our timing was very fortunate because the things we're talking about are more relevant now than ever.

Q. How does that affect your ability to define fairness? Is that something that can ever be a fixed definition, or does it need to be adjusted as events or specific use cases dictate?

Kearns: There's not one correct definition of fairness. In every application you have to think about who the parties are that you're trying to protect, and what the harms are that you're trying to protect them from. That changes both over time and in different scenarios.

Roth: Even before the events of the last year, fairness was always a very context- and beholder-dependent notion. One society might be primarily concerned about fairness by race, and another might be primarily concerned about fairness by gender, and a different community might have other norms. The events of the last year have highlighted cases in which not only will things vary over space or communities, but also over time.

People's attitudes about relatively invasive technologies like contact tracing might be quite different now than they were a year ago. If a year ago I had told you, “Suppose there was some disease that some people were catching and the most effective way of tamping it down was to do contact tracing,” many people might have said, “That sounds really invasive to me.” But now that we've all been through one of the alternatives, being on lockdown for six months, people's minds might be changed. We’ve definitely seen norms around privacy for health-related data change.

Q. Standard setting bodies have a significant challenge when it comes to auditing algorithms. Given the scope of that challenge, what needs to happen to allow those groups to do that effectively?

Roth: Although it hasn't happened yet, regulatory agencies are thinking about this, and are reaching out to people like us to help them think about doing this in the right way. I don't know of any regulatory agency that is ready yet to audit algorithms at scale in sensible ways of the technical sort we discuss in the book. But there are regulatory agencies that have gotten the idea that they should be gearing up to do this, and those agencies have started preliminary movements in that direction.

Kearns: Many of the conversations we've had with standard setting bodies make it clear they're realizing that, collectively, they've technologically fallen behind the industries that they regulate. They don't have the right resources or personnel to do some of the more technological types of auditing. But in these conversations, it's also become clear to us that, even if you could snap your fingers and get the right people and the right resources, it will only be part of a broader framework.

Other important pieces involve becoming more precise about best practices, and also thinking carefully about what those specifications should look like. Let me give a concrete example: One of the things that we argue in our book is that there are many laws and regulations in areas like consumer finance, for instance, that try to get at fairness by restricting what kinds of inputs an algorithm can use. These laws and regulations say, “In order to make sure that your model isn't racially discriminatory, you must not use race as a variable.” But, in fact, not using race as a variable is no guarantee that you won't build a model that's discriminatory by race. In fact, it can actually exacerbate that problem. What we advocate in the book is, rather than restricting the inputs, you should specify the behavior you want as outputs. So instead of saying, “Don't use race”, say instead, “The outputs of the models shouldn't be discriminatory by race.”
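To make the input-versus-output distinction concrete, here is a minimal sketch of an output-side audit. Everything in it is hypothetical: the applicant records, the `neighborhood` proxy feature, the toy `decide` rule, and the choice of an approval-rate gap as the measure. That gap is only one of many possible fairness definitions and is not presented as the one Kearns and Roth advocate. The point is only that a rule which never reads the protected attribute can still produce very different outcomes by group, and that this shows up when you inspect outputs.

```python
from collections import defaultdict

# Hypothetical applicants. "group" is the protected attribute;
# "neighborhood" is an illustrative proxy that correlates with it.
applicants = [
    {"group": "A", "neighborhood": "north"},
    {"group": "A", "neighborhood": "north"},
    {"group": "A", "neighborhood": "north"},
    {"group": "A", "neighborhood": "south"},
    {"group": "B", "neighborhood": "north"},
    {"group": "B", "neighborhood": "south"},
    {"group": "B", "neighborhood": "south"},
    {"group": "B", "neighborhood": "south"},
]


def decide(applicant):
    """Toy model that satisfies an input restriction: it never reads 'group'."""
    return applicant["neighborhood"] == "north"


def approval_rates(records, decision_rule):
    """Audit outputs: fraction of approvals within each protected group."""
    totals = defaultdict(int)
    approved = defaultdict(int)
    for record in records:
        totals[record["group"]] += 1
        approved[record["group"]] += int(decision_rule(record))
    return {group: approved[group] / totals[group] for group in totals}


rates = approval_rates(applicants, decide)
gap = max(rates.values()) - min(rates.values())
print(rates, gap)
```

An output-based specification would put a limit on a quantity like `gap` (or on a different measure, such as a gap in error rates), whichever inputs the model uses.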

Q. Differential privacy has progressed from theoretical to applied science in significant ways in the past few years. How is differential privacy being utilized? How does that help balance the trade-off between privacy and accuracy?

Roth: In the last five years or so, differential privacy has gone from an academic topic to a real technology. For example, the 2020 US Decennial Census will, for the first time, release all of its statistical products subject to the protections of differential privacy. This is because, by law, the Census is required to protect the privacy of the people it is surveying. The ad hoc techniques used in previous decades to protect the statistics have been shown not to work.

I think that what we will see is that the statistics that the Census releases this year will be more protective of the privacy of Americans. However, in keeping with the theme of trade-offs, using rigorous privacy protections is not without cost. Certain kinds of analyses, such as detailed demographic studies that rely on having highly granular Census data, might now be unavailable under differential privacy. We've seen this play out in the public sphere between downstream users of the data and folks at Census who actually have to hammer out the details.

We've seen other interesting uses of differential privacy during the pandemic too. Some tech companies have utilized differential privacy when releasing statistics about personal mobility data gathered during the pandemic. What differential privacy is best at is releasing those kinds of population level statistics: It's exactly designed to prevent you from learning too much about any particular individual. If you want to know how much less people are moving around different cities because of coronavirus restrictions, these data sets let you answer that question without giving up too much privacy for individuals whose mobile devices were providing the data at the most granular level.

Q. So how does differential privacy help protect individual information?

Roth: Oftentimes the things that you will most naturally want to know about a data set are not facts about particular people, but are population level aggregates like, how many people are crowded into my supermarket at 6 a.m. when it opens. If you tell me sufficiently many aggregate statistics, I can do some math and back out particular people's data from that. The fact that aggregate statistics can be disclosive about individual people's data is an unfortunate accident that actually doesn't have too much to do with what you really wanted to learn.
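The "do some math and back out particular people's data" step can be illustrated with the simplest case, often called a differencing attack. The sketch below is an illustration only: the names and the visit log are made up, and real reconstruction attacks combine many more statistics than two.

```python
# Hypothetical visit log: was each person in the supermarket at 6 a.m.?
visits = {"ana": True, "ben": False, "chloe": True, "dev": True}


def exact_count(people):
    """An exact aggregate: how many of the given people were present."""
    return sum(visits[person] for person in people)


target = "chloe"
everyone = list(visits)
everyone_but_target = [person for person in everyone if person != target]

# Two harmless-looking aggregate statistics...
count_all = exact_count(everyone)
count_without_target = exact_count(everyone_but_target)

# ...whose difference reveals one individual's record exactly.
inferred_presence = (count_all - count_without_target) == 1
print(f"{target} was present: {inferred_presence}")
```

Neither count mentions the target directly, yet together they disclose that person's presence. Noise added to each released count is what blunts this kind of subtraction.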

At its most basic level, differential privacy does things like add little bits of noise to the statistics that you're releasing so that what you're telling me is not the exact number of people who were in my local supermarket at 6 a.m., but roughly the number of people who were in the supermarket plus or minus some small number of people. The fortunate mathematical fact is that you can add amounts of noise that are relatively small that still allow you to get good estimates, but are sufficient to wash out the contributions of particular people, making it impossible to learn too much about any particular individual. It lets you get access to these population level questions that you were curious about without incidentally or accidentally learning about particular people, which is the dangerous side.

Kearns: To make this slightly more concrete, say what I want to do is each day tell everybody how many people were in the supermarket a couple blocks from me at 1 p.m. If you happened to be at that supermarket at one o’clock, then your GPS data is one of the data points that goes into the count. You may consider your presence at the supermarket at 1 p.m. to be the kind of private information that you don't want the whole world to know. So then let's say that, on a typical day, there might be a couple hundred people at the supermarket, but that I add a random number that is an order of magnitude smaller, somewhere between minus 25 and 25. The addition of that random number mathematically and provably obscures any individual’s contributions to that count. I won't be able to look at that count and try to figure out any particular person who was present. If I add a number that's between minus 25 and 25, I can't affect the overall count by 100. I'll still have an accurate count up to some resolution, but I will have provided privacy to everybody who was present at the supermarket and, actually, all the people who weren't present as well.
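A minimal sketch of the noisy supermarket count follows. It swaps in the Laplace mechanism, a standard way to release a count under differential privacy, rather than the bounded plus-or-minus-25 noise of the spoken example; Laplace noise is unbounded but concentrated near zero. The function name `noisy_count`, the privacy parameter `epsilon = 0.2`, and the true count of 200 are all illustrative assumptions, not values from the interview.

```python
import random


def noisy_count(true_count, epsilon, rng):
    """Return true_count plus Laplace noise with scale 1/epsilon.

    One person changes a count by at most 1 (sensitivity 1), so a scale of
    1/epsilon is the standard Laplace-mechanism calibration for counts.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    scale = 1.0 / epsilon
    # The difference of two independent exponentials with mean `scale`
    # is Laplace-distributed with that scale.
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return true_count + noise


rng = random.Random(0)   # fixed seed so the sketch is repeatable
true_count = 200         # assumed: a couple hundred shoppers at 1 p.m.
epsilon = 0.2            # assumed privacy parameter; noise scale is 5 people

released = [noisy_count(true_count, epsilon, rng) for _ in range(10_000)]
mean_released = sum(released) / len(released)
mean_abs_error = sum(abs(r - true_count) for r in released) / len(released)
print(f"average release: {mean_released:.1f}")
print(f"average error:   {mean_abs_error:.1f} people")
```

With these assumed settings a typical release is off by about five people, small next to a count of two hundred but larger than the one-person change that any single shopper can cause.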

Q. How are topics like fairness, accountability, transparency, interpretability, and privacy showing up in computer science curriculum at Penn and elsewhere within higher education?

Kearns: When Aaron and I first started working on the technical aspects of fairness in machine learning and related topics, the area was pretty sparsely populated. This was maybe six or seven years ago, and there weren't many papers on the topics. There were some older ones, more from the statistics literature, but there wasn't really a community of any size within machine learning that thought about these problems. On the research side, the opposite is now true. All of the major machine learning conferences have significant numbers of papers and workshops devoted to these topics. There are now standalone conferences about fairness, accountability, and explainability in machine learning that are growing every year. It's a very vibrant, active research community now. Additionally, even though it's still early, it's an important enough topic that there are now starting to be efforts to teach this even at the undergraduate level.

The last two years at Penn, for example, I have piloted a course called The Science of Data Ethics. It’s deliberately called that and not The Ethics of Data Science. What that represents is that it’s about the science of making algorithms that are more ethical by different norms, like fairness and privacy. It's not your typical engineering ethics course, which at some level is meant to teach you to be a good, responsible person in that you look at case studies where things went wrong and you talk about what you would do differently. This class is a science class. It says: Here are the standard principles of machine learning, here's how those standard principles can lead to discriminatory behavior in my predictive models, and here are alternate principles, or modifications of those principles and the algorithms that implement them, that avoid or mitigate that behavior.

Q. Is there a more multidisciplinary approach to this set of challenges?

Roth: It's definitely a multidisciplinary area. At Penn, we've been actively collaborating with interested folks in the law school and the criminology department. So far, we don't really have interdisciplinary undergraduate courses on these topics. Those courses would be good in the long run, but at the research and graduate level we've been having interdisciplinary conversations for a number of years.

Kearns: Not just at the teaching level, but even in the research community, there's a real melting pot of viewpoints on these topics. Even though our book is focused on the scientific aspects of these issues, we do spend some time mentioning the fact that the science will only take us so far. In particular, one critique that we try to anticipate in the book, and that we’re very aware of, is that technical work on making algorithms more ethical is only one piece of a much larger sociological, or what some people would call socio-technical, pipeline. Machine learning begins with data and ends with a model. But upstream from the data is the entire manner in which the data was collected and the conditions under which it was collected.

One of the things that's very interesting, exciting, and necessary about the dialogue around these kinds of issues is that, even when there's quite a bit to say on them scientifically, you don't want to just put your head down and look at the science. You want to talk to people who are upstream and downstream from the machine learning part of this pipeline because they bring very different perspectives, and can often point out perspectives which can help you change the way you look at things scientifically in a positive way.

Q. If I were a student exploring AI or ML and I wanted to influence this particular conversation, beyond technical skills, what kind of skills should I be developing?

Kearns: What I would very strongly advocate is: think widely, think broadly, think big. Yes, you're going to be doing technical work in particular models and frameworks, and you know you want to get results in those frameworks. But also read what people who are from very, very different fields think about these problems. Go to their conferences, don't just go to the machine learning conferences and to the sub-track on fairness and machine learning. Go to the interdisciplinary conferences and workshops that are deliberately meant to bring together scientists, legal scholars, philosophers, sociologists, and regulators. Hear their views on these problems, keep an ear out for whether they even think you're working on a problem that's relevant or even has a solution.

That's the way I have approached my career: focus on what I'm good at and what I think is interesting from a scientific standpoint, but not in a scientific vacuum. I deliberately expose myself whenever possible to what people from a completely different perspective are thinking about the same set of topics. The good news is that there's a lot of opportunity for that right now. If you work in some branch of material science, it may not be possible to wander out in the world and get diverse perspectives, but everybody has an opinion on AI and machine learning ethics these days, so there is no shortage of sources from which this hypothetical student could go out and find their own technical views challenged or broadened.

Roth: One trap that is very easy for a new PhD student, or even an established researcher, to fall into is to write the introductions to your papers motivated by some kind of fairness problem, but then find yourself solving some narrow technical problem that ultimately has little connection to the world. I am sometimes guilty of this myself, but this is an area where there really are lots of important problems to solve. It's an area where theoretical approaches, if wielded correctly, can be extremely valuable. The thing that’s valuable is to be, sort of, multilingual. It can be difficult to talk to people from other fields because those fields have different vocabularies and a different world view. However, it's important to understand the perspective of these different communities. There are interdisciplinary groups looking at fairness, accountability, and transparency, which bring people together from all sorts of backgrounds to actively work on developing, at the very least, a shared vocabulary—and hopefully a shared world view.

Q. You've become Amazon Scholars fairly recently. What inspired you to take on this role?

Roth: I've spent most of my career as a theorist, so the ways I've been primarily thinking about privacy and fairness are in the abstract. I've had fun thinking about questions like: What kinds of things are, and are not, possible in principle with differential privacy? Or what kinds of semantic fairness promises can you make to people in a way that is still consistent with trying to learn something from the data? The attraction of Amazon and AWS is that it's where the rubber meets the road. Here we are deploying real machine learning products, and the privacy and the fairness concerns are real and pressing.

My hope is that by having a foot in the practice of these problems, not just their theory, not only will I have some effect on how consequential products actually work, but I’ll learn things that will be helpful in developing new theory that is grounded in the real world.

Kearns: I've had a kind of second life in the quantitative finance industry up until I joined Amazon. While I spent time doing practical things in the world of finance, it was more just using my general knowledge in machine learning. The opportunity to come to Amazon and really think about the topics we've been discussing in a practical technological setting seemed like a great opportunity. I'm also a long-term fan and observer of the company. I’ve known people here for many years, and have had great conversations with them. So I’ve watched with great interest over the last decade plus as Amazon grew its machine learning effort from scratch and gradually grew it to have wider and wider applications. It’s now at a point where not only is machine learning used widely within the company to optimize all kinds of processes and recommendations and the like, but it’s also used by customers worldwide in the form of services like Amazon SageMaker.

I have watched this with great interest because when I was studying machine learning in graduate school back in the late 80s, trust me, it was an obscure corner of AI that people kind of raised their eyebrows at. I never would have thought we would reach the point where not only does The Wall Street Journal expect everyone to know what they mean when they write about machine learning, but that it would actually be a product sold at scale.

I've watched these developments from academia and from the world of finance. It seemed like a great opportunity to combine my very specific current research and other interests with an inside look at one of the great technology companies. Like Aaron, my expectations, which were high, have only been exceeded in the time I've spent here.

Research areas

Related content

US, NY, New York
We are seeking an Applied Scientist to lead the development of evaluation frameworks and data collection protocols for robotic capabilities. In this role, you will focus on designing how we measure, stress-test, and improve robot behavior across a wide range of real-world tasks. Your work will play a critical role in shaping how policies are validated and how high-quality datasets are generated to accelerate system performance. You will operate at the intersection of robotics, machine learning, and human-in-the-loop systems, building the infrastructure and methodologies that connect teleoperation, evaluation, and learning. This includes developing evaluation policies, defining task structures, and contributing to operator-facing interfaces that enable scalable and reliable data collection. The ideal candidate is highly experimental, systems-oriented, and comfortable working across software, robotics, and data pipelines, with a strong focus on turning ambiguous capability goals into measurable and actionable evaluation systems. 
Key job responsibilities - Design and implement evaluation frameworks to measure robot capabilities across structured tasks, edge cases, and real-world scenarios - Develop task definitions, success criteria, and benchmarking methodologies that enable consistent and reproducible evaluation of policies - Create and refine data collection protocols that generate high-quality, task-relevant datasets aligned with model development needs - Build and iterate on teleoperation workflows and operator interfaces to support efficient, reliable, and scalable data collection - Analyze evaluation results and collected data to identify performance gaps, failure modes, and opportunities for targeted data collection - Collaborate with engineering teams to integrate evaluation tooling, logging systems, and data pipelines into the broader robotics stack - Stay current with advances in robotics, evaluation methodologies, and human-in-the-loop learning to continuously improve internal approaches - Lead technical projects from conception through production deployment - Mentor junior scientists and engineers
US, WA, Seattle
Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports – including Amazon MGM Studios-produced series and movies; licensed fan favorites; and programming from Prime Video subscriptions such as Apple TV+, HBO Max, Peacock, Crunchyroll and MGM+. All customers, regardless of whether they have a Prime membership or not, can rent or buy titles via the Prime Video Store, and can enjoy even more content for free with ads. Are you interested in shaping the future of entertainment? Prime Video's technology teams are creating best-in-class digital video experience. As a Prime Video team member, you’ll have end-to-end ownership of the product, user experience, design, and technology required to deliver state-of-the-art experiences for our customers. You’ll get to work on projects that are fast-paced, challenging, and varied. You’ll also be able to experiment with new possibilities, take risks, and collaborate with remarkable people. We’ll look for you to bring your diverse perspectives, ideas, and skill-sets to make Prime Video even better for our customers. With global opportunities for talented technologists, you can decide where a career Prime Video Tech takes you! As an Applied Scientist, you will apply state of the art natural language processing and computer vision research to video centric digital media. We are looking for scientists with expertise in vision-language models/multimodal LLMs and long-form content understanding (full movies/episode vs. short clips). You will be dealing with architectures that handle long-context understanding and causal reasoning across extended temporal sequences. Key job responsibilities Our team builds multi-modal machine learning technologies to enrich and understand video content. 
We aim not only to understand individual components within the content itself, but also their relationships to each other to provide a holistic and broader contextual understanding. This powers the next generation of video understanding and search capabilities for Prime Video. About the team Prime Video's Content Localization, Understanding & Enrichment organization is responsible for 1) enabling Prime Video to "see" and "understand" video content including characters, scenes, dialogue, events & visual elements and 2) delivering localized, accessible content that meets a consistent cinematic quality standard at scale. This team's mission is to deeply understand all content and empower all customers with relevant language options, innovative accessibility assists, and rich title-information across all their content-experiences on Prime Video. We create and publish content on-time that's meaningful, accurate, and accessible to every customer globally. We delight our customers by pushing the boundaries of content understanding and enrichment. Through inclusion and innovation, we do the most fulfilling work of our career.
US, CA, San Francisco
The Amazon Center for Quantum Computing (CQC) is seeking to hire an Applied Science Manager to lead a team of scientists in the physical design and simulation of superconducting quantum processors. In this role, you will use advanced modeling, simulation, and experimental design to drive improvements in scaling and performance. You will partner with other physics and engineering teams to advance the development of fault-tolerant quantum computers. Key job responsibilities - Hire Applied Scientists from diverse technical backgrounds to design quantum processors and improve the design process - Develop scientific talent through goal setting, feedback, collaborative work, and coaching - Collaborate with other science teams in designing experiments to overcome scaling and performance limitations - Influence engineering team development priorities in enabling systematic processor design and simulation workflows - Manage tactical and strategic initiatives with scientific projects pursued within team - Enable creative and innovative experimentation while striving for operational excellence About the team The Amazon Center for Quantum Computing (CQC) is a multi-disciplinary team of scientists, engineers, and technicians, on a mission to develop a fault-tolerant quantum computer. Inclusive Team Culture Here at Amazon, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness. Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. 
Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Export Control Requirement Due to applicable export control laws and regulations, candidates must be either a U.S. citizen or national, U.S. permanent resident (i.e., current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum, or be able to obtain a US export license. If you are unsure if you meet these requirements, please apply and Amazon will review your application for eligibility.