Michael Kearns and Aaron Roth seated at a table in front of a large chalk board.
Michael Kearns, left, and Aaron Roth, right, are the co-authors ofThe Ethical Algorithm: The Science of Socially Aware Algorithm Design. Kearns and Roth are leading researchers in machine learning, University of Pennsylvania computer science professors, and Amazon Scholars.
University of Pennsylvania

Amazon Scholars Michael Kearns and Aaron Roth discuss the ethics of machine learning

Two of the world’s leading experts on algorithmic bias look back at the events of the past year and reflect on what we’ve learned, what we’re still grappling with, and how far we have to go.

In November of 2019, University of Pennsylvania computer science professors Michael Kearns and Aaron Roth released The Ethical Algorithm: The Science of Socially Aware Algorithm Design. Kearns is the founding director of the Warren Center for Network and Data Sciences, and the faculty founder and former director of Penn Engineering’s Networked and Social Systems Engineering program. Roth is the co-director of Penn’s program in Networked and Social Systems Engineering and co-authored The Algorithmic Foundations of Differential Privacy with Cynthia Dwork. Kearns and Roth are leading researchers in machine learning, focusing on both the design and real-world application of algorithms.

Their book’s central thesis, which involves “the science of designing algorithms that embed social norms such as fairness and privacy into their code,” was already pertinent when the book was released. Fast forward one year, and the book’s themes have taken on even greater significance.

Amazon Science sat down with Kearns and Roth, both of whom recently became Amazon Scholars, to find out whether the events of the past year have influenced their outlook. We talked about what it means to define and pursue fairness, how differential privacy is being applied in the real world and what it can achieve, the challenges faced by regulators, what advice the two University of Pennsylvania professors would give to students studying artificial intelligence and machine learning, and much more.

Q. How has the narrative around designing socially aware algorithms evolved in the past year, and have the events of the past year altered your outlooks in any way?

Aaron Roth: The main thesis of our book, which is that in any particular problem you have to start by thinking carefully about what you want in terms of fairness or privacy or some other social desideratum, and then how you relatively value things like that compared to other things you might care about, such as accuracy—that fundamental thesis hasn't really changed.

Now with the coronavirus pandemic, what we have seen are application areas where how you want to manage the trade-off between accuracy and privacy, for example, is more extreme than we usually see. So, for example, in the midst of an outbreak, contact tracing might be really important, even though you can't really do contact tracing while protecting individual privacy. Because of the urgency of the situation, you might decide to trade off privacy for accuracy. But because the message of our book really is about thinking things through on a case-by-case basis, the thesis itself hasn't changed.

Michael Kearns: The events of the last year, in particular coronavirus, the resulting restrictions on society and the tensions around these restrictions, and all of the recent social upheaval in the United States, clearly has made the topics of our book much more relevant. The book has focused a lot of attention on the use of algorithms for both good and bad purposes, including things like contact tracing or releasing statistics about people's movements or health data, as well as the use of machine learning, AI, and algorithms more generally for applications like surveillance.

Since our book, at a high level, is about the tensions that arise when there's a battle between social norms like equality or privacy and the use of algorithms for optimizing things like performance or error, I don't think anything in the last year has changed our thinking about the technical aspects of these problems. It's clear that society has been forced to face these problems in a very direct way because of the events of the last year, in a way that we really haven't before. In that sense, our timing was very fortunate because the things we're talking about are more relevant now than ever.

Q. How does that affect your ability to define fairness? Is that something that can ever be a fixed definition, or does it need to be adjusted as events or specific use cases dictate?

Kearns: There's not one correct definition of fairness. In every application you have to think about who the parties are that you're trying to protect, and what the harms are that you're trying to protect them from. That changes both over time and in different scenarios.

Roth: Even before the events of the last year, fairness was always a very context- and beholder- dependent notion. One society might be primarily concerned about fairness by race, and another might be primarily concerned about fairness by gender, and a different community might have other norms. The events of the last year have highlighted cases in which not only will things vary over space or communities, but also over time.

People's attitudes about relatively invasive technologies like contact tracing might be quite different now than they were a year ago. If a year ago I told you, “Suppose there was some disease that some people were catching and the most effective way of tamping it down was to do contact tracing.” Many people might have said, “That sounds really invasive to me”, but now that we've all been through one of the alternatives—being on lock down for six months—people's minds might be changed. We’ve definitely seen norms around privacy for health-related data change.

Q. Standard setting bodies have a significant challenge when it comes to auditing algorithms. Given the scope of that challenge, what needs to happen to allow those groups to do that effectively?

Roth: Although it hasn't happened yet, regulatory agencies are thinking about this, and are reaching out to people like us to help them think about doing this in the right way. I don't know of any regulatory agency that is ready yet to audit algorithms at-scale in sensible ways of the technical sort we discuss in the book. But there are regulatory agencies that have gotten the idea that they should be gearing up to do this, and those agencies have started preliminary movements in that direction.

Kearns: Many of the conversations we've had with standard setting bodies make it clear they're realizing that, collectively, they've technologically fallen behind the industries that they regulate. They don't have the right resources or personnel to do some of the more technological types of auditing. But in these conversations, it's also become clear to us that, even if you could snap your fingers and get the right people and the right resources, it will only be part of a broader framework.

Other important pieces involve becoming more precise about best practices, and also thinking carefully about what those specifications should look like. Let me give a concrete example: One of the things that we argue in our book is that there are many laws and regulations in areas like consumer finance, for instance, that try to get at fairness by restricting what kinds of inputs an algorithm can use. These laws and regulations say, “In order to make sure that your model isn't racially discriminatory, you must not use race as a variable.” But, in fact, not using race as a variable is no guarantee that you won't build a model that's discriminatory by race. In fact, it can actually exacerbate that problem. What we advocate in the book is, rather than restricting the inputs, you should specify the behavior you want as outputs. So instead of saying, “Don't use race”, say instead, “The outputs of the models shouldn't be discriminatory by race.”

Q. Differential privacy has progressed from theoretical to applied science in significant ways in the past few years. How is differential privacy being utilized? How does that help balance the trade-off between privacy and accuracy?

Roth: In the last five years or so, differential privacy has gone from an academic topic to a real technology. For example, the 2020 US Decennial Census is going to release all of its statistical products for the first time, subject to the protections of differential privacy. This is because, by law, the Census is required to protect the privacy of the people it is surveying. The ad hoc techniques used in previous decades to protect the statistics have been shown not to work.

I think that what we will see is that the statistics that the Census releases this year will be more protective of the privacy of Americans. However, in the theme of trade off, using rigorous privacy protections is not without cost. Certain kinds of analyses, such as detailed demographic studies that rely on having highly granular Census data, might now be unavailable under differential privacy. We've seen this play out in the public sphere between downstream users of the data and folks at Census who actually have to hammer out the details.

We've seen other interesting uses of differential privacy during the pandemic too. Some tech companies have utilized differential privacy when releasing statistics about personal mobility data gathered during the pandemic. What differential privacy is best at is releasing those kinds of population level statistics: It's exactly designed to prevent you from learning too much about any particular individual. If you want to know how much less people are moving around different cities because of coronavirus restrictions, these data sets let you answer that question without giving up too much privacy for individuals whose mobile devices were providing the data at the most granular level.

Q. So how does differential privacy help protect individual information?

Roth: Oftentimes the things that you will most naturally want to know about a data set are not facts about particular people, but are population level aggregates like, how many people are crowded into my supermarket at 6 a.m. when it opens. If you tell me sufficiently many aggregate statistics, I can do some math and back out particular people's data from that. The fact that aggregate statistics can be disclosive about individual people's data is an unfortunate accident that actually doesn't have too much to do with what you really wanted to learn.

At its most basic level, differential privacy does things like add little bits of noise to the statistics that you're releasing so that what you're telling me is not the exact number of people who were in my local supermarket at 6 a.m., but roughly the number of people who were in the supermarket plus or minus some small number of people. The fortunate mathematical fact is that you can add amounts of noise that are relatively small that still allow you to get good estimates, but are sufficient to wash out the contributions of particular people, making it impossible to learn too much about any particular individual. It lets you get access to these population level questions that you were curious about without incidentally or accidentally learning about particular people, which is the dangerous side.

"We are bullish about algorithms"
Michael Kearns and Aaron Roth talked to Oxford Academic about the future of AI.

Kearns: To make this slightly more concrete, say what I want to do is each day tell everybody how many people were in the supermarket a couple blocks from me at 1 p.m. If you happened to be at that supermarket at one o’clock, then your GPS data is one of the data points that goes into the count. You may consider your presence at supermarket at 1 p.m. to be the kind of private information that you don't want the whole world to know. So then let's say that, on a typical day, there might be a couple hundred people at the supermarket, but that I add a number which is an order of magnitude, plus or minus 25. The addition of that random number mathematically and provably obscures any individual’s contributions to that count. I won't be able to look at that count and try to figure out any particular person who was present. If I add a number that's between minus 25 and 25, I can't affect the overall count by 100. I'll still have an accurate count up to some resolution, but I will have provided privacy to everybody who was present at the supermarket and, actually, all the people who weren't present as well.

Q. How are topics like fairness, accountability, transparency, interpretability, and privacy showing up in computer science curriculum at Penn and elsewhere within higher education?

Kearns: When Aaron and I first started working on the technical aspects of fairness in machine learning and related topics, it was pretty sparsely populated. This was maybe six or seven years ago, and there weren't many papers on the topics. There were some older ones, more from the statistics literature, but there wasn't really a community of any size within machine learning that thought about these problems. On the research side, the opposite is now true. All of the major machine learning conferences have significant numbers of papers and workshops on these topics; they have workshops devoted to these topics. There are now standalone conferences about fairness, accountability, and explainability in machine learning that are growing every year. It's a very vibrant, active research community now. Additionally, even though it's still early, it's an important enough topic that there are now starting to be efforts to teach this even at the undergraduate level.

The last two years at Penn, for example, I have piloted a course called The Science of Data Ethics. It’s deliberately called that and not The Ethics of Data Science. What that represents is that it’s about the science of making algorithms that are more ethical by different norms, like fairness and privacy. It's not your typical engineering ethics course, which at some level is meant to teach you to be a good, responsible person in that you look at case studies where things went wrong and you talk about what you would do differently. This class is a science class. It says: Here are the standard principles of machine learning, here's how those standard principles can lead to discriminatory behavior in my predictive models, and here are alternate principles, or modifications of those principles and the algorithms that implement them, that avoid or mitigate that behavior.

Q. Is there a more multidisciplinary approach to this set of challenges?

Roth: It's definitely a multidisciplinary area. At Penn, we've been actively collaborating with interested folks in the law school and the criminology department. So far, we don't really have interdisciplinary undergraduate courses on these topics. Those courses would be good in the long run, but at the research and graduate level we've been having interdisciplinary conversations for a number of years.

In particular, one critique that we try to anticipate in the book, and that we’re very aware of, is that technical work on making algorithms more ethical is only one piece of a much larger sociological, or what some people would call socio-technical, pipeline.
Michael Kearns

Kearns: Not just at the teaching level, but even in the research community, there's a real melting pot of viewpoints on these topics. Even though our book is focused on the scientific aspects of these issues, we do spend some time mentioning the fact that the science will only take us so far. In particular, one critique that we try to anticipate in the book, and that we’re very aware of, is that technical work on making algorithms more ethical is only one piece of a much larger sociological, or what some people would call socio-technical, pipeline. Machine learning begins with data and ends with a model. But upstream from the data is the entire manner in which the data was collected and the conditions under which it was collected.

One of the things that's very interesting, exciting, and necessary about the dialogue around these kinds of issues is that, even when there's quite a bit to say on them scientifically, you don't want to just put your head down and look at the science. You want to talk to people who are upstream and downstream from the machine learning part of this pipeline because they bring very different perspectives, and can often point out perspectives which can help you change the way you look at things scientifically in a positive way.

Q. If I were a student exploring AI or ML and I wanted to influence this particular conversation, beyond technical skills, what kind of skills should I be developing?

Kearns: What I would very strongly advocate is: think widely, think broadly, think big. Yes, you're going to be doing technical work in particular models and frameworks, and you know you want to get results in those frameworks. But also read what people who are from very, very different fields think about these problems. Go to their conferences, don't just go to the machine learning conferences and to the sub-track on fairness and machine learning. Go to the interdisciplinary conferences and workshops that are deliberately meant to bring together scientists, legal scholars, philosophers, sociologists, and regulators. Hear their views on these problems, keep an ear out for whether they even think you're working on a problem that's relevant or even has a solution.

That's the way I have approached my career: focus on what I'm good at and what I think is interesting from a scientific standpoint, but not in a scientific vacuum. I deliberately expose myself whenever possible to what people from a completely different perspective are thinking about the same set of topics. The good news is that there's a lot of opportunity for that right now. If you work in some branch of material science, it may not be possible to wander out in the world and get diverse perspectives, but everybody has an opinion on AI and machine learning ethics these days, so there is no shortage of sources from which this hypothetical student could go out and find their own technical views challenged or broadened.

Roth: One trap that is very easy for a new PhD student, or even an established researcher, to fall into is to write the introductions to your papers motivated by some kind of fairness problem, but then find yourself solving some narrow technical problem that ultimately has little connection to the world. I am sometimes guilty of this myself, but this is an area where there really are lots of important problems to solve. It's an area where theoretical approaches, if wielded correctly, can be extremely valuable. The thing that’s valuable is to be, sort of, multilingual. It can be difficult to talk to people from other fields because those fields have different vocabularies and a different world view. However, it's important to understand the perspective of these different communities. There are interdisciplinary groups looking at fairness, accountability, and transparency, which bring people together from all sorts of backgrounds to actively work on developing, at the very least, a shared vocabulary—and hopefully a shared world view.

Q. You've become Amazon Scholars fairly recently. What inspired you to take on this role?

Roth: I've spent most of my career as a theorist, so the ways I've been primarily thinking about privacy and fairness are in the abstract. I've had fun thinking about questions like: What kinds of things are, and are not, possible in principle with differential privacy? Or what kinds of semantic fairness promises can you make to people in a way that is still consistent with trying to learn something from the data? The attraction of Amazon and AWS is that it's where the rubber meets the road. Here we are deploying real machine learning products, and the privacy and the fairness concerns are real and pressing.

My hope is that by having a foot in the practice of these problems, not just their theory, not only will I have some effect on how consequential products actually work, but I’ll learn things that will be helpful in developing new theory that is grounded in the real world.

Kearns: I've had a kind of second life in the quantitative finance industry up until I joined Amazon. While I spent time doing practical things in the world of finance, it was more just using my general knowledge in machine learning. The opportunity to come to Amazon and really think about the topics we've been discussing in a practical technological setting seemed like a great opportunity. I'm also a long-term fan and observer of the company. I’ve known people here for many years, and have had great conversations with them. So I’ve watched with great interest over the last decade plus as Amazon grew its machine learning effort from scratch and gradually grew it to have wider and wider applications. It’s now at a point where not only is machine learning used widely within the company to optimize all kinds of processes and recommendations and the like, but it’s also used by customers worldwide in the form of services like Amazon SageMaker.

I have watched this with great interest because when I was studying machine learning in graduate school back in the late 80s, trust me, it was an obscure corner of AI that people kind of raised their eyebrows at. I never would have thought we would reach the point where not only does The Wall Street Journal expect everyone to know what they mean when they write about machine learning, but that it would actually be a product sold at scale.

I've watched these developments from academia and from the world of finance.  It seemed like a great opportunity to combine my very specific current research and other interests with an inside look at one of the great technology companies. Like Aaron, my expectations, which were high, have only been exceeded in the time I've spent here.

Research areas

Related content

US, MA, Boston
The Artificial General Intelligence (AGI) team is looking for a highly skilled and experienced Sr. Applied Scientist, to support the development and implementation of state-of-the-art algorithms and models for supervised fine-tuning and reinforcement learning through human feedback and complex reasoning; with a focus across text, image, and video modalities. As an Sr. Applied Scientist, you will play a critical role in supporting the development of Generative AI (Gen AI) technologies that can handle Amazon-scale use cases and have a significant impact on our customers' experiences. Key job responsibilities Collaborate with cross-functional teams of engineers, product managers, and scientists to identify and solve complex problems in Gen AI Design and execute experiments to evaluate the performance of different algorithms (PT, SFT, RL) and models, and iterate quickly to improve results Think big about the arc of development of Gen AI over a multi-year horizon, and identify new opportunities to apply these technologies to solve real-world problems Communicate results and insights to both technical and non-technical audiences, including through presentations and written reports About the team We are passionate scientists dedicated to pushing the boundaries of innovation in Gen AI with focus on Software Development use cases.
IN, HR, Gurugram
Do you want to join an innovative team of scientists who use machine learning and statistical techniques to create state-of-the-art solutions for providing better value to Amazon’s customers? Do you want to build and deploy advanced ML systems that help optimize millions of transactions every day? Are you excited by the prospect of analyzing and modeling terabytes of data to solve real-world problems? Do you like to own end-to-end business problems/metrics and directly impact the profitability of the company? Do you like to innovate and simplify? If yes, then you may be a great fit to join the Machine Learning team for India Consumer Businesses. Machine Learning, Big Data and related quantitative sciences have been strategic to Amazon from the early years. Amazon has been a pioneer in areas such as recommendation engines, ecommerce fraud detection and large-scale optimization of fulfillment center operations. As Amazon has rapidly grown and diversified, the opportunity for applying machine learning has exploded. We have a very broad collection of practical problems where machine learning systems can dramatically improve the customer experience, reduce cost, and drive speed and automation. These include product bundle recommendations for millions of products, safeguarding financial transactions across by building the risk models, improving catalog quality via extracting product attribute values from structured/unstructured data for millions of products, enhancing address quality by powering customer suggestions We are developing state-of-the-art machine learning solutions to accelerate the Amazon India growth story. Amazon India is an exciting place to be at for a machine learning practitioner. We have the eagerness of a fresh startup to absorb machine learning solutions, and the scale of a mature firm to help support their development at the same time. As part of the India Machine Learning team, you will get to work alongside brilliant minds motivated to solve real-world machine learning problems that make a difference to millions of our customers. We encourage thought leadership and blue ocean thinking in ML. Key job responsibilities Use machine learning and analytical techniques to create scalable solutions for business problems Analyze and extract relevant information from large amounts of Amazon’s historical business data to help automate and optimize key processes Design, develop, evaluate and deploy, innovative and highly scalable ML models Work closely with software engineering teams to drive real-time model implementations Work closely with business partners to identify problems and propose machine learning solutions Establish scalable, efficient, automated processes for large scale data analyses, model development, model validation and model maintenance Work proactively with engineering teams and product managers to evangelize new algorithms and drive the implementation of large-scale complex ML models in production Leading projects and mentoring other scientists, engineers in the use of ML techniques About the team International Machine Learning Team is responsible for building novel ML solutions that attack India first (and other Emerging Markets across MENA and LatAm) problems and impact the bottom-line and top-line of India business. Learn more about our team from https://www.amazon.science/working-at-amazon/how-rajeev-rastogis-machine-learning-team-in-india-develops-innovations-for-customers-worldwide
US, CA, Sunnyvale
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Principal Applied Scientist with a strong deep learning background, to lead the development of industry-leading technology with multimodal systems. As a Principal Scientist within the Artificial General Intelligence (AGI) organization, you are a trusted part of the technical leadership. You bring business and industry context to science and technology decisions, set the standard for scientific excellence, and make decisions that affect the way we build and integrate algorithms. A Principal Applied Scientist will solicit differing views across the organization and are willing to change your mind as you learn more. Your artifacts are exemplary and often used as reference across organization. You are a hands-on scientific leader; develop solutions that are exemplary in terms of algorithm design, clarity, model structure, efficiency, and extensibility; and tackle intrinsically hard problems, acquiring expertise as needed. Principal Applied Scientists are expected to decompose complex problems into straightforward solutions. You amplify your impact by leading scientific reviews within your organization or at your location; and scrutinize and review experimental design, modeling, verification and other research procedures. You also probe assumptions, illuminate pitfalls, and foster shared understanding; align teams toward coherent strategies; and educate keeping the scientific community up to date on advanced techniques, state of the art approaches, the latest technologies, and trends. AGI Principal Applied Scientists help managers guide the career growth of other scientists by mentoring and play a significant role in hiring and developing scientists and leads. You will play a critical role in driving the development of Generative AI (GenAI) technologies that can handle Amazon-scale use cases and have a significant impact on our customers' experiences. Key job responsibilities You will be responsible for defining key research directions, inventing new machine learning techniques, conducting rigorous experiments, and ensuring that research is translated into practice. You will develop long-term strategies, persuade teams to adopt those strategies, propose goals and deliver on them. A Principal Applied Scientist will participate in organizational planning, hiring, mentorship and leadership development. You will also be build scalable science and engineering solutions, and serve as a key scientific resource in full-cycle development (conception, design, implementation, testing to documentation, delivery, and maintenance).
US, WA, Seattle
Innovators wanted! Are you an entrepreneur? A builder? A dreamer? This role is part of an Amazon Special Projects team that takes the company’s Think Big leadership principle to the next level. We focus on creating entirely new products and services with a goal of positively impacting the lives of our customers. No industries or subject areas are out of bounds. If you’re interested in innovating at scale to address big challenges in the world, this is the team for you. As a Research Scientist, you will work with a unique and gifted team developing exciting products for consumers and collaborate with cross-functional teams. Our team rewards intellectual curiosity while maintaining a laser-focus in bringing products to market. At the edge of both academic and applied research in this product area, you have the opportunity to work together with some of the most talented scientists, engineers, and product managers. Here at Amazon, we embrace our differences. We are committed to furthering our culture of inclusion. We have thirteen employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We are constantly learning through programs that are local, regional, and global. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust. Our team highly values work-life balance, mentorship and career growth. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We care about your career growth and strive to assign projects and offer training that will challenge you to become your best. Key job responsibilities * Partner with laboratory science teams on design and analysis of experiments * Originate and lead the development of new data collection workflows with cross-functional partners * Develop and deploy scalable bioinformatics analysis and QC workflows * Evaluate and incorporate novel bioinformatic approaches to solve critical business problems
US, CA, Sunnyvale
As a Principal Scientist within the Artificial General Intelligence (AGI) organization, you are a trusted part of the technical leadership. You bring business and industry context to science and technology decisions, set the standard for scientific excellence, and make decisions that affect the way we build and integrate algorithms. A Principal Applied Scientist will solicit differing views across the organization and are willing to change your mind as you learn more. Your artifacts are exemplary and often used as reference across organization. You are a hands-on scientific leader; develop solutions that are exemplary in terms of algorithm design, clarity, model structure, efficiency, and extensibility; and tackle intrinsically hard problems, acquiring expertise as needed. Principal Applied Scientists are expected to decompose complex problems into straightforward solutions. You amplify your impact by leading scientific reviews within your organization or at your location; and scrutinize and review experimental design, modeling, verification and other research procedures. You also probe assumptions, illuminate pitfalls, and foster shared understanding; align teams toward coherent strategies; and educate keeping the scientific community up to date on advanced techniques, state of the art approaches, the latest technologies, and trends. AGI Principal Applied Scientists help managers guide the career growth of other scientists by mentoring and play a significant role in hiring and developing scientists and leads. You will play a critical role in driving the development of Generative AI (GenAI) technologies that can handle Amazon-scale use cases and have a significant impact on our customers' experiences. Key job responsibilities You will be responsible for defining key research directions, inventing new machine learning techniques, conducting rigorous experiments, and ensuring that research is translated into practice. You will develop long-term strategies, persuade teams to adopt those strategies, propose goals and deliver on them. A Principal Applied Scientist will participate in organizational planning, hiring, mentorship and leadership development. You will also be build scalable science and engineering solutions, and serve as a key scientific resource in full-cycle development (conception, design, implementation, testing to documentation, delivery, and maintenance). A day in the life About the team Amazon’s AGI team is focused on building foundational AI to solve real-world problems at scale, delivering value to all existing businesses in Amazon, and enabling entirely new services and products for people and enterprises around the world.
LU, Luxembourg
Are you a MS or PhD student interested in a 2026 internship in the field of machine learning, deep learning, generative AI, large language models and speech technology, robotics, computer vision, optimization, operations research, quantum computing, automated reasoning, or formal methods? If so, we want to hear from you! We are looking for students interested in using a variety of domain expertise to invent, design and implement state-of-the-art solutions for never-before-solved problems. You can find more information about the Amazon Science community as well as our interview process via the links below; https://www.amazon.science/ https://amazon.jobs/content/en/career-programs/university/science https://amazon.jobs/content/en/how-we-hire/university-roles/applied-science Key job responsibilities As an Applied Science Intern, you will own the design and development of end-to-end systems. You’ll have the opportunity to write technical white papers, create roadmaps and drive production level projects that will support Amazon Science. You will work closely with Amazon scientists and other science interns to develop solutions and deploy them into production. You will have the opportunity to design new algorithms, models, or other technical solutions whilst experiencing Amazon’s customer focused culture. The ideal intern must have the ability to work with diverse groups of people and cross-functional teams to solve complex business problems. A day in the life At Amazon, you will grow into the high impact person you know you’re ready to be. Every day will be filled with developing new skills and achieving personal growth. How often can you say that your work changes the world? At Amazon, you’ll say it often. Join us and define tomorrow. Some more benefits of an Amazon Science internship include; • All of our internships offer a competitive stipend/salary • Interns are paired with an experienced manager and mentor(s) • Interns receive invitations to different events such as intern program initiatives or site events • Interns can build their professional and personal network with other Amazon Scientists • Interns can potentially publish work at top tier conferences each year About the team Applicants will be reviewed on a rolling basis and are assigned to teams aligned with their research interests and experience prior to interviews. Start dates are available throughout the year and durations can vary in length from 3-6 months for full time internships. This role may available across multiple locations in the EMEA region (Austria, Estonia, France, Germany, Ireland, Israel, Italy, Jordan, Luxembourg, Netherlands, Poland, Romania, Spain, South Africa, UAE, and UK). Please note these are not remote internships.
US, WA, Seattle
Revolutionize the Future of AI at the Frontier of Applied Science Are you a brilliant mind seeking to push the boundaries of what's possible with artificial intelligence? Join our elite team of researchers and engineers at the forefront of applied science, where we're harnessing the latest advancements in natural language processing, deep learning, and generative AI to reshape industries and unlock new realms of innovation. As an Applied Science Intern, you'll have the unique opportunity to work alongside world-renowned experts, gaining invaluable hands-on experience with cutting-edge technologies such as large language models, transformers, and neural networks. You'll dive deep into complex challenges, fine-tuning state-of-the-art models, developing novel algorithms for named entity recognition, and exploring the vast potential of generative AI. This internship is not just about executing tasks – it's about being a driving force behind groundbreaking discoveries. You'll collaborate with cross-functional teams, leveraging your expertise in statistics, recommender systems, and question answering to tackle real-world problems and deliver impactful solutions. Throughout your journey, you'll have access to unparalleled resources, including state-of-the-art computing infrastructure, cutting-edge research papers, and mentorship from industry luminaries. This immersive experience will not only sharpen your technical skills but also cultivate your ability to think critically, communicate effectively, and thrive in a fast-paced, innovative environment where bold ideas are celebrated. Join us at the forefront of applied science, where your contributions will shape the future of AI and propel humanity forward. Seize this extraordinary opportunity to learn, grow, and leave an indelible mark on the world of technology. Amazon has positions available for LLM & GenAI Applied Science Internships in, but not limited to, Bellevue, WA; Boston, MA; Cambridge, MA; New York, NY; Santa Clara, CA; Seattle, WA; Sunnyvale, CA; Pittsburgh, PA. Key job responsibilities We are particularly interested in candidates with expertise in: LLMs, NLP/NLU, Gen AI, Transformers, Fine-Tuning, Recommendation Systems, Deep Learning, NER, Statistics, Neural Networks, Question Answering. In this role, you will work alongside global experts to develop and implement novel, scalable algorithms and modeling techniques that advance the state-of-the-art in areas at the intersection of LLMs and GenAI. You will tackle challenging, groundbreaking research problems on production-scale data, with a focus on recommendation systems, question answering, deep learning and generative AI. The ideal candidate should possess the ability to work collaboratively with diverse groups and cross-functional teams to solve complex business problems. A successful candidate will be a self-starter, comfortable with ambiguity, with strong attention to detail and the ability to thrive in a fast-paced, ever-changing environment. A day in the life - Collaborate with cross-functional teams to tackle complex challenges in natural language processing, computer vision, and generative AI. - Fine-tune state-of-the-art models and develop novel algorithms to push the boundaries of what's possible. - Explore the vast potential of generative AI and its applications across industries. - Attend cutting-edge research seminars and engage in thought-provoking discussions with industry luminaries. - Leverage state-of-the-art computing infrastructure and access to the latest research papers to fuel your innovation. - Present your groundbreaking work and insights to the team, fostering a culture of knowledge-sharing and continuous learning.
US, WA, Seattle
Unlock the Future with Amazon Science! Calling all visionary minds passionate about the transformative power of machine learning! Amazon is seeking boundary-pushing graduate student scientists who can turn revolutionary theory into awe-inspiring reality. Join our team of visionary scientists and embark on a journey to revolutionize the field by harnessing the power of cutting-edge techniques in bayesian optimization, time series, multi-armed bandits and more. At Amazon, we don't just talk about innovation – we live and breathe it. You'll conducting research into the theory and application of deep reinforcement learning. You will work on some of the most difficult problems in the industry with some of the best product managers, scientists, and software engineers in the industry. You will propose and deploy solutions that will likely draw from a range of scientific areas such as supervised, semi-supervised and unsupervised learning, reinforcement learning, advanced statistical modeling, and graph models. Throughout your journey, you'll have access to unparalleled resources, including state-of-the-art computing infrastructure, cutting-edge research papers, and mentorship from industry luminaries. This immersive experience will not only sharpen your technical skills but also cultivate your ability to think critically, communicate effectively, and thrive in a fast-paced, innovative environment where bold ideas are celebrated. Join us at the forefront of applied science, where your contributions will shape the future of AI and propel humanity forward. Seize this extraordinary opportunity to learn, grow, and leave an indelible mark on the world of technology. Amazon has positions available for Machine Learning Applied Science Internships in, but not limited to Arlington, VA; Bellevue, WA; Boston, MA; New York, NY; Palo Alto, CA; San Diego, CA; Santa Clara, CA; Seattle, WA. Key job responsibilities We are particularly interested in candidates with expertise in: Optimization, Programming/Scripting Languages, Statistics, Reinforcement Learning, Causal Inference, Large Language Models, Time Series, Graph Modeling, Supervised/Unsupervised Learning, Deep Learning, Predictive Modeling In this role, you will work alongside global experts to develop and implement novel, scalable algorithms and modeling techniques that advance the state-of-the-art in areas at the intersection of Reinforcement Learning and Optimization within Machine Learning. You will tackle challenging, groundbreaking research problems on production-scale data, with a focus on developing novel RL algorithms and applying them to complex, real-world challenges. The ideal candidate should possess the ability to work collaboratively with diverse groups and cross-functional teams to solve complex business problems. A successful candidate will be a self-starter, comfortable with ambiguity, with strong attention to detail and the ability to thrive in a fast-paced, ever-changing environment. A day in the life - Develop scalable, efficient, automated processes for large scale data analyses, model development, model validation and model implementation. - Design, development and evaluation of highly innovative ML models for solving complex business problems. - Research and apply the latest ML techniques and best practices from both academia and industry. - Think about customers and how to improve the customer delivery experience. - Use and analytical techniques to create scalable solutions for business problems.
US, WA, Seattle
Shape the Future of Human-Machine Interaction Are you a master of natural language processing, eager to push the boundaries of conversational AI? Amazon is seeking exceptional graduate students to join our cutting-edge research team, where they will have the opportunity to explore and push the boundaries of natural language processing (NLP), natural language understanding (NLU), and speech recognition technologies. Imagine waking up each morning, fueled by the excitement of tackling complex research problems that have the potential to reshape the world. You'll dive into production-scale data, exploring innovative approaches to natural language understanding, large language models, reinforcement learning with human feedback, conversational AI, and multimodal learning. Your days will be filled with brainstorming sessions, coding sprints, and lively discussions with brilliant minds from diverse backgrounds. Throughout your journey, you'll have access to unparalleled resources, including state-of-the-art computing infrastructure, cutting-edge research papers, and mentorship from industry luminaries. This immersive experience will not only sharpen your technical skills but also cultivate your ability to think critically, communicate effectively, and thrive in a fast-paced, innovative environment where bold ideas are celebrated.. Join us at the forefront of applied science, where your contributions will shape the future of AI and propel humanity forward. Seize this extraordinary opportunity to learn, grow, and leave an indelible mark on the world of technology. Amazon has positions available for Natural Language Processing & Speech Applied Science Internships in, but not limited to, Bellevue, WA; Boston, MA; Cambridge, MA; New York, NY; Santa Clara, CA; Seattle, WA; Sunnyvale, CA. Key job responsibilities We are particularly interested in candidates with expertise in: NLP/NLU, LLMs, Reinforcement Learning, Human Feedback/HITL, Deep Learning, Speech Recognition, Conversational AI, Natural Language Modeling, Multimodal Learning. In this role, you will work alongside global experts to develop and implement novel, scalable algorithms and modeling techniques that advance the state-of-the-art in areas at the intersection of Natural Language Processing and Speech Technologies. You will tackle challenging, groundbreaking research problems on production-scale data, with a focus on natural language processing, speech recognition, text-to-speech (TTS), text recognition, question answering, NLP models (e.g., LSTM, transformer-based models), signal processing, information extraction, conversational modeling, audio processing, speaker detection, large language models, multilingual modeling, and more. The ideal candidate should possess the ability to work collaboratively with diverse groups and cross-functional teams to solve complex business problems. A successful candidate will be a self-starter, comfortable with ambiguity, with strong attention to detail and the ability to thrive in a fast-paced, ever-changing environment. A day in the life - Develop novel, scalable algorithms and modeling techniques that advance the state-of-the-art in natural language processing, speech recognition, text-to-speech, question answering, and conversational modeling. - Tackle groundbreaking research problems on production-scale data, leveraging techniques such as LSTM, transformer-based models, signal processing, information extraction, audio processing, speaker detection, large language models, and multilingual modeling. - Collaborate with cross-functional teams to solve complex business problems, leveraging your expertise in NLP/NLU, LLMs, reinforcement learning, human feedback/HITL, deep learning, speech recognition, conversational AI, natural language modeling, and multimodal learning. - Thrive in a fast-paced, ever-changing environment, embracing ambiguity and demonstrating strong attention to detail.
US, WA, Seattle
Do you enjoy solving challenging problems and driving innovations in research? Do you want to create scalable optimization models and apply machine learning techniques to guide real-world decisions? We are looking for builders, innovators, and entrepreneurs who want to bring their ideas to reality and improve the lives of millions of customers. As a Research Science intern focused on Operations Research and Optimization intern, you will be challenged to apply theory into practice through experimentation and invention, develop new algorithms using modeling software and programming techniques for complex problems, implement prototypes and work with massive datasets. As you navigate through complex algorithms and data structures, you'll find yourself at the forefront of innovation, shaping the future of Amazon's fulfillment, logistics, and supply chain operations. Imagine waking up each morning, fueled by the excitement of solving intricate puzzles that have a direct impact on Amazon's operational excellence. Your day might begin by collaborating with cross-functional teams, exchanging ideas and insights to develop innovative solutions. You'll then immerse yourself in a world of data, leveraging your expertise in optimization, causal inference, time series analysis, and machine learning to uncover hidden patterns and drive operational efficiencies. Throughout your journey, you'll have access to unparalleled resources, including state-of-the-art computing infrastructure, cutting-edge research papers, and mentorship from industry luminaries. This immersive experience will not only sharpen your technical skills but also cultivate your ability to think critically, communicate effectively, and thrive in a fast-paced, innovative environment where bold ideas are celebrated. Amazon has positions available for Operations Research Science Internships in, but not limited to, Bellevue, WA; Boston, MA; Cambridge, MA; New York, NY; Santa Clara, CA; Seattle, WA; Sunnyvale, CA. Key job responsibilities We are particularly interested in candidates with expertise in: Optimization, Causal Inference, Time Series, Algorithms and Data Structures, Statistics, Operations Research, Machine Learning, Programming/Scripting Languages, LLMs In this role, you will gain hands-on experience in applying cutting-edge analytical techniques to tackle complex business challenges at scale. If you are passionate about using data-driven insights to drive operational excellence, we encourage you to apply. The ideal candidate should possess the ability to work collaboratively with diverse groups and cross-functional teams to solve complex business problems. A successful candidate will be a self-starter, comfortable with ambiguity, with strong attention to detail and the ability to thrive in a fast-paced, ever-changing environment. A day in the life Develop and apply optimization, causal inference, and time series modeling techniques to drive operational efficiencies and improve decision-making across Amazon's fulfillment, logistics, and supply chain operations Design and implement scalable algorithms and data structures to support complex optimization systems Leverage statistical methods and machine learning to uncover insights and patterns in large-scale operations data Prototype and validate new approaches through rigorous experimentation and analysis Collaborate closely with cross-functional teams of researchers, engineers, and business stakeholders to translate research outputs into tangible business impact