“Robin deals with a world where things are changing all around it”

An advanced perception system, which detects and learns from its own mistakes, enables Robin robots to select individual objects from jumbled packages — at production scale.

Inside an Amazon fulfillment center, as packages roll down a conveyor, the Robin robotic arm goes to work. It dips, picks up a package, scans it, and places it on a small drive robot that routes it to the correct loading dock. By the time the drive has dropped off its package, Robin has loaded several more delivery robots.

While Robin looks a lot like other robotic arms used in industry, its vision system enables it to see and react to the world in an entirely different way.

“Most robotic arms work in a controlled environment,” explained Charles Swan, a senior manager of software development at Amazon Robotics & AI. “If they weld vehicle frames, for example, they expect the parts to be in a fixed location and follow a pre-scripted set of motions. They do not really perceive their environment.

“Robin deals with a world where things are changing all around it. It understands what objects are there — different sized boxes, soft packages, envelopes on top of other envelopes — and decides which one it wants and grabs it. It does all these things without a human scripting each move that it makes. What Robin does is not unusual in research. But it is unusual in production.”

Yet, thanks to machine learning, Robin and its advanced perception system are moving rapidly into production. When Swan began working with the robot in 2021, Amazon was operating only a couple dozen units at its fulfillment centers. Today, Swan’s team is significantly scaling that perception system.

To reach that goal, Amazon Robotics researchers are exploring ways for Robin to achieve unparalleled levels of production accuracy. Because Amazon is so focused on improving the customer experience through timely deliveries, even 99.9% accuracy doesn’t meet the mark for robotics researchers.

Training day

Over the past five years, machine learning has significantly advanced the ability of robots to see, understand, and reason about their environment.

Robin perception testing
Model 1 from October 2021 — The model misses two black packages and one occluded package.

In the past, classical computer vision algorithms systematically segmented scenes into individual elements, a slow and computationally intensive approach. Supervised machine learning has made that process more efficient.

Model 2 from November 2021 — The black packages are detected, but a heavily occluded one is still missed.

“We don’t explicitly say how the model should learn,” said Bhavana Chandrashekhar, a software development manager at Amazon Robotics & AI. “Instead, we give it an input image and say, ‘This is an object.’ Then it tries to identify the object in the image, and we grade how well it does that. Using only that supervised feedback, the model learns how to extract features from the images so it can classify the objects in them.”

Model 3 from February 2022 — All packages are correctly detected.

Robin’s perception system started with pre-trained models that could already identify object elements like edges and planes.

Next, it was taught to identify the type of packages found within the fulfillment center’s sortation area.

Machine learning models learn best when provided with an abundance of sample images. Yet, despite shipping millions of packages daily, Chandrashekhar’s team initially found it hard to find enough training data to capture the enormous variation of the boxes and packages continuously rolling down a conveyor.

“Everything comes in a jumble of sizes and shapes, some on top of the other, some in the shadows,” Chandrashekhar said. “During the holidays, you might see pictures of Minions or Billie Eilish mixed in with our usual brown and white packages. The taping might change.

“Sometimes, the differences between one package and another are hard to see, even for humans. You might have a white envelope on another white envelope, and both are crinkled so you can’t tell where one begins and the other ends,” she explained.

To teach Robin’s model to make sense of what it sees, researchers gathered thousands of images, drew lines around features like boxes, yellow, brown and white mailers, and labels, and added descriptions. The team then used these annotated images to continually retrain the robot.

The training continued in a simulated production environment, with the robot working on a live conveyor with test packages.

Whenever Robin failed to identify an object or make a pick, the researchers would annotate the errors and add them to the training deck. This ongoing training regimen significantly improved the robot’s efficiency.

Continual learning

Robin’s success rate during these tests improved markedly, but the researchers pushed for near perfection. “We want to be really good at these random edge problems, which happen only a few times during testing, but occur more often in the field when we’re running at larger scale,” Chandrashekhar said.

Because of Robin’s high accuracy rate in testing, researchers found it difficult to find enough of those mistakes to create a dataset for further training. “In the beginning, we had to imagine how the robot would make a mistake in order to create the type of data we could use to improve the model,” Chandrashekhar explained.

The Amazon team also monitored Robin’s confidence in its decisions. The perception model might, for example, indicate it was confident about spotting a package, but less confident about assigning it to a specific type of package. Chandrashekhar’s team developed a framework to ensure those low-confidence images were automatically sent for annotation by a human and then added back to the training deck.
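The routing step described above can be sketched in a few lines. This is a minimal illustration, not Amazon's framework: the `Detection` fields, the function names, and the 0.80 threshold are all assumptions made for the example, since the article does not say how confidence is scored or where the cutoff sits.

```python
from dataclasses import dataclass

# Hypothetical cutoff; the article does not state the value used in production.
CONFIDENCE_THRESHOLD = 0.80


@dataclass
class Detection:
    image_id: str
    detect_conf: float  # confidence that a package is present
    type_conf: float    # confidence in the assigned package type


def needs_annotation(det: Detection, threshold: float = CONFIDENCE_THRESHOLD) -> bool:
    """Flag a detection when either confidence score falls below the threshold."""
    return min(det.detect_conf, det.type_conf) < threshold


def route_for_annotation(detections):
    """Split detections into (accepted, queued for human annotation)."""
    accepted, queued = [], []
    for det in detections:
        (queued if needs_annotation(det) else accepted).append(det)
    return accepted, queued


detections = [
    Detection("img-001", detect_conf=0.98, type_conf=0.95),
    # Confident that a package is there, unsure which type it is.
    Detection("img-002", detect_conf=0.97, type_conf=0.55),
]
accepted, queued = route_for_annotation(detections)
print([d.image_id for d in queued])
```

Taking the minimum of the two scores means an image is queued as soon as any part of the prediction is shaky, which matches the case in the text where detection is confident but the package type is not.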

“This is part of continual learning,” says Jeremy Wyatt, senior manager of applied science. “It’s incredibly powerful because every package becomes a learning opportunity. Every robot contributes experiences that help the entire fleet get better.”

That continual learning led to big improvements. “In just six months, we halved the number of packages Robin’s perception system can’t pick and we reduced the errors the perception system makes by a factor of 10,” Wyatt notes.

Still, robots will make mistakes in production that have to be corrected. What happens in the moment if Robin drops a package or puts two mailers on one sortation robot? While most production robots are oblivious to mistakes, Robin is an exception. It monitors its performance for missteps.

Robin’s quality assurance system oversees how it handles packages. If it identifies a problem, it will try to fix it on its own, or call for human intervention if it cannot. “If Robin finds and corrects a mistake, it might lose some time,” Swan explained. “However, if that error wasn’t addressed at all, we might lose a day or two getting that product to the customer.”

Scaling Robin perception

Swan joined the Robin perception team when there were only a few dozen units in production. His goal: scale the perception system to thousands of robotic arms. To accomplish this, Swan’s team doesn’t just focus on catching and annotating errors for continual learning; it also seeks the root cause of those errors.

They rely on Robin perception’s user interface, which lets engineers look through the robot’s eyes and trace how its vision system made the decision. They might, for example, find a Robin that picked up two packages because it could not distinguish one from the other, or another that failed to grab any package owing to a noisy depth signal. Auditing Robin’s decisions lets Amazon Robotics engineers fine-tune the robot’s behaviors.

This is complemented by the metrics derived from a fleet of machines sorting well over 1 million items every day. “Once you have that kind of data, then you can start to look for correlations,” Swan said. “Then you can say the latency in making a decision is related to this property of the machine or this property of the scene and that’s something we can focus on.”
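As a rough illustration of the kind of correlation Swan describes, the sketch below computes a Pearson coefficient between one scene property and decision latency. The records are made-up numbers, and the choice of "packages visible in the scene" as the property is an assumption for the example; the article does not say which properties or statistics the team actually uses.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (spread_x * spread_y)


# Illustrative, invented records: (packages visible in scene, decision latency in ms).
records = [(3, 110), (8, 150), (15, 240), (22, 310), (30, 420)]
clutter = [r[0] for r in records]
latency_ms = [r[1] for r in records]

r = pearson(clutter, latency_ms)
print(f"correlation between clutter and latency: {r:.3f}")
```

A coefficient near 1 in data like this would suggest that cluttered scenes are worth a closer look, though correlation alone does not establish the cause.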

Fleet metrics provide data about a greater range of scenes and problems than any one machine would ever see, from a broken light to an address label stuck on the conveyor belt. That data, used to retrain Robin every few days, gives it a much broader understanding of the world in which it works.

The Robin robotic arm sorts packages

It also helps Amazon improve efficiency. Before Robin picks up a package, it must first segment a cluttered scene, decide which package it will grab, calculate how it will approach the package, and choose how many of its eight suction cups to use to pick it up. Choose too many and it might lift more than one package; too few, and it could drop its cargo.

That decision requires much more than computer vision. “Making decisions on what and where to grasp is accomplished with a combination of learning systems, optimization, geometric reasoning, and 3D understanding,” explained Nick Hudson, principal applied scientist with Amazon Robotics AI. “There are a lot of components which interact, and they all need to accommodate the variations seen across different sites and regions.”
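To make the suction-cup tradeoff concrete, here is a deliberately simplified geometric sketch: activate only the cups that sit wholly on the target package. The 2 x 4 cup layout, the dimensions, and the rule itself are assumptions invented for this example. The article says only that Robin has eight cups and that the real decision combines learning, optimization, and 3D reasoning.

```python
# Hypothetical 2 x 4 cup layout, in centimetres relative to the gripper centre.
CUP_OFFSETS = [(x, y) for y in (-3.0, 3.0) for x in (-9.0, -3.0, 3.0, 9.0)]
CUP_RADIUS = 2.0  # assumed cup radius in centimetres


def cups_inside(box_w, box_h, offsets=CUP_OFFSETS, radius=CUP_RADIUS):
    """Indices of cups lying wholly on a box_w x box_h package centred under the gripper."""
    half_w, half_h = box_w / 2, box_h / 2
    return [
        i
        for i, (x, y) in enumerate(offsets)
        if abs(x) + radius <= half_w and abs(y) + radius <= half_h
    ]


print(len(cups_inside(30, 20)))  # large box: every cup fits
print(cups_inside(12, 12))       # small mailer: only the inner cups fit
```

Cups that overhang the package edge are the ones that could grab a neighbouring item, so leaving them off addresses the "too many" failure, while using every cup that does fit guards against the "too few" one.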

“There is always a tradeoff between efficiency and good decisions,” Swan continued. “That was a major scaling challenge. We did a lot of experimentation offline with very cluttered scenes and other situations that slowed the robots down to improve our algorithms. When we liked them, we would run them on a small portion of the fleet. If they did well, we would roll them out to all the robots.”
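The rollout pattern Swan describes (try a change on a small portion of the fleet, then promote it if it does well) can be sketched as follows. The robot IDs, the 5% fraction, the success-rate metric, and both function names are hypothetical; the article does not describe how Amazon selects robots or judges that an update "did well."

```python
import random


def pick_canaries(robot_ids, fraction, seed=0):
    """Choose a reproducible random subset of the fleet to receive the update first."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    rng = random.Random(seed)
    count = max(1, round(len(robot_ids) * fraction))
    return set(rng.sample(sorted(robot_ids), count))


def should_roll_out(canary_success, baseline_success, margin=0.0):
    """Promote the update only if canaries did at least as well as the baseline."""
    return canary_success >= baseline_success - margin


fleet = [f"robin-{i:03d}" for i in range(100)]
canaries = pick_canaries(fleet, fraction=0.05)
print(sorted(canaries))

# Illustrative pick success rates, not real measurements.
print(should_roll_out(canary_success=0.995, baseline_success=0.990))
print(should_roll_out(canary_success=0.980, baseline_success=0.990))
```

Seeding the random choice keeps the canary group stable between runs, which makes it easier to compare the same robots before and after an update.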

Those rollouts were also made possible because the software was rewritten to support regular updates, said Sicong Zhao, a software development manager. “The software is modular. That way, we can upgrade one component without affecting the others. It also enables multiple groups to work on different improvements at the same time.” That modularity has enabled key parts of the perception system to be automatically retrained twice a week.

Nor was that a simple task. Robin had many tens of thousands of lines of code, so it took Zhao’s team months to understand how those lines interacted with one another well enough to modularize their components. The effort was worth it. It made Robin easier to upgrade and will ultimately enable automatic fleet updates as frequently as needed while mitigating operational disruptions.

Next-generation robot perception

Those continuous improvements are essential to deploy Robin at Amazon’s scale, Swan explained. The team’s goal is to update the fleet of Robin robots automatically several times weekly.

“We are increasing our usage of Robin,” Swan said. “To do that, we must continue to improve Robin’s ability to handle those random edge cases, so it never mis-sorts, has great motion planning, and moves at the fastest safe speed its arm can handle — all with time to spare.”

That means even more innovation. Take, for example, package recognition. Robin’s perception system needs to be able to spot a pile of packages and know to start with the top one to avoid upending the pile. “Robin has a sense of how to do that as well, but we need machine learning to accelerate the way Robin decides which one it is most likely to pick up successfully as we keep adding new types of packaging,” Zhao explained.

Chandrashekhar believes more powerful digital simulations, based on the physics of robot and package movement, will enable faster innovation. “This is very difficult when we’re talking about deformable packages, like a water bottle in a soft mailer,” she said. “But we’re getting a lot closer.”

Longer-term, she wants to see self-learning robots that teach themselves to make fewer mistakes and to recover from them faster. Self-learning will also make the robots easier to use. “Deploying a robot shouldn’t require a PhD,” Swan said.

“There is a unique opportunity to have this fleet adapt automatically,” agreed Hudson. “There are open questions on how to accomplish this, including whether individual robots should adapt on their own. The fleet already updates its object understanding using data collected worldwide. How can we also have the individual robots adapt to issues they are seeing locally – for instance if one of the suction cups is blocked or torn?”

Ultimately, though, Swan would like to use what Amazon Robotics researchers have learned to create new types of robots. “We’ve only scratched the surface of what’s possible with robots,” he said.

Research areas

Related content

  • Staff writer
    December 29, 2025
    From foundation model safety frameworks and formal verification at cloud scale to advanced robotics and multimodal AI reasoning, these are the most viewed publications from Amazon scientists and collaborators in 2025.
  • Staff writer
    December 29, 2025
    From quantum computing breakthroughs and foundation models for robotics to the evolution of Amazon Aurora and advances in agentic AI, these are the posts that captured readers' attention in 2025.
  • Staff writer
    October 21, 2025
    Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.
GB, London
We are looking for a passionate, talented, and inventive Data Scientist with a strong machine learning and analytics background to help build industry-leading language technology powering Rufus, our AI-driven search and shopping assistant, helping customers with their shopping tasks at every step of their shopping journey. This innovative role focuses on developing and optimizing large language model (LLM)-powered conversational experiences. The core emphasis is to get the best performance out of state-of-the-art LLMs via careful and methodical instruction design, contextual grounding, informed choices of MCP tools and agent/multi-agent systems, evaluation frameworks, and experimentation to systematically improve LLM quality, robustness, and customer impact. The work combines scientific rigor with product intuition to systematically raise the bar for conversational AI performance at Amazon scale. Our mission in conversational shopping is to make it easy for customers to find and discover the best products to meet their needs by helping with their product research, providing comparisons and recommendations, answering product questions, enabling shopping directly from images or videos, providing visual inspiration, and more. We do this by leveraging advanced analytics, Natural Language Processing (NLP), Machine Learning (ML), A/B testing, causal inference, and data-driven insights to continuously improve our systems. Key job responsibilities As a Data Scientist on our team, you will develop and maintain LLM instructions iterations and evaluation frameworks, including automated eval pipelines, LLM-as-a-judge methodologies, rubric design, and dataset curation to measure nuanced aspects of response quality. You will partner with the wider org to experiment with techniques such as retrieval augmentation, context enrichment, prompt decomposition, and model fine-tuning or post-training strategies, if and when applicable. 
You will leverage petabytes of data and identify opportunities to leverage machine learning models aimed at making conversational systems more performant. A day in the life You will: Perform hands-on analysis of large-scale multimodal interaction datasets to develop insights into how customers engage with conversational AI systems and how to improve response quality and customer experience. Use statistical methods, experimentation, and data-driven analysis to develop scalable approaches for measuring, evaluating, and optimizing large language model (LLM)-based shopping assistant systems, leveraging structured and unstructured contextual signals. Design and analyze A/B tests and experiments to evaluate new features and model improvements, ensuring statistical rigor and actionable insights. Develop metrics, dashboards, and reporting frameworks to monitor system performance, customer engagement, and business impact. Conduct deep-dive analyses to identify opportunities for improving conversational relevance, grounding, customer satisfaction, and downstream business impact. Collaborate with Applied Scientists and Engineers to translate analytical insights into production systems, working closely on model evaluation and deployment. Establish automated processes for large-scale data analysis, ETL pipelines, metric generation, and experimentation frameworks. Communicate results and insights to both technical and non-technical audiences, including through presentations, written reports, and data visualizations. About the team The Rufus Features Science team, based in London, works alongside ~150 engineers, designers and product managers, shaping the future of AI-driven shopping experiences at Amazon. 
The team works on every aspect of the Rufus AI, from making Rufus agentic, enabling customers to set price alerts or empower Rufus to act on their behalf and automatically purchase products when the price is right, to understanding multimodal user queries and generating answers that combine text, image, audio and video, including deep research reports that scour the web and the Amazon catalog to provide detailed and personalised shopping guidance. We utilize and advance state-of-art techniques in the fields of Natural Language Processing, gen AI, Information Retrieval, Machine/Deep Learning, and Data Mining. We validate our work by actively participating in the internal and external scientific communities.
CN, 44, Shenzhen
职位:Applied scientist 应用科学家实习生 毕业时间:2026年10月 - 2027年7月之间毕业的应届毕业生 · 入职日期:2026年6月及之前 · 实习时间:保证一周实习4-5天全职实习,至少持续3个月 · 工作地点:深圳福田区 投递须知: 1 填写简历申请时,请把必填和非必填项都填写完整。提交简历之后就无法修改了哦! 2 学校的英文全称请准确填写。中英文对应表请查这里(无法浏览请登录后浏览)https://docs.qq.com/sheet/DVmdaa1BCV0RBbnlR?tab=BB08J2 关于职位 Amazon Device &Services Asia团队正在寻找一位充满好奇心、善于沟通的应用科学家实习生,成为连接前沿AI研究与现实世界认知的桥梁。这是一个独特的角色——既需要动手参与机器学习项目,又要接受将复杂AI概念转化为通俗易懂内容的创意挑战。D&S Asia是亚马逊设备与服务业务在亚洲的支柱组织,自2009年支持Kindle制造起步,现已发展为横跨软硬件、AI(Alexa)及智能家居(Ring/Blink)的综合性团队,持续驱动区域业务创新与人才发展。 你将做什么 • 解密AI: 将复杂的技术发现转化为直观的解释、博客文章、教程或互动演示,让非技术背景的业务方和更广泛的社区都能理解 • 技术叙事: 与工程团队协作,以清晰、引人入胜的方式记录AI的能力与局限性 • 知识共享: 协助开发内部工作坊或"AI入门"课程,提升跨职能团队(产品、设计、商务)的AI素养 • 保持前沿: 持续学习并整合最新突破(如大语言模型、扩散模型、智能体),为团队输出简明易懂的趋势简报 • 研究与应用: 参与端到端的应用研究项目,从文献综述到原型开发,涵盖自然语言处理、计算机视觉或多模态AI领域
US, MA, N.reading
Amazon Industrial Robotics Group is seeking exceptional talent to help develop the next generation of advanced robotics systems that will transform automation at Amazon's scale. We're building revolutionary robotic systems that combine cutting-edge AI, sophisticated control systems, and advanced mechanical design to create adaptable automation solutions capable of working safely alongside humans in dynamic environments. This is a unique opportunity to shape the future of robotics and automation at an unprecedented scale, working with world-class teams pushing the boundaries of what's possible in robotic dexterous manipulation, locomotion, and human-robot interaction. This role presents an opportunity to shape the future of robotics through innovative applications of deep learning and large language models. At Amazon Industrial Robotics Group, we leverage advanced robotics, machine learning, and artificial intelligence to solve complex operational challenges at an unprecedented scale. Our fleet of robots operates across hundreds of facilities worldwide, working in sophisticated coordination to fulfill our mission of customer excellence. We are pioneering the development of dexterous manipulation system that: - Enables unprecedented generalization across diverse tasks - Enables contact-rich manipulation in different environments - Seamlessly integrates low-level skills and high-level behaviors - Leverage mechanical intelligence, multi-modal sensor feedback and advanced control techniques. The ideal candidate will contribute to research that bridges the gap between theoretical advancement and practical implementation in robotics. You will be part of a team that's revolutionizing how robots learn, adapt, and interact with their environment. Join us in building the next generation of intelligent robotics systems that will transform the future of automation and human-robot collaboration. 
A day in the life - Lead design and implementation of methods for Visual SLAM, navigation and spatial reasoning - Leverage simulation and real-world data collection to create large datasets for model development - Develop a hierarchical system that combines low-level control with high-level planning - Collaborate effectively with multi-disciplinary teams to co-design hardware and algorithms for dexterous manipulation
US, WA, Seattle
Amazon Prime is looking for an ambitious Economist Intern to help create econometric insights for world-wide Prime. Prime is Amazon's premiere membership program, with over 200M members world-wide. This role is at the center of many major company decisions that impact Amazon's customers. These decisions span a variety of industries, each reflecting the diversity of Prime benefits. These range from fast-free e-commerce shipping, digital content (e.g., exclusive streaming video, music, gaming, photos), reading, healthcare, and grocery offerings. Prime Science creates insights that power these decisions. As an economist intern in this role, you will create statistical tools that embed causal interpretations. You will utilize massive data, state-of-the-art scientific computing, econometrics (causal, counterfactual/structural, experimentation), and machine-learning, to do so. Some of the science you create will be publishable in internal or external scientific journals and conferences. You will work closely with a team of economists, applied scientists, data professionals (business analysts, business intelligence engineers), product managers, and software/data engineers. You will create insights from descriptive statistics, as well as from novel statistical and econometric models. You will create internal-to-Amazon-facing automated scientific data products to power company decisions. You will write strategic documents explaining how senior company leaders should utilize these insights to create sustainable value for customers. These leaders will often include the senior-most leaders at Amazon. The team is unique in its exposure to company-wide strategies as well as senior leadership. It operates at the research frontier of utilizing data, econometrics, artificial intelligence, and machine-learning to form business strategies. 
A successful candidate will have demonstrated a capacity for building, estimating, and defending statistical models (e.g., causal, counterfactual, machine-learning) using software such as R, Python, or STATA. They will have a willingness to learn and apply a broad set of statistical and computational techniques to supplement deep training in one area of econometrics. For example, many applications on the team motivate the use of structural econometrics and machine-learning. They rely on building scalable production software, which involves a broad set of world-class software-building skills often learned on-the-job. As a consequence, already-obtained knowledge of SQL, machine learning, and large-scale scientific computing using distributed computing infrastructures such as Spark-Scala or PySpark would be a plus. Additionally, this candidate will show a track-record of delivering projects well and on-time, preferably in collaboration with other team members (e.g. co-authors). Candidates must have very strong writing and emotional intelligence skills (for collaborative teamwork, often with colleagues in different functional roles), a growth mindset, and a capacity for dealing with a high-level of ambiguity. Endowed with these traits and on-the-job-growth, the role will provide the opportunity to have a large strategic, world-wide impact on the customer experiences of Prime members.
US, WA, Bellevue
The Mission Build AI safety systems that protect millions of Alexa customers every day. As conversational AI evolves, you'll solve challenging problems in Responsible AI by ensuring LLMs provide safe, trustworthy responses, building AI systems that understand nuanced human values across cultures, and maintaining customer trust at scale. What You'll Build You'll pioneer breakthrough solutions in Responsible AI at Amazon's scale. Imagine training models that set new safety standards, designing automated testing systems that hunt for vulnerabilities before they surface, and certifying the systems that power millions of daily conversations. You'll create intelligent evaluation systems that judge responses with human-level insight, build models that truly understand what makes interactions safe and delightful, and craft feedback mechanisms that help Alexa+ grasp the nuances of complex customer conversations. Here's where it gets even more exciting: you'll build AI agents that act as your team's safety net—automatically detecting and fixing production issues in real-time, often before anyone notices there was a problem. Your innovations won't just improve Alexa+; they'll fundamentally shape how it learns, evolves, and earns customer trust. As Alexa+ continues to delight customers, your work ensures it becomes more trustworthy, safer, and deeply aligned with customer needs and expectations. Your work directly protects customer trust at Amazon's scale. Every innovation you create—from novel safety mechanisms to sophisticated evaluation techniques—shapes how millions of people interact with AI confidently. You're not just building products; you're defining industry standards for responsible AI. This is frontier research with immediate real-world impact. 
You'll tackle problems that require innovative solutions: training models that remain truthful and grounded across diverse contexts, building reward models that capture the nuanced spectrum of human values across cultures and languages, and creating automated systems that continuously discover and address potential issues before customers encounter them. You'll collaborate with world-class scientists, product managers, and engineers to transform state-of-the-art ideas into production systems serving millions. What We're Looking For * Deep expertise in state-of-the-art NLP and Large Language Models * Track record of building scalable ML systems * Passion for impactful research—where frontier science meets real-world responsibility at scale * Excitement about solving problems that will shape the future of AI Ready to work on AI safety challenges that define the industry? Join us. Key job responsibilities This is where you'll make your mark. You'll architect breakthrough Responsible AI solutions that become industry benchmarks, pioneering algorithms that eliminate false information, designing frameworks that hunt down vulnerabilities before bad actors find them, and developing models that understand human values across every culture we serve. Working with world-class engineers and scientists, you'll push the boundaries of model training—transforming bold research into production systems that protect millions of customers daily while withstanding attacks and delivering exceptional experiences. But here's what makes this role truly special: you'll shape the future. You'll lead certification processes, advance optimization techniques, build evaluation systems that reason like humans, and mentor the next generation of AI safety experts. Every innovation you drive will set new standards for trustworthy AI at the world's largest scale. 
A day in the life As a Responsible AI Scientist, you're at the frontier of AI safety—experimenting with breakthrough techniques that push the boundaries of what's possible. You partner with engineering to transform research into production-ready solutions, tackling complex optimization challenges. You brainstorm with Product teams, translating ambitious visions into concrete objectives that drive real impact. Your expertise shapes critical deployment decisions as you review impactful work and guide go/no-go calls. You mentor the next generation of AI safety leaders, watching ideas spark and capabilities grow. This is where science meets impact—building AI that's not just intelligent, but trustworthy and aligned with human values. About the team Our team pioneers Responsible AI for conversational assistants. We ensure Alexa delivers safe, trustworthy experiences across all devices, modalities, and languages worldwide. We work on frontier AI safety challenges—and we're looking for scientists who want to help shape the future of trustworthy AI.
US, WA, Bellevue