How we built Cedar with automated reasoning and differential testing

The new development process behind Amazon Web Services’ Cedar authorization-policy language.

Cedar is a new authorization-policy language used by the Amazon Verified Permissions and AWS Verified Access managed services, and we recently released it publicly. Using Cedar, developers can write policies that specify fine-grained permissions for their applications. The applications then authorize access requests by calling Cedar’s authorization engine. Because Cedar policies are separate from application code, they can be independently authored, updated, analyzed, and audited. 

Related content
CAV keynote lecture by the director of applied science for AWS Identity explains how AWS is making the power of automated reasoning available to all customers.

We want to assure developers that Cedar’s authorization decisions will be correct. To provide that assurance, we follow a two-part process we call verification-guided development when we’re working on Cedar. First, we use automated reasoning to prove important correctness properties about formal models of Cedar’s components. Second, we use differential random testing to show that the models match the production code. In this blog post we present an overview of verification-guided development for Cedar.

A primer on Cedar

Cedar is a language for writing and enforcing authorization policies for custom applications. Cedar policies are expressed in syntax resembling natural language. They define who (the principal) can do what (the action) on what target (the resource) under which conditions (when)?

To see how Cedar works, consider a simple application, TinyTodo, designed for managing task lists. TinyTodo uses Cedar to control who can do what. Here is one of TinyTodo’s policies:

// policy 1
permit(principal, action, resource)
when {
	resource has owner && resource.owner == principal
};

This policy states that any principal (a TinyTodo User) can perform any action on any resource (a TinyTodo List) as long as the resource’s creator, defined by its owner attribute, matches the requesting principal. Here’s another TinyTodo Cedar policy:

// policy 2
permit (
	principal,
	action == Action::"GetList",
	resource
)
when {
	principal in resource.editors || principal in resource.readers
};

This policy states that any principal can read the contents of a task list (Action::"GetList") if that principal is in either the list’s readers group or its editors group. Here is a third policy:

// policy 3
forbid (
	principal in Team::"interns",
	action == Action::"CreateList",
	resource == Application::"TinyTodo"
);

This policy states that any principal who is an intern (in Team::"interns") is forbidden from creating a new task list (Action::"CreateList") using TinyTodo (Application::"TinyTodo").

Related content
Meet Amazon Science’s newest research area.

When the application needs to enforce access, as when a user of TinyTodo issues a command, it only needs to make a corresponding request to the Cedar authorization engine. The authorization engine evaluates the request in light of the Cedar policies and relevant application data. If it returns decision Allow, TinyTodo can proceed with the command. If it returns decision Deny, TinyTodo can report that the command is not permitted.

How do we build Cedar to be trustworthy?

Our work on Cedar uses a process we call verification-guided development to ensure that Cedar’s authorization engine makes the correct decisions. The process has two parts. First, we model Cedar’s authorization engine and validator in the Dafny verification-aware programming language. With Dafny, you can write code, and you can specify properties about what the code is meant to do under all circumstances. Using Dafny’s built-in automated-reasoning capabilities we have proved that the code satisfies a variety of safety and security properties.

Second, we use differential random testing (DRT) to confirm that Cedar’s production implementation, written in Rust, matches the Dafny model’s behavior. We generate millions of diverse inputs and feed them to both the Dafny model and the production code. If both versions always produce the same output, we have a high degree of confidence that the implementation matches the model.

Cedar figure.png
Building Cedar using automated reasoning and differential testing.

Proving properties about Cedar authorization

 Cedar’s authorization algorithm was designed to be secure by default, as exemplified by the following two properties:

  • explicit permit — permission is granted only by individual permit policies and is not gained by error or default;
  • forbid overrides permit — any applicable forbid policy always denies access, even if there is a permit policy that allows it.

With these properties, sets of policies are easier to understand. Policy authors know that permit policies are the only way access is granted, and forbid policies decline access regardless of whether it is explicitly permitted.

Related content
Distributing proof search, reasoning about distributed systems, and automating regulatory compliance are just three fruitful research areas.

Given an authorization request, the Cedar authorization engine takes each Cedar policy and evaluates it after substituting the application request parameters into the principal, action and resource variables. For example, for the request principal= User::”Alice”, action=Action::”GetList”, and resource=List::”AliceList”, substituting for the variables in policy 1 would produce the expression List::”AliceList” has owner && List::”AliceList”.owner == User::”Alice”. If this expression evaluates to true, we say the request satisfies the policy. The authorization engine collects the satisfied forbid and permit policies into distinct sets and then makes its decision.

We model the authorization engine as a Dafny function and use Dafny’s automated-reasoning capabilities to state and prove the explicit-permit and forbid-overrides-permit properties. To see how this helps uncover mistakes, let’s consider a buggy version of the authorization engine:

function method isAuthorized(): Response { // BUGGY VERSION
	var f := forbids();
	var p := permits();
	if f != {} then
		Response(Deny, f)
	else
		Response(Allow, p)
}

The logic states that if any forbid policy is applicable (set f is not the empty set {}), the result should be Deny, thus overriding any applicable permit policies (in set p). Otherwise, the result is Allow. While this logic correctly reflects the desired forbid-overrides-permit property, it does not correctly capture explicit permit. Just because there are no applicable forbid policies doesn’t mean there are any applicable permit policies. We can see this by specifying and attempting to prove explicit permit in Dafny:

// A request is explicitly permitted when a permit policy is satisfied
predicate IsExplicitlyPermitted(request: Request, store: Store) {
	exists p ::
		p in store.policies.policies.Keys &&
		store.policies.policies[p].effect == Permit &&
		Authorizer(request, store).satisfied(p)
}
lemma AllowedIfExplicitlyPermitted(request: Request, store: Store)
ensures // A request is allowed if it is explicitly permitted
	(Authorizer(request, store).isAuthorized().decision == Allow) ==>
	IsExplicitlyPermitted(request, store)
{ ... }

A Dafny predicate is a function that takes arguments and returns a logical condition, and a Dafny lemma is a property to be proved. The IsExplicitlyPermitted predicate defines the condition that there is an applicable permit policy for the given request. The AllowedIfExplicitlyPermitted lemma states that a decision of Allow necessarily means the request was explicitly permitted. This lemma does not hold for the isAuthorized definition above; Dafny complains that A postcondition might not hold on this return path and points to the ensures clause.

Here is the corrected code:

function method isAuthorized(): Response {
	var f := forbids();
	var p := permits();
	if f == {} && p != {} then
		Response(Allow, p)
	else
		Response(Deny, f)
}

Now a response is Allow only if there are no applicable forbid policies, and there is at least one applicable permit policy. With this change, Dafny automatically proves AllowedIfExplicitlyPermitted. It also proves forbid overrides permit (not shown).

Related content
To mark the occasion of the eighth Federated Logic Conference (FloC), Amazon’s Byron Cook, Daniel Kröning, and Marijn Heule discussed automated reasoning’s prospects.

We have used the Cedar Dafny models to prove a variety of properties. Our most significant proof is that the Cedar validator, which confirms that Cedar policies are consistent with the application’s data model, is sound: if the validator accepts a policy, evaluating the policy should never result in certain classes of error. When carrying out this proof in Dafny, we found a number of subtle bugs in the validator’s design that we were able to correct.

We note that Dafny models are useful not just for automated reasoning but for manual reasoning, too. The Dafny code is much easier to read than the Rust implementation. As one measure of this, at the time of this writing the Dafny model for the authorizer has about one-sixth as many lines of code as the production code. Both Cedar users and tool implementers can refer to the Dafny models to quickly understand precise details about how Cedar works.

Differential random testing

Once we have proved properties about the Cedar Dafny model, we want to provide evidence that they hold for the production code, too, which we can do by using DRT to show that the model and the production code behave the same. Using the cargo fuzz random-testing framework, we generate millions of inputs — access requests, accompanying data, and policies — and send them to both the Dafny model engine and the Rust production engine. If the two versions agree on the decision, then all is well. If they disagree, then we have found a bug.

The main challenge with using DRT effectively is to ensure the necessary code coverage by generating useful and diverse inputs. Randomly generated policies are unlikely to mention the same groups and attributes chosen in randomly generated requests and data. As a result, pure random generation will miss a lot of core evaluation logic and overindex on error-handling code. To resolve this, we wrote several input generators, including ones that take care to generate policies, data, and requests that are consistent with one another, while also producing policies that use Cedar’s key language constructs. As of this writing, we run DRT for six hours nightly and execute on the order of 100 million total tests.

Related content
Rungta had a promising career with NASA, but decided the stars aligned for her at Amazon.

The use of DRT during Cedar’s development has discovered corner cases where there were discrepancies between the model and the production code, making it an important tool in our toolkit. For example, there was a bug in a Rust package we were using for IP address operations; the Dafny model exposed an issue in how the package was parsing IP addresses. Since the bug is in an external package, we fixed the problem within our code while we wait for the upstream fix. We also found subtle bugs in the Cedar policy parser, in how the authorizer handles missing application data, and how namespace prefixes on application data (e.g., TinyTodo::List::”AliceList”) are interpreted.

Learn more

In this post we have discussed the verification-guided development process we have followed for the Cedar authorization policy language. In this process, we model Cedar language components in the Dafny programming language and use Dafny’s automated-reasoning capabilities to prove properties about them. We check that the Cedar production code matches the Dafny model through differential random testing. This process has revealed several interesting bugs during development and has given us greater confidence that Cedar’s authorization engine makes correct decisions.

To learn more, you can check out the Cedar Dafny models and differential-testing code on GitHub. You can also learn more about Dafny on the Dafny website and the Cedar service on the Cedar website.

Related content

US, NY, New York
Are you passionate about solving big problems from ground-up? Do you enjoy building new state-of-the-art products at internet scale? Come lead the innovation in this startup team, vertical ad products. This is a green field problem without a known answer or a pattern to follow. We have ambitious vision to simplify full funnel advertising solutions, at scale, with specialized agentic AI-powered models and diversify the demand to strategic verticals including finserv, autos, locals.. etc. We are seeking an experienced Applied Scientist to drive innovation in our Ads Foundational Model. In this individual contributor role, you will apply advanced machine learning techniques to improve advertiser performance and customer experience. Key job responsibilities As an Applied Scientist on this team, you will: 1. Develop and drive the science strategy for Ads Foundational Model (Ads-FM), aligning it with the program's objectives and overall business goals. 2. Identify high-impact opportunities within Ads-FM program and lead the ideation, planning, and execution of science initiatives to address them. 3. Build and deploy machine learning models using computer vision, natural language processing, and deep learning to evaluate and enhance ad effectiveness. 4. Develop algorithms that extract meaningful signals from image, video, and audio content to predict and improve customer engagement 5. Leverage Amazon's extensive data repository to create predictive models that generate actionable recommendations for more compelling ad creative 6. Collaborate with business leaders and cross-functional teams to implement ML-powered solutions 7. Contribute to the ML roadmap for the Ads-FM program through innovation and research.
IN, KA, Bangalore
Amazon’s Last Mile Team is looking for a passionate individual with strong optimization and analytical skills to join its Last Mile Science team in the endeavor of designing and improving the most complex planning of delivery network in the world. Last Mile builds global solutions that enable Amazon to attract an elastic supply of drivers, companies, and assets needed to deliver Amazon's and other shippers' volumes at the lowest cost and with the best customer delivery experience. Last Mile Science team owns the core decision models in the space of jurisdiction planning, delivery channel and modes network design, capacity planning for on the road and at delivery stations, routing inputs estimation and optimization. Our research has direct impact on customer experience, driver and station associate experience, Delivery Service Partner (DSP)’s success and the sustainable growth of Amazon. Optimizing the last mile delivery requires deep understanding of transportation, supply chain management, pricing strategies and forecasting. Only through innovative and strategic thinking, we will make the right capital investments in technology, assets and infrastructures that allows for long-term success. Our team members have an opportunity to be on the forefront of supply chain thought leadership by working on some of the most difficult problems in the industry with some of the best product managers, scientists, and software engineers in the industry. Key job responsibilities Candidates will be responsible for developing solutions to better manage and optimize delivery capacity in the last mile network. The successful candidate should have solid research experience in one or more technical areas of Operations Research or Machine Learning. These positions will focus on identifying and analyzing opportunities to improve existing algorithms and also on optimizing the system policies across the management of external delivery service providers and internal planning strategies. They require superior logical thinkers who are able to quickly approach large ambiguous problems, turn high-level business requirements into mathematical models, identify the right solution approach, and contribute to the software development for production systems. To support their proposals, candidates should be able to independently mine and analyze data, and be able to use any necessary programming and statistical analysis software to do so. Successful candidates must thrive in fast-paced environments, which encourage collaborative and creative problem solving, be able to measure and estimate risks, constructively critique peer research, and align research focuses with the Amazon's strategic needs. As a senior scientist, you will also help coach/mentor junior scientists in the team.
US, WA, Seattle
This role will contribute to developing the Economics and Science products and services in the Fee domain, with specialization in supply chain systems and fees. Through the lens of economics, you will develop causal links for how Amazon, Sellers and Customers interact. You will be a key and senior scientist, advising Amazon leaders how to price our services. You will work on developing frameworks and scaleable, repeatable models supporting optimal pricing and policy in the two-sided marketplace that is central to Amazon's business. The pricing for Amazon services is complex. You will partner with science and technology teams across Amazon including Advertising, Supply Chain, Operations, Prime, Consumer Pricing, and Finance. We are looking for an experienced Principal Economist to improve our understanding of seller Economics, enhance our ability to estimate the causal impact of fees, and work with partner teams to design pricing policy changes. In this role, you will provide guidance to scientists to develop econometric models to influence our fee pricing worldwide. You will lead the development of causal models to help isolate the impact of fee and policy changes from other business actions, using experiments when possible, or observational data when not. Key job responsibilities The ideal candidate will have extensive Economics knowledge, demonstrated strength in practical and policy relevant structural econometrics, strong collaboration skills, proven ability to lead highly ambiguous and large projects, and a drive to deliver results. They will work closely with Economists, Data / Applied Scientists, Strategy Analysts, Data Engineers, and Product leads to integrate economic insights into policy and systems production. Familiarity with systems and services that constitute seller supply chains is a plus but not required. About the team The Stores Economics and Sciences team is a central science team that supports Amazon's Retail and Supply Chain leadership. We tackle some of Amazon's most challenging economics and machine learning problems, where our mandate is to impact the business on massive scale.
US, CA, Pasadena
The Amazon Center for Quantum Computing in Pasadena, CA, is looking to hire an Applied Scientist specializing in the design of microwave components for use in cryogenic environments. Working alongside other scientists and engineers, you will design and validate hardware performing microwave signal conditioning at cryogenic temperatures for Amazon quantum processors. Working effectively within a cross-functional team environment is critical. The ideal candidate will have a proven track record of hardware development from requirements development to validation. Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship and Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Key job responsibilities Our scientists and engineers collaborate across diverse teams and projects to offer state of the art, cost effective solutions for the signal conditioning of Amazon quantum processor systems at cryogenic temperatures. You’ll bring a passion for innovation, collaboration, and mentoring to: Solve layered technical problems across our cryogenic signal chain. Develop requirements with key system stakeholders, including quantum device, test and measurement, hardware, and theory teams. Design, implement, test, deploy, and maintain innovative solutions that meet both performance and cost metrics. Research enabling technologies necessary for Amazon reach commercial viability in quantum computing . A day in the life As you research, design, and implement cryogenic microwave signal conditioning solutions, you will also: Participate in requirements, design, and test reviews. Work cross-functionally to help drive decisions using your unique technical background and skill set. Define and maintain standards for operational excellence. Work in a high-paced, startup-like environment where you are provided the resources to innovate quickly.
US, CA, Pasadena
The Amazon Center for Quantum Computing (CQC) team is looking for a passionate, talented, and inventive Research Engineer specializing in hardware design for cryogenic environments. The ideal candidate should have expertise in 3D CAD (SolidWorks), thermal and structural FEA (Ansys/COMSOL), hardware design for cryogenic applications, design for manufacturing, and mechanical engineering principles. The candidate must have demonstrated experience driving designs through full product development cycles (requirements, conceptual design, detailed design, manufacturing, integration, and testing). Candidates must also have a strong background in both cryogenic mechanical engineering theory and implementation. Working effectively within a cross-functional team environment is critical. Key job responsibilities The CQC collaborates across teams and projects to offer state-of-the-art, cost-effective solutions for scaling the signal delivery to quantum processor systems at cryogenic temperatures. Equally important is the ability to scale the thermal performance and improve EMI mitigation of the cryogenic environment. You will work on the following: - High density novel packaging solutions for quantum processor units - Cryogenic mechanical design for novel cryogenic signal conditioning sub-assemblies - Cryogenic mechanical design for signal delivery systems - Simulation-driven designs (shielding, filtering, etc.) to reduce sources of EMI within the qubit environment. - Own end-to-end product development through requirements, design reports, design reviews, assembly/testing documentation, and final delivery A day in the life As you design and implement cryogenic hardware solutions, from requirements definition to deployment, you will also: - Participate in requirements, design, and test reviews and communicate with internal stakeholders - Work cross-functionally to help drive decisions using your unique technical background and skill set - Refine and define standards and processes for operational excellence - Work in a high-paced, startup-like environment where you are provided the resources to innovate quickly About the team The Amazon Center for Quantum Computing (CQC) is a multi-disciplinary team of scientists, engineers, and technicians, on a mission to develop a fault-tolerant quantum computer. Inclusive Team Culture Here at Amazon, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness. Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Export Control Requirement Due to applicable export control laws and regulations, candidates must be either a U.S. citizen or national, U.S. permanent resident (i.e., current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum, or be able to obtain a US export license. If you are unsure if you meet these requirements, please apply and Amazon will review your application for eligibility.
IT, Turin
As an Applied Scientist in the Alexa AI team, you will spearhead the advancement and deployment of state-of-the-art ML/RAG systems that revolutionize how millions of customers interact with Alexa. You'll leverage your expertise in machine learning, natural language processing, and large language models to create reliable, scalable, high-performance products that set new standards in operational excellence. Working at the intersection of research and production, you'll translate latest AI innovations into customer-facing features that delight users daily. Your work will span the full ML lifecycle—from analyzing customer behavior patterns and building novel metrics for personal digital assistants, to deploying automated training pipelines and conducting rigorous A/B testing across diverse devices and endpoints. Collaborating closely with business, engineering, and science teams across Amazon, you'll lead high-visibility programs that automate workflows and deliver measurable customer impact. This role offers the unique opportunity to publish at top-tier conferences while seeing your innovations scale to one of the world's most popular voice assistants, serving millions of customers globally. Key job responsibilities As an Applied Scientist in the Alexa AI team: - You'll analyze and model customer behavior at scale, building novel metrics for personal digital assistants across diverse devices and endpoints. Your work will involve creating deep learning, policy-based learning, and machine learning algorithms that directly impact customer experiences, translating complex data patterns into actionable insights that drive product innovation. - Your technical leadership will extend to building and deploying automated model training and evaluation pipelines, implementing complex machine learning and deep learning algorithms, and conducting rigorous model and data analysis through online A/B testing. You'll research and implement novel approaches that push the boundaries of what's possible in conversational AI. - Beyond model development, you'll ensure operational excellence by taking ownership of production systems, including on-call responsibilities during peak and non-peak hours. Working alongside Software Development Engineers, you'll deploy fixes and handle high-severity issues, ensuring our ML systems maintain the reliability and performance that millions of Alexa customers depend on daily. A day in the life As an Applied Scientist in the Alexa AI team, your day will involve collaborating with talented engineers and scientists to build scalable solutions for our conversational assistant. You'll dive into data analysis, experiment with novel algorithms, and iterate on models based on real-time user feedback. Working in a fast-paced, ambiguous environment, you'll tackle complex technical challenges—from debugging production issues to presenting research findings to stakeholders. Your self-motivated approach will drive you to swiftly deliver impactful solutions while maintaining the high standards that define our mission to revolutionize user experiences for millions of customers. About the team The Alexa AI team develops the intelligence behind one of the world's most popular voice assistants, serving millions of customers globally. We're a diverse group of scientists, engineers, and researchers united by our mission to make Alexa more natural, helpful, and delightful. Our culture thrives on innovation, collaboration, and customer obsession. We tackle some of the most challenging problems in conversational AI—from natural language understanding to personalization at scale. Here, you'll work alongside world-class talent, publish at top-tier conferences, and see your innovations impact customers daily. We move fast, think big, and celebrate both successes and learnings.
IT, Turin
As an Applied Scientist in the Alexa AI team, you will spearhead the advancement and deployment of state-of-the-art ML/RAG systems that revolutionize how millions of customers interact with Alexa. You'll leverage your expertise in machine learning, natural language processing, and large language models to create reliable, scalable, high-performance products that set new standards in operational excellence. Working at the intersection of research and production, you'll translate latest AI innovations into customer-facing features that delight users daily. Your work will span the full ML lifecycle—from analyzing customer behavior patterns and building novel metrics for personal digital assistants, to deploying automated training pipelines and conducting rigorous A/B testing across diverse devices and endpoints. Collaborating closely with business, engineering, and science teams across Amazon, you'll lead high-visibility programs that automate workflows and deliver measurable customer impact. This role offers the unique opportunity to publish at top-tier conferences while seeing your innovations scale to one of the world's most popular voice assistants, serving millions of customers globally. Key job responsibilities As an Applied Scientist in the Alexa AI team: - You'll analyze and model customer behavior at scale, building novel metrics for personal digital assistants across diverse devices and endpoints. Your work will involve creating deep learning, policy-based learning, and machine learning algorithms that directly impact customer experiences, translating complex data patterns into actionable insights that drive product innovation. - Your technical leadership will extend to building and deploying automated model training and evaluation pipelines, implementing complex machine learning and deep learning algorithms, and conducting rigorous model and data analysis through online A/B testing. You'll research and implement novel approaches that push the boundaries of what's possible in conversational AI. - Beyond model development, you'll ensure operational excellence by taking ownership of production systems, including on-call responsibilities during peak and non-peak hours. Working alongside Software Development Engineers, you'll deploy fixes and handle high-severity issues, ensuring our ML systems maintain the reliability and performance that millions of Alexa customers depend on daily. A day in the life As an Applied Scientist in the Alexa AI team, your day will involve collaborating with talented engineers and scientists to build scalable solutions for our conversational assistant. You'll dive into data analysis, experiment with novel algorithms, and iterate on models based on real-time user feedback. Working in a fast-paced, ambiguous environment, you'll tackle complex technical challenges—from debugging production issues to presenting research findings to stakeholders. Your self-motivated approach will drive you to swiftly deliver impactful solutions while maintaining the high standards that define our mission to revolutionize user experiences for millions of customers. About the team The Alexa AI team develops the intelligence behind one of the world's most popular voice assistants, serving millions of customers globally. We're a diverse group of scientists, engineers, and researchers united by our mission to make Alexa more natural, helpful, and delightful. Our culture thrives on innovation, collaboration, and customer obsession. We tackle some of the most challenging problems in conversational AI—from natural language understanding to personalization at scale. Here, you'll work alongside world-class talent, publish at top-tier conferences, and see your innovations impact customers daily. We move fast, think big, and celebrate both successes and learnings.
IT, Turin
As an Applied Scientist in the Alexa AI team, you will spearhead the advancement and deployment of state-of-the-art ML/RAG systems that revolutionize how millions of customers interact with Alexa. You'll leverage your expertise in machine learning, natural language processing, and large language models to create reliable, scalable, high-performance products that set new standards in operational excellence. Working at the intersection of research and production, you'll translate latest AI innovations into customer-facing features that delight users daily. Your work will span the full ML lifecycle—from analyzing customer behavior patterns and building novel metrics for personal digital assistants, to deploying automated training pipelines and conducting rigorous A/B testing across diverse devices and endpoints. Collaborating closely with business, engineering, and science teams across Amazon, you'll lead high-visibility programs that automate workflows and deliver measurable customer impact. This role offers the unique opportunity to publish at top-tier conferences while seeing your innovations scale to one of the world's most popular voice assistants, serving millions of customers globally. Key job responsibilities As an Applied Scientist in the Alexa AI team: - You'll analyze and model customer behavior at scale, building novel metrics for personal digital assistants across diverse devices and endpoints. Your work will involve creating deep learning, policy-based learning, and machine learning algorithms that directly impact customer experiences, translating complex data patterns into actionable insights that drive product innovation. - Your technical leadership will extend to building and deploying automated model training and evaluation pipelines, implementing complex machine learning and deep learning algorithms, and conducting rigorous model and data analysis through online A/B testing. You'll research and implement novel approaches that push the boundaries of what's possible in conversational AI. - Beyond model development, you'll ensure operational excellence by taking ownership of production systems, including on-call responsibilities during peak and non-peak hours. Working alongside Software Development Engineers, you'll deploy fixes and handle high-severity issues, ensuring our ML systems maintain the reliability and performance that millions of Alexa customers depend on daily. A day in the life As an Applied Scientist in the Alexa AI team, your day will involve collaborating with talented engineers and scientists to build scalable solutions for our conversational assistant. You'll dive into data analysis, experiment with novel algorithms, and iterate on models based on real-time user feedback. Working in a fast-paced, ambiguous environment, you'll tackle complex technical challenges—from debugging production issues to presenting research findings to stakeholders. Your self-motivated approach will drive you to swiftly deliver impactful solutions while maintaining the high standards that define our mission to revolutionize user experiences for millions of customers. About the team The Alexa AI team develops the intelligence behind one of the world's most popular voice assistants, serving millions of customers globally. We're a diverse group of scientists, engineers, and researchers united by our mission to make Alexa more natural, helpful, and delightful. Our culture thrives on innovation, collaboration, and customer obsession. We tackle some of the most challenging problems in conversational AI—from natural language understanding to personalization at scale. Here, you'll work alongside world-class talent, publish at top-tier conferences, and see your innovations impact customers daily. We move fast, think big, and celebrate both successes and learnings.
IT, Turin
As an Applied Scientist in the Alexa AI team, you will spearhead the advancement and deployment of state-of-the-art ML/RAG systems that revolutionize how millions of customers interact with Alexa. You'll leverage your expertise in machine learning, natural language processing, and large language models to create reliable, scalable, high-performance products that set new standards in operational excellence. Working at the intersection of research and production, you'll translate latest AI innovations into customer-facing features that delight users daily. Your work will span the full ML lifecycle—from analyzing customer behavior patterns and building novel metrics for personal digital assistants, to deploying automated training pipelines and conducting rigorous A/B testing across diverse devices and endpoints. Collaborating closely with business, engineering, and science teams across Amazon, you'll lead high-visibility programs that automate workflows and deliver measurable customer impact. This role offers the unique opportunity to publish at top-tier conferences while seeing your innovations scale to one of the world's most popular voice assistants, serving millions of customers globally. Key job responsibilities As an Applied Scientist in the Alexa AI team: - You'll analyze and model customer behavior at scale, building novel metrics for personal digital assistants across diverse devices and endpoints. Your work will involve creating deep learning, policy-based learning, and machine learning algorithms that directly impact customer experiences, translating complex data patterns into actionable insights that drive product innovation. - Your technical leadership will extend to building and deploying automated model training and evaluation pipelines, implementing complex machine learning and deep learning algorithms, and conducting rigorous model and data analysis through online A/B testing. You'll research and implement novel approaches that push the boundaries of what's possible in conversational AI. - Beyond model development, you'll ensure operational excellence by taking ownership of production systems, including on-call responsibilities during peak and non-peak hours. Working alongside Software Development Engineers, you'll deploy fixes and handle high-severity issues, ensuring our ML systems maintain the reliability and performance that millions of Alexa customers depend on daily. A day in the life As an Applied Scientist in the Alexa AI team, your day will involve collaborating with talented engineers and scientists to build scalable solutions for our conversational assistant. You'll dive into data analysis, experiment with novel algorithms, and iterate on models based on real-time user feedback. Working in a fast-paced, ambiguous environment, you'll tackle complex technical challenges—from debugging production issues to presenting research findings to stakeholders. Your self-motivated approach will drive you to swiftly deliver impactful solutions while maintaining the high standards that define our mission to revolutionize user experiences for millions of customers. About the team The Alexa AI team develops the intelligence behind one of the world's most popular voice assistants, serving millions of customers globally. We're a diverse group of scientists, engineers, and researchers united by our mission to make Alexa more natural, helpful, and delightful. Our culture thrives on innovation, collaboration, and customer obsession. We tackle some of the most challenging problems in conversational AI—from natural language understanding to personalization at scale. Here, you'll work alongside world-class talent, publish at top-tier conferences, and see your innovations impact customers daily. We move fast, think big, and celebrate both successes and learnings.
US, NY, New York
We are seeking a Human-Robot Interaction (HRI) Applied Scientist to develop cutting-edge interactions that make robots feel alive, personal, and fun. In this role, you will focus on verbal and non-verbal conversational systems, social dynamics, memory, and long-term relationship formation between robots, their environments, and the people they interact with. Your contributions will be essential in advancing robotics by enabling expressive, socially intelligent, and trustworthy interactions between robots and humans. Key job responsibilities - Develop interactive systems that leverage large language models, multimodal inputs and outputs, reinforcement learning from human feedback, or other advanced techniques to achieve fluid, engaging, and socially appropriate robot behavior - Design and implement intelligent conversational systems that handle turn-taking, grounding, interruption, and incorporates context drawn from a robot's physical environment and shared history with a user - Integrate perceptual sensor streams including gaze, facial expression, gesture, posture, and more to understand social context and produce coherent, lifelike interactions. - Develop memory and personalization systems that allow robots to form lasting relationships with individual users, learn their environments, and adapt their behavior over weeks and months - Stay updated on advancements in HRI, NLP, multimodal AI, and cognitive and social science to apply cutting-edge techniques to robot interaction challenges - Lead technical projects from conception through production deployment - Mentor junior scientists and engineers - Bridge research initiatives with practical engineering implementation