"This technology will be transformative in ways we can barely comprehend"

A judge and some of the finalists from the Alexa Prize Grand Challenge 3 talk about the competition, the role of COVID-19, and the future of socialbots.

Human beings are social creatures, and conversations are what connect us—they enable us to share everything from the prosaic to the profound with the people that matter to us. Living through an era marked by pandemic-induced isolation means many of those conversations have shifted online, but the connection they provide remains essential.

So what happens when you replace one of the human participants in a conversation with a socialbot? What does it mean to have an engaging conversation with an AI assistant? How can that kind of conversation prove to be valuable, and can it provide its own kind of connection?

The participants in this year’s Alexa Prize contest are driven by those questions. Amazon recently announced that a team from Emory University has won the 2020 Alexa Prize. We talked to that team, along with a judge from this year’s competition, as well as representatives from the other finalist teams at Czech Technical University, Stanford University, University of California, Davis, and University of California, Santa Cruz. We wanted to learn what drives them to participate, how COVID-19 has influenced their work and what they see as the possibilities and challenges for socialbots moving forward.

Q: What inspired you to participate in this year’s competition?

Sarah Fillwock, team leader, Emora, Emory University: We had a group of students who were interested in dialogue system research, some of whom had actually participated in the Alexa Prize in its previous years, and we all knew that the Alexa Prize offers a really unique opportunity for anyone interested in this type of work. It is really exciting to use the Alexa device platform to launch a socialbot, because we are able to get hundreds of conversations a day between our socialbot and human users, which really allows for quick turnaround time when assessing whether or not our hypotheses and strategies are improving the performance of our dialogue system.

Marilyn Walker, faculty advisor, Athena, University of California, Santa Cruz: In our Natural Language and Dialogue Systems lab, our main research focus is dialogue management and language generation. Conversational AI is a very challenging problem, and we felt like we could have a research impact in this area. The field has been developing extremely quickly recently, and the Alexa Prize offers an opportunity to try out cutting-edge technologies in dialogue management and language generation on a large Alexa user population.

Amazon Alexa Prize Finalists 2020
The five Alexa Prize finalist teams: Czech Technical University in Prague; Emory University; Stanford University; the University of California, Davis; and the University of California, Santa Cruz.

Vrindavan (Davan) Harrison, team leader, Athena, UCSC: As academics, our primary focus is on research. This year’s competition aimed at being more research-oriented, allowing the teams to spend more time on developing new ideas.

Kai-Hui Liang, team lead, Gunrock, University of California, Davis: Our experience in last year’s competition motivated us to join again as we realized there is still a lot of room for improvement. I’m especially interested in how to find topics that engage users the most, including trying different ways to elicit and reason about users’ interests. How can we retrieve content that is relevant and interesting, and make the dialog flow more naturally?

Jan Pichl, team leader, Alquist, Czech Technical University: Since the first year of the Alexa Prize competition, we have been developing Alquist to deliver a wide range of topics with a closer focus on the most popular ones. The first Alquist guided a user through the conversation quite strictly. We learned quickly that we needed to introduce more flexibility and let the user be "in charge". With that in mind, we have been pushing Alquist in that direction. Moreover, we want Alquist to manage dialogue utilizing the knowledge graph, and suggest relevant information based on the previously discussed topics and entities.

Christopher D. Manning, faculty advisor, Chirpy Cardinal, Stanford University: It was our first time doing the Alexa Prize, and the team really hadn’t done advance preparation, so it’s all been a wild ride—by which I mean a lot of work and stress for everyone on the team. But it was super exciting that we were largely able to catch up with other leading teams who have been doing the competition for several years.

Hugh Howey, judge and science fiction author: Artificial intelligence is a passionate interest of mine. As a science fiction author, I have the freedom to write about most anything, but the one topic I keep coming back to is the impact that thinking machines already have on our lives and how that impact will only expand in the future. So any chance to be involved with those doing work and research in the field is a no-brainer for me. I leapt at the chance like a Boston Dynamics dog.

Q: What excites you about the potential of socialbots?

Hugh Howey (Judge): This technology will be transformative in ways we can barely comprehend. Right now, the human/computer interface is a bottleneck. It takes a long time for us to tell our computers what we want them to do, and they'll generally only do that thing the one time and forget what they learned. In the future, more and more of the trivial will be automated. This will free up human capital to tackle larger problems. It will also bring us together by removing language barriers, by helping those with disabilities, and eventually this technology will be available to anyone who needs it.

Jinho D. Choi, faculty advisor, Emory: It has been reported that more than 44 million adults in the US have mental health issues such as anxiety or depression. We believe that developing an innovative socialbot that comforts people can really help those with mental health conditions, who are generally afraid of talking to other human beings. You may wonder how artificial intelligence can convey a human emotion such as caring. However, humans have used their own creations, such as art and music, to comfort themselves. It is our vision to advance AI, the greatest invention of humankind, to help individuals learn more about their inner selves so they can feel more positive about themselves, and have a bigger impact in the world.

Ashwin Paranjape, co-team leader, Stanford: As socialbots become more sophisticated and prevalent, increasing numbers of people are chatting with them regularly. As the name suggests, socialbots have the potential to fulfill social needs, such as chit-chatting about everyday life, or providing support to a person struggling with mental health difficulties. Furthermore, socialbots could become a primary user interface through which we engage with the world—for example, chatting about the news, or discussing a book.

Sarah Fillwock, Emory: Our experience in this competition has really solidified this idea of the potential of socialbots being of value to people who need support and are in troubling situations. I think that the most compelling role for socialbots in global challenges is to provide a supportive environment to allow people to express themselves, and explore their feelings with regard to whatever dramatic event is going on. This is especially important for vulnerable populations, such as those who do not have a strong social circle or have reduced social contact with others, prohibiting them from being able to achieve the feeling of being valued and understood.

Q: What are the main challenges to realizing that potential?

Abigail See, co-team leader, Stanford: Currently, socialbots struggle to make sense of long, involved conversations, and this limits their ability to talk about any topic in depth. To do this better, socialbots will need to understand what a particular user wants—not only in terms of discussion topics, but also what kind of conversation they want to have. Another important challenge is to allow users to take more initiative, and drive the conversation themselves. Currently, socialbots tend to take more initiative, to ensure the conversation stays within their capabilities. If we can make our socialbots more flexible, they will be much more useful and engaging to people.

Sarah Fillwock, Emory: One major challenge facing the field of dialogue system research is establishing a best practice for evaluation of the performance of dialogue approaches. There is currently a diverse set of evaluation strategies that the research community uses to determine how well their new dialogue approach performs. Another challenge is that dialogues are more than just a pattern-matching problem. Having a back-and-forth dialogue on any topic with another agent tends to involve planning towards achieving specific goals during the conversation as new information about your speaking partner is revealed. Dialogues also rely a lot on having a foundation of general world knowledge that you use to fully understand the implications of what the other person is saying.

Marilyn Walker, UCSC: There’s a shortage of large annotated conversational corpora for the task of open-domain conversation. For example, progress in NLU has been supported by large annotated corpora, such as the Penn Treebank; however, there are currently no such publicly available corpora for open-domain conversation. Also, a rich model of individual users would enable much more natural conversations, but privacy issues currently make it difficult to build such models.

Hugh Howey (Judge): The challenge will be for our ethics and morality to keep up with our gizmos. We will be far more powerful in the future. I only hope we'll be more responsible as well.

Q: What role has the COVID-19 pandemic played in your work?

Jurik Juraska, team member, UCSC: The most immediate effect the onset of the pandemic had on our socialbot was, of course, that it could not just ignore this new dynamic situation. Our socialbot had to acknowledge this new development, as that was what most people were talking about at that point. We would thus have Athena bring up the topic at the beginning of the conversation, sympathizing with the users' current situation, but avoiding wallowing in the negative aspects of it. In the feedback that some users left, there were a number of expressions of gratitude for the ability to have a fun interaction with a socialbot at a time when direct social interaction with friends and family was greatly restricted.

Kai-Hui Liang, UC Davis: We noticed an evident difference in the way Alexa users reacted to popular topics. For example, before COVID-19, many users gave engaging responses when discussing their favorite sports to watch, their travel experiences, or activities they planned for the weekend. After the outbreak of COVID-19, more users replied saying there are no sports games to watch or they are not able to travel. Therefore, we adapted our topics to better fit the situation. We added discussion about their life experience during the quarantine (e.g., how their diet has changed or if they walk outside daily to stay healthy). We also observed more users having negative feelings potentially due to the quarantine. For instance, some users said they feel lonely and they miss their friends or family. Therefore, we enhanced our comforting module that expresses empathy through active listening.

Abigail See, Stanford: As the pandemic unfolded, we saw in real time how users changed their expectations of our socialbot. Not only did they want our bot to deliver up-to-date information, they also wanted it to show emotional understanding for the situation they were in.

Sarah Fillwock, Emory: When COVID became a significant societal issue, we tried two things: we had an experience-oriented COVID topic where our bot discussed with people how they felt about COVID in a sympathetic and reassuring atmosphere, and we had a fact-oriented COVID topic that gave objective information. What we observed was that people had a much stronger positive reaction to the experience-oriented COVID-19 approach than the fact-oriented COVID-19 approach, and seemed to prefer it when talking. This really gave us some empirical evidence that social agents have a strong potential to be helpful in times of turmoil by giving people a safe and caring space to talk about these major events in their life, since people responded positively to our approach to doing this.

Q: Lastly, are there any particular advancements in the fields of NLU, dialogue management, conversational AI, etc., that you find promising?

Jan Pichl, Czech Technical University: It is exciting to see the capabilities of the Transformer-based models these days. They are able to generate large articles or even whole stories that are coherent. However, they demand a lot of computational power during the training phase and even at runtime. Additionally, it is still challenging to use them in a socialbot when you need to work with constantly changing information about the world.

Abigail See, Stanford: As NLP researchers, we are amazed by the incredible pace of progress in the field. Since the last Alexa Prize in 2018, there have been game-changing advancements, particularly in the use of large pretrained language models to understand and generate language. The Alexa Prize offers a unique opportunity for us to apply these techniques, which so far have mostly been tested only on neat, well-defined tasks, and put them in front of real people, with all the messiness that entails! In particular, we were excited to explore the possibility of using neural generative models to chat with people. As recently as the 2018 Alexa Prize, these models generally performed poorly, and so were not used by any of the finalist teams. However, this year, these systems became an important backbone of our system.

Sarah Fillwock, Emory: The work people have been putting into incorporating common sense knowledge and common sense reasoning into dialogue systems is one of the most interesting directions of the current conversational AI field. A lot of the common sense knowledge we use is not explicitly detailed in any type of data set, as people have learned it through physical experience or inference over time, so there isn’t necessarily any convenient way to currently accomplish this goal. There have been a lot of attempts to see how far a language modeling approach to dialogue agents can go, but even using huge dialogue data sets and highly complex models still results in hit-and-miss success with common sense information. I am really looking forward to the dialogue approaches and dialogue resources that more explicitly try to model this type of common sense knowledge.

Research areas

Latest news

The latest updates, stories, and more about Alexa Prize.
US, WA, Seattle
Do you want to work on Reinforcement Learning (RL) post-training of frontier Large Language Models (LLMs) to revolutionize customer service? Come join the world class researchers and academics in the AWS AI endeavor, and develop the science that powers countless new businesses in cloud computing! AWS, the world-leading provider of cloud services. Our customers bring problems that will give Applied Scientists like you endless opportunities to see your research have a positive and immediate impact in the world. You will have the opportunity to partner with technology and business teams to solve real-world problems, have access to virtually endless data and computational resources, and to world-class engineers and developers that can help bring your ideas into the world. As part of the team, we expect that you will develop innovative solutions to hard problems, and publish your findings at peer reviewed conferences and journals. The scientific topics you are going to work on include, but are not limited to: LLM post-training to improve capabilities particularly for instruction following, reasoning over long context, and tool use, etc. About the team Why AWS Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Work/Life Balance We value work-life harmony. 
Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Mentorship and Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
US, MA, North Reading
Are you inspired by invention? Is problem solving through teamwork in your DNA? Do you like the idea of seeing how your work impacts the bigger picture? Answer yes to any of these and you’ll fit right in here at Amazon Robotics. We are a smart team of doers that work passionately to apply advances in robotics and software to solve real-world challenges that will transform our customers’ experiences in ways we can’t even imagine yet. We invent new improvements every day. We are Amazon Robotics and we will give you the tools and support you need to invent with us in ways that are rewarding, fulfilling and fun. Amazon Robotics is seeking experienced and Senior Applied Scientist with a passion for robotic research. Our team works on challenging and high-impact projects within robotics. Examples of projects include allocating resources to complete a million orders a day, coordinating the motion of thousands of robots and identifying objects and damage. Key job responsibilities - Lead research initiatives advancing AI-driven structured field robotics (path planning, fleet coordination, control systems) and translate breakthroughs into production solutions at global scale - Own end-to-end delivery of complex algorithmic solutions from design through production deployment and operational maintenance - Drive technical decisions for Control, Coordination, and Path Planning systems meeting performance, scalability, and reliability requirements - Partner with cross-functional teams to translate business requirements into research problems and assess technical risks - Influence technical direction across the broader robotics organization through design reviews and technical discussions with senior engineers and scientists - Demonstrate measurable impact through AI-driven algorithmic improvements: fleet efficiency gains, operational cost reduction, system reliability improvements, and enhanced customer experience - Publish findings at top-tier AI and robotics conferences 
representing organizational technical leadership - Mentor junior Applied Scientists on research methodology and balancing innovation with production constraints - Operate independently on ambiguous, multi-quarter problems requiring novel AI approaches while navigating tradeoffs between research innovation and production constraints A day in the life Amazon offers a full range of benefits that support you and eligible family members, including domestic partners and their children. Benefits can vary by location, the number of regularly scheduled hours you work, length of employment, and job status such as seasonal or temporary employment. The benefits that generally apply to regular, full-time employees include: - Medical, Dental, and Vision Coverage - Maternity and Parental Leave Options - Paid Time Off (PTO) - 401(k) Plan If you are not sure that every qualification on the list above describes you exactly, we'd still love to hear from you! At Amazon, we value people with unique backgrounds, experiences, and skillsets. If you’re passionate about this role and want to make an impact on a global scale, please apply! About the team We're the structured field robotics organization powering large-scale mobile robotics operations globally. Our mission is to enable safe, efficient, and reliable robotic operations through intelligent Control, Coordination, and Path Planning systems. We operate at the intersection of planning, algorithmic, and ML research with production systems, owning the full stack from innovation to deployment. Our culture balances research excellence with operational ownership. Applied Scientists partner closely with engineers: reviewing code, contributing to research discussions, and solving problems together. We value deep technical expertise alongside pragmatic engineering judgment. We invest in our people through mentorship and encourage conference participation and knowledge sharing.
US, CA, San Francisco
PXT Central Science is seeking an exceptional Data Scientist to join our team. The ideal candidate will thrive in a dynamic, multifaceted role where you'll translate complex business challenges into rigorous quantitative frameworks, extract actionable insights from structured and unstructured datasets, and architect science-backed, scalable solutions that elevate the experience of our 1 million+ employees worldwide. If you're energized by the opportunity to apply data science to our mission of making Amazon Earth's Best Employer, we want to hear from you. Key job responsibilities • Own the design, development, and maintenance of scalable models and prototypes leveraging statistical, machine learning, or GenAI methodologies to enhance employee experience. • Partner with scientists, engineers, and product leaders to solve for employee experience defects using scientific approaches, building new services and tools that deliverable measurable impact. • Author and maintain detailed technical documentation related to the projects you drive. • Communicate results to diverse audiences of varying technical background with effective writing, visualizations, and presentations • Stay current with emerging methods and technologies, and implement them strategically to amplify the team’s impact. About the team The Central Science Team within Amazon’s People Experience and Technology org (PXTCS) uses economics, behavioral science, statistics, machine learning, and Generative AI to proactively identify mechanisms and process improvements which simultaneously improve Amazon and the lives, well-being, and the value of work to Amazonians. We are an interdisciplinary team, which combines the talents of science, engineering, and UX to develop and deliver solutions that measurably achieve this goal.
US, MA, N.reading
Amazon is seeking exceptional talent to help develop the next generation of advanced robotics systems that will transform automation at Amazon's scale. We're building revolutionary robotic systems that combine cutting-edge AI, sophisticated control systems, and advanced mechanical design to create adaptable automation solutions capable of working safely alongside humans in dynamic environments. This is a unique opportunity to shape the future of robotics and automation at an unprecedented scale, working with world-class teams pushing the boundaries of what's possible in robotic dexterous manipulation, locomotion, and human-robot interaction. This role presents an opportunity to shape the future of robotics through innovative applications of deep learning and large language models. At Amazon we leverage advanced robotics, machine learning, and artificial intelligence to solve complex operational challenges at an unprecedented scale. Our fleet of robots operates across hundreds of facilities worldwide, working in sophisticated coordination to fulfill our mission of customer excellence. The ideal candidate will contribute to research that bridges the gap between theoretical advancement and practical implementation in robotics. You will be part of a team that's revolutionizing how robots learn, adapt, and interact with their environment. Join us in building the next generation of intelligent robotics systems that will transform the future of automation and human-robot collaboration. Key job responsibilities - Design and implement whole body control methods for balance, locomotion, and dexterous manipulation - Utilize state-of-the-art in methods in learned and model-based control - Create robust and safe behaviors for different terrains and tasks - Implement real-time controllers with stability guarantees - Collaborate effectively with multi-disciplinary teams to co-design hardware and algorithms for loco-manipulation - Mentor junior engineer and scientists
US, CA, San Francisco
The Amazon General Intelligence “AGI” organization is looking for an Executive Assistant to support leaders of our Autonomy Team in our growing AI Lab space located in San Francisco. This role is ideal for exceptionally talented, dependable, customer-obsessed, and self-motivated individuals eager to work in a fast paced, exciting and growing team. This role serves as a strategic business partner, managing complex executive operations across the AGI organization. The position requires superior attention to detail, ability to meet tight deadlines, excellent organizational skills, and juggling multiple critical requests while proactively anticipating needs and driving improvements. High integrity, discretion with confidential information, and professionalism are essential. The successful candidate will complete complex tasks and projects quickly with minimal guidance, react with appropriate urgency, and take effective action while navigating ambiguity. Flexibility to change direction at a moment's notice is critical for success in this role. Key job responsibilities Key job responsibilities Serve as strategic partner to senior leadership, identifying opportunities to improve organizational effectiveness and drive operational excellence Manage complex calendars and scheduling for multiple executives Drive continuous improvement through process optimization and new mechanisms Coordinate team activities including staff meetings, offsites, and events Schedule and manage cost-effective travel Attend key meetings, track deliverables, and ensure timely follow-up Create expense reports and manage budget tracking Serve as liaison between executives and internal/external stakeholders Build collaborative relationships with Executive Assistants across the company and with critical external partners Help us build a great team culture in the Lab!
US, WA, Seattle
Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports – including Amazon MGM Studios-produced series and movies; licensed fan favorites; and programming from Prime Video subscriptions such as Apple TV+, HBO Max, Peacock, Crunchyroll and MGM+. All customers, regardless of whether they have a Prime membership or not, can rent or buy titles via the Prime Video Store, and can enjoy even more content for free with ads. Are you interested in shaping the future of entertainment? Prime Video's technology teams are creating best-in-class digital video experience. As a Prime Video team member, you’ll have end-to-end ownership of the product, user experience, design, and technology required to deliver state-of-the-art experiences for our customers. You’ll get to work on projects that are fast-paced, challenging, and varied. You’ll also be able to experiment with new possibilities, take risks, and collaborate with remarkable people. We’ll look for you to bring your diverse perspectives, ideas, and skill-sets to make Prime Video even better for our customers. With global opportunities for talented technologists, you can decide where a career Prime Video Tech takes you! Key job responsibilities As a highly experienced and seasoned science leader, you will apply state of the art natural language processing and computer vision research to video centric digital media, while also responsible for creating and maintaining the best environment for applied science in order to recruit, retain and develop top talent. You will lead the research direction for a team of deeply talented applied scientists, creating the roadmaps for forward-looking research and communicate them effectively to senior leadership. 
You will also hire and develop applied scientists - growing the team to meet the evolving needs of our customers. About the team This team's mission is to deeply understand all content and empower all customers with relevant language options, innovative accessibility assists, and rich title-information across all their content-experiences on Prime Video. We create and publish content on-time that's meaningful, accurate, and accessible to every customer globally. We delight our customers by pushing the boundaries of content understanding and enrichment. Through inclusion and innovation, we do the most fulfilling work of our career.