The paper “Sequential change-point detection in high-dimensional Gaussian graphical models,” by Hossein Keshavarz, a senior data scientist at relationalAI; George Michailidis; and Yves Atchadé, a professor of statistics at Boston University, touches on the theme of anomaly detection.

George Michailidis: How to identify important changes in online networks

Amazon Scholar discusses the evolution of anomaly detection research.

As the director of the University of Florida Informatics Institute, George Michailidis, who is also an Amazon Scholar on the Supply Chain Optimization Technologies (SCOT) team, leads a diverse community of data scientists with training in engineering, statistics, applied math, and other sciences. He notes that this assortment of backgrounds is important in data science.

George Michailidis is the director of the University of Florida Informatics Institute, and an Amazon Scholar on the Supply Chain Optimization Technologies team.

“In addition to statistics, there are a number of other disciplines that data scientists need to be aware of, such as programming, algorithms, optimization, and of course, some subject matter expertise because you don't do data science in a vacuum,” he says.

Michailidis was trained in applied mathematics and statistics, with a PhD thesis focused on optimization problems and their applications to statistical problems. His postdoc was in operations research, which introduced him to a different class of problems. “Some of them come about in Amazon’s supply chain, for example, such as problems of how to schedule the jobs on the machine, or how to route the traffic in the network, and so forth.”

For about 17 years, Michailidis was a faculty member at the University of Michigan in statistics with a joint appointment in electrical engineering. “I combined my statistical training with my interest in engineering types of problems.”

Data integration

Since then, his research agenda at the University of Florida has had strong theoretical components, but he remains very interested in practical applications. One of his current interests is data integration, and its many potential uses. For example, when it comes to the study of diseases, there is a wealth of molecular-level data from patients’ samples. At the same time, there is information on the patient's clinical records and demographics.

“How do you create models to try to identify key drivers, for example, for disease progression by combining all these different data sources?” is one of the questions that motivates Michailidis’ work. With these models, he tries to provide insights not only for prognostic or diagnostic purposes but also for understanding the biological mechanisms that lead to a disease.

Another large component of Michailidis’ research relates to a problem known as anomaly detection. “This is an old problem that has been going on for more than 60 years,” he says. To a large extent, it originated in manufacturing, where people were interested in finding defects in the manufacturing process and fixing them. As the technology evolved, similar questions have been arising in many other fields.

This is broadly the theme of a paper published by Michailidis and his colleagues Hossein Keshavarz, a senior data scientist at relationalAI, and Yves Atchadé, a professor of statistics at Boston University, entitled “Sequential change-point detection in high-dimensional Gaussian graphical models.”

Michailidis notes that, as manufacturing processes became more complex, it became necessary to monitor many more metrics.

“A typical example of this complexity is semiconductor manufacturing, where you have to monitor hundreds of little things,” he says.

In more modern applications, the next step is to monitor networks.

“You’re not only monitoring a lot of things. Now these things are interconnected and you're trying to understand how this network, as an object, changes its structure at some point in time,” Michailidis explains. “And you're doing that in an online fashion because this process keeps going. You keep observing the network and you're trying to identify changes as quickly as possible.”

In addition to developing a technique to detect changes, researchers also must establish that their technique is sensitive enough for certain types of changes and determine whether it detects them quickly enough. This is the challenge, in the online realm, that Michailidis and his colleagues attempt to address in their paper. The paper “introduces a novel scalable online algorithm for detecting an unknown number of abrupt changes.”

In the paper, the authors present an application on stock market data, where the network is made of movements of stocks. “We showed how the network changes, for example, during the great financial crisis of 2008, and how the stock market got affected by the European debt crisis in 2012 and so forth.” Michailidis notes that these techniques are especially suited for problems where there are dependencies between observable elements without knowledge of the nature of those dependencies.

“With stocks, whether they are moving together or in different directions, these movements, or lack of movement, are what give rise to the network structure. And that’s what we are capturing with these graphical models,” he says.
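To make the idea of a network arising from co-movement concrete, here is a minimal Python sketch. It assumes a Gaussian graphical model, in which two nodes are connected when their partial correlation, given the remaining nodes, is nonzero. The three stocks A, B, and C, their correlation values, and the function names are invented for illustration and are not taken from the paper.

```python
import math
from itertools import combinations


def partial_correlation(r_xy, r_xz, r_yz):
    """Partial correlation of x and y given z, from pairwise correlations."""
    numerator = r_xy - r_xz * r_yz
    denominator = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    return numerator / denominator


def network_edges(names, corr, tol=1e-9):
    """Edges of a three-node graph: pairs with nonzero partial correlation.

    `names` holds exactly three labels; `corr` maps frozenset pairs of labels
    to pairwise correlations (hypothetical inputs for this sketch).
    """
    edges = []
    for x, y in combinations(names, 2):
        (z,) = [n for n in names if n not in (x, y)]
        r = partial_correlation(
            corr[frozenset((x, y))],
            corr[frozenset((x, z))],
            corr[frozenset((y, z))],
        )
        if abs(r) > tol:
            edges.append((x, y))
    return edges


# Made-up correlations between the returns of three stocks.
CORR = {
    frozenset(("A", "B")): 0.6,
    frozenset(("B", "C")): 0.5,
    frozenset(("A", "C")): 0.3,
}

if __name__ == "__main__":
    print(network_edges(["A", "B", "C"], CORR))
```

In this toy input, A and C are correlated, but that correlation is fully explained by their shared link to B, so the sketch reports edges between A and B and between B and C and none between A and C. With hundreds of stocks and few observations, such quantities can no longer be estimated reliably without a sparsity assumption, which is the high-dimensional regime discussed in the interview.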

Within the SCOT organization, Michailidis says he has the opportunity to tackle challenging problems at an unprecedented scale. “The problems are much more complex because they're not as clear cut as they are in academia.” In this interview, he discusses his research on anomaly detection and its potential applications.

  1. Q. 

    Your paper mentions high dimensional piecewise sparse graphical models. What does that entail and what are some applications?

    A. 

    The graphical model is a particular statistical model that tries to capture statistical dependencies between the things that are measured on the nodes. In the stock market example, you're looking at the rate of return of a stock. This is the measurement that you have on every node over time and you're trying to understand, for example, whether the return of one technology stock is correlated with the return of some other technology stock. So that's what the graphical model is trying to capture — the statistical dependencies.

    The next step is what we mean by high dimensional. Essentially, it means that the number of nodes, or variables, in your network becomes very large compared to how many observations you have. You may have a short observation period, but with a high number of nodes. What we call high-dimensional statistics became a big field of study 15 to 20 years ago, with a lot of applications. The reason is that, in more classical statistics, we always made the assumption that the sample size in our observations is much larger than the number of variables. In the high-dimensional regime, the relationship flips and you have many more variables than observations and that poses a whole bunch of technical challenges, to the point where you can’t even solve the problem.

    So, you need some additional assumptions, and that's where another important term comes in: sparse. This means that this network doesn't have too many connections. If it was very well connected, then we would not be able to solve the problem for technical reasons, because you would not have enough data. So, you make the assumption that these networks are not too connected to compensate for how much data you have.

    And the last term we need to understand is piecewise. By piecewise, we mean that, for this period, the network structure stays the same, and then changes abruptly to some other structure. It's not a gradual change — although this may be happening in reality. It heavily depends on the underlying application. It may either be a simplifying assumption in order to do the analysis or, in many cases, that's exactly what happens.

    In a neuroscience example, if the subject sits in the scanner without moving and you then tell them to “raise your hand or read this sentence,” there is an abrupt change because there is a new task after a resting state. This is also possible in the stock market, where new information may create these abrupt changes.

    In many applications, there is really an abrupt change and this is the proper setting to use. In some other cases, changes may be a little bit more gradual. But we can still look at them as abrupt changes because it becomes a good working hypothesis and simplifies things. A lot of these techniques that people develop are good working models, and not exactly what's going on; that's fairly standard in a lot of scientific fields. And that explains the high dimensional piecewise sparse graphical model. That's where all the pieces come together.

  2. Q. 

    Why is it important to be able to detect these abrupt changes in an online setting?

    A. 

    Because you keep collecting the data, and you would like to identify these changes as things evolve. You could solve the same problem, with the same high dimensional sparse piecewise graphical model, in an offline manner. In that case, the difference is that you have already collected these data and would like to explore them in a retrospective manner to see if you can find these types of changes. That's also a problem of interest.

    The reason that in this article we focus on online detection is that we have already done work on the offline version, so it was natural to start exploring what is different in an online setting. And it's much, much more challenging, because you don't know the future and you keep getting new information, and you're trying to detect these changes quickly. Online problems in machine learning and other areas are more challenging than offline problems, as a general rule. So, this is for me a natural evolution, since I’ve already used these sparse graphical models in an offline setting.

  3. Q. 

    What does the paper demonstrate and how is it applicable to Amazon?

    A. 

    The paper does demonstrate that it is possible to detect these changes online, so it’s a positive message. And it also shows a caveat. If, for example, the changes in this connectivity pattern were concentrated on only one node, then we could not detect them with the current technology. Because that's a very localized change, it only involves a very tiny part of the network. And our technique would only be able to detect them by waiting for a very long time. From an applications perspective, that makes it uninteresting. That tells you the limitations, which are important in some settings. We have done most of the work, but we found out that we were missing something. So, we need to go and develop a little bit more.

    The results could be applicable to Amazon because these graphical models come up a lot. So far, we have used techniques that do not take the connections into account; we have simply looked at what is going on in the time series of, let's say, a single node and whether that changes. Obviously, given the fact that Amazon operates in a highly volatile environment, changes are important. In the longer term, given the fact that the team has done work with graphical models, it may be interesting to utilize some of these techniques. The potential is there.

    In general, anomaly detection work to date across many disciplines (statistics, signal processing, machine learning, econometrics) has largely focused on parametric models, where with some effort the theoretical properties of anomaly detection procedures can be elucidated analytically and then validated through simulations. The analytical work provides deeper insights into the performance of these anomaly detection procedures and their limitations, and when they do not perform well. With all the advances in deep learning models, they become prime tools to use in anomaly detection problems.

    However, the challenge then becomes to understand the performance limits of such models, beyond relying on numerical work. Such advances may take some time, but once the community makes progress, much more powerful procedures will be available to the practitioners.
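The online setting discussed in the interview, where observations keep arriving and a change has to be flagged as soon as possible, can be illustrated with a deliberately simple Python sketch. This is not the algorithm from the paper: it is a generic sliding-window heuristic that watches the correlation between a single pair of series. The window length, the threshold, the synthetic data, and all function names are assumptions made for illustration.

```python
import math
from collections import deque


def pearson(xs, ys):
    """Sample Pearson correlation; returns 0.0 if either series is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0.0 or var_y == 0.0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def detect_change_online(stream, window=20, threshold=1.0):
    """Return the index at which an alarm is first raised, or None.

    The first full window sets a baseline correlation. Each later
    observation updates a sliding window, and an alarm fires when the
    windowed correlation departs from the baseline by more than `threshold`.
    """
    reference = None
    recent = deque(maxlen=window)
    for t, pair in enumerate(stream):
        recent.append(pair)
        if len(recent) < window:
            continue
        xs, ys = zip(*recent)
        current = pearson(xs, ys)
        if reference is None:
            reference = current  # baseline from the first full window
            continue
        if abs(current - reference) > threshold:
            return t
    return None


def synthetic_stream(length=120, change_at=None):
    """Toy data: y moves with x, then against x from index `change_at` on."""
    pattern = [1.0, -1.0, 2.0, -2.0]
    for t in range(length):
        x = pattern[t % 4]
        sign = -1.0 if change_at is not None and t >= change_at else 1.0
        yield x, sign * x


if __name__ == "__main__":
    print("alarm at:", detect_change_online(synthetic_stream(change_at=60)))
    print("no change:", detect_change_online(synthetic_stream()))
```

The alarm comes some observations after the true change, because the window first has to fill with enough post-change data. That detection delay is the quantity an online method tries to keep small. A change confined to a tiny part of a large network would shift such summary statistics only slightly, which is consistent with the limitation Michailidis describes for very localized changes.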
