MTU Artificial Intelligence Colloquium

Organizer:

Dr. Vinh Nguyen (Mechanical and Aerospace Engineering)


The MTU AI Colloquium hosts MTU students and faculty presenting their research in Artificial Intelligence during the Fall and Spring semesters!

If you would like to present your work or have questions, contact the organizer, Dr. Vinh Nguyen, at vinhn@mtu.edu.

Location: EERC Room 315

Next Colloquium:

October 28, 2025 at Noon

"RL-STPA: Adapting System-Theoretic Hazard Analysis for Safety-Critical Reinforcement Learning"

Steven Senczyszyn, Mechanical Engineering

Abstract:

As reinforcement learning (RL) deployments expand into safety-critical domains, existing evaluation methods fail to systematically identify hazards arising from the black-box nature of neural-network-enabled policies and from distributional shift between training and deployment. This paper introduces Reinforcement Learning System-Theoretic Process Analysis (RL-STPA), a framework that adapts conventional STPA's systematic hazard analysis to address RL's unique challenges through three key contributions: hierarchical subtask decomposition using both temporal phase analysis and domain expertise to capture emergent behaviors, coverage-guided perturbation testing that explores the sensitivity of state-action spaces, and iterative checkpoints that feed identified hazards back into training through reward shaping and curriculum design. We demonstrate RL-STPA in the safety-critical test case of autonomous drone navigation and landing, revealing potential loss scenarios that can be missed by standard RL evaluations. The proposed framework provides practitioners with a toolkit for systematic hazard analysis, quantitative metrics for safety coverage assessment, and actionable guidelines for establishing operational safety bounds. While RL-STPA cannot provide formal guarantees for arbitrary neural policies, it offers a practical methodology for systematically evaluating and improving RL safety and robustness in safety-critical applications where exhaustive verification methods remain intractable.

*Food and refreshments will be provided.*


Past Colloquiums