Navigating Assurance and Approval in Machine Learning: Principles, Practice, and Burden
Assurance and approval in aircraft type certification ensure aircraft and systems designs meet the regulatory safety standards set by the FAA for safe operation in the National Airspace System. Regulations mandate safe performance under expected conditions and are guided by an “all and only” principle: the aircraft, as a whole, must be shown to do all that it is intended to and only that which is acceptable. Developers show adherence to this principle using a suite of standardized assurance methods.
Assurance methods help ensure that each step of an aircraft design is consistent. Demonstrating clear alignment between the aircraft’s overall design intent, as well as the systems and software that are implemented, is essential for building a safe and reliable aircraft. This process is also critical for showing that a system does ‘all and only’ what it’s intended to. As a design becomes more complex, such as by adding novel safety features, it’s important to maintain a clear connection between each layer of the system to make sure everything still reflects the original design intent.
Showing the aircraft as a whole meets “all and only” is typically achieved through item-level showings that support system-level showings that ultimately support the aircraft showing. In simpler terms, certifying an aircraft is like building a pyramid: every block has to fit perfectly to support the next layer. To prove the aircraft does exactly what it’s supposed to—and nothing more—engineers start by verifying the smallest pieces (like software or components), then show how those come together into larger systems, and finally how those systems make the whole aircraft work safely. When machine learning is used, it can only shape behavior through these individual components, like software. That means the toughest safety questions start at the foundation.
Unlike conventional development processes, ML can introduce unacceptable behaviors into items in ways other than human error. While this challenges existing assurance methods, it also provides perspectives on how this challenge may be addressed. This involves clarifying assurance concerns and examining processes for demonstrating compliance.