Researchers at EPFL have developed a new training approach to ensure that machine learning models, particularly deep neural networks, consistently perform as intended, significantly enhancing their reliability. This new model employs a continuously adaptive attack strategy to create a more intelligent training scenario and is applicable across a wide range of activities that depend on AI for classification. The research was awarded an esteemed Best Paper Award at the 2023 International Conference on Machine Learning’s New Frontiers and Adversarial Machine Learning Workshop for recognizing and correcting an error in a very well-established way to train, improving AI defenses against adversarial manipulation.
