Nvidia has filed a patent for a system that uses a generative AI model to synthesize datasets for training a machine learning model on specific visual tasks, such as autonomous driving, robotics or facial recognition. The generative model is meant to narrow the gap between synthetic and real-world data, and the trained machine learning model is validated against a real-world validation dataset. Synthetic data can make AI training more accessible, and access to large amounts of it can be especially beneficial for small companies or individual developers.
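
The workflow described above (generate synthetic data, train on it, then validate on real data) can be illustrated with a toy sketch. Nothing here comes from the patent itself: `generate_synthetic`, `make_real_validation`, the one-dimensional features and the nearest-centroid classifier are all hypothetical stand-ins chosen to keep the example small, and the "real" data is simulated.

```python
import random
from statistics import mean


def generate_synthetic(n, rng):
    """Hypothetical stand-in for the generative model: returns (feature, label) pairs."""
    data = []
    for _ in range(n):
        label = rng.randint(0, 1)
        # Assumed class centers, deliberately a little off from the "real" ones
        # to mimic a gap between synthetic and real-world data.
        center = 0.3 if label == 0 else 4.6
        data.append((rng.gauss(center, 1.2), label))
    return data


def make_real_validation(n, rng):
    """Simulated placeholder for a real-world validation set (an assumption)."""
    data = []
    for i in range(n):
        label = i % 2
        center = 0.0 if label == 0 else 5.0
        data.append((rng.gauss(center, 1.0), label))
    return data


def train_centroids(data):
    """Fit a nearest-centroid classifier: one mean feature value per label."""
    return {label: mean(x for x, y in data if y == label) for label in (0, 1)}


def predict(centroids, x):
    """Return the label whose centroid is closest to x."""
    return min(centroids, key=lambda label: abs(x - centroids[label]))


def accuracy(centroids, data):
    """Fraction of samples in data that the classifier labels correctly."""
    return sum(predict(centroids, x) == y for x, y in data) / len(data)


rng = random.Random(0)  # fixed seed so the run is repeatable
synthetic_train = generate_synthetic(500, rng)   # train only on synthetic data
real_val = make_real_validation(200, rng)        # validate only on "real" data
model = train_centroids(synthetic_train)
val_accuracy = accuracy(model, real_val)
print(f"accuracy on real validation data: {val_accuracy:.3f}")
```

The point of the sketch is the split: the model never sees the validation samples during training, so its score on them indicates how well the synthetic data transfers to the real distribution.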
