MIT’s Improbable AI Lab has developed a new framework, HiP, that uses three different foundation models trained on different data modalities to help robots plan and execute tasks. This removes the need for expensive paired data and makes the reasoning process more transparent.