Meta’s No Language Left Behind (NLLB) project has scaled its translation technology to cover 200 languages, many of them lesser-spoken, but experts suggest that Meta should consult native speakers and language specialists to improve the tool. The project combines data from the OPUS repository with data mined from sources such as Wikipedia to train a multilingual language model (MLM) that can translate directly between any pair of supported languages without pivoting through English.
