The Atlantic created a searchable database of the music used to train AI

June 20, 2026 Terrence O’Brien


Atlantic reporter Alex Reisner recently uncovered four datasets of music being used to train AI models and made them fully searchable for the public. Two of the sets are enormous, at 12 million and 9 million tracks. The other two are much smaller but still represent a significant amount of training data, at over 100,000 songs each.

According to Reisner, the sets have been downloaded thousands of times. While it's impossible to know exactly who has used them, Google and Stability have both confirmed in research papers that they have. Some of the sources, like the Free Music Archive dataset, are free to stream for personal use but re …

Read the full story at The Verge.
