These 183,000 Books Are Fueling the Biggest Fight in Publishing and Tech
The Atlantic – Use our new search tool to see which authors have been used to train the machines. This summer, I acquired a data set of more than 191,000 books that were used without permission to train generative-AI systems by Meta, Bloomberg, and others. I wrote in The Atlantic about how the data set, …