
Big-name publishers are refusing to let Apple Intelligence train on their data

Website owners have a simple mechanism to tell Apple Intelligence not to scrape their sites for training purposes, and major sites such as Facebook and the New York Times are reportedly using it.

Craig Federighi standing in front of a large screen with 'Apple Intelligence' in colorful letters, inside a modern, minimalistic room.
Future expansions to Apple Intelligence may involve more AI partners, paid subscriptions

Apple has been offering publishers millions of dollars for the right to scrape their sites, unlike Google, which believes all data should be freely available to train AI large language models. As part of this approach, Apple honors a system in which a site can state in a particular file that it does not want to be scraped.

That file is a simple text file called robots.txt and, according to Wired, many major publishers are using it to block Apple’s AI training.
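To illustrate how such an opt-out might look, here is a minimal Python sketch that checks a sample robots.txt with the standard library's `urllib.robotparser`. It assumes the crawler name `Applebot-Extended`, which is the user agent Apple documents for AI-training opt-outs but which this article does not name. The `example.com` URL and the `is_allowed` helper are hypothetical, and a real publisher's file may well differ.

```python
from urllib.robotparser import RobotFileParser

# Sample robots.txt (an assumption, not taken from any real site):
# block the AI-training agent everywhere, allow all other crawlers.
ROBOTS_TXT = """\
User-agent: Applebot-Extended
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/article"  # hypothetical URL
    print("Applebot-Extended:", is_allowed(ROBOTS_TXT, "Applebot-Extended", page))
    print("Applebot:", is_allowed(ROBOTS_TXT, "Applebot", page))
```

In this sketch the AI-training agent is blocked while the ordinary search crawler is still permitted, which is the kind of distinction a per-agent rule allows. Note that robots.txt is advisory: it only works because the crawler chooses to honor it.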
