Meta torrented & seeded 81.7 TB dataset containing copyrighted data
gameshot911 Friday, February 07, 2025
          
          Summary
        
        Meta has been accused of using over 81.7TB of pirated books to train its artificial intelligence language model, leading to concerns about the legality and ethics of this practice from authors and publishers.
      
      
        
        1,269
      
      
          
        923
      
    
      
          
          Summary
        
      
          
          arstechnica.com