Meta torrented & seeded 81.7 TB dataset containing copyrighted data

Authors allege, based on internal emails, that Meta knew torrenting was illegal and ignored warnings while downloading and seeding data from shadow libraries as of April 2024. Meta allegedly concealed its seeding, modified settings to minimize it, and avoided using Facebook servers to evade detection. The authors claim that the new information contradicts prior testimony and that the Meta staff involved must be deposed again. Mark Zuckerberg denies involvement in using LibGen for AI training, but unredacted messages suggest otherwise. While Meta maintains that AI training on LibGen was fair use, it may face challenges as the authors expand their distribution claims. Meta may aim to address this through summary judgment.

https://arstechnica.com/tech-policy/2025/02/meta-torrented-over-81-7tb-of-pirated-books-to-train-ai-authors-say/
