
CosmicTaco
11moOpenAI Accused of Using Paywalled Books for AI Training
- A new paper by the AI Disclosures Project claims OpenAI used non-public O'Reilly books to train its GPT-4o model.
- The paper suggests GPT-4o shows strong recognition of paywalled content compared to earlier models like GPT-3.5 Turbo.
- Co-authors used the DE-COP method to detect potential copyrighted material in the models' training data.
- OpenAI faces criticism for its data practices, though it does have some licensing agreements and opt-out mechanisms.
- OpenAI did not respond to the allegations and continues to seek high-quality training data.
Source: TechCrunch
11mo ago

You're early. There are no comments yet.
Be the first to comment.
Discover more
Curated from across