Blapoo@lemmy.ml to Technology@lemmy.world • OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series
1 year ago
Ah, but that’s the thing. Training isn’t copying; it’s pattern recognition. If you train a model on “The dog says woof” and then ask it “What does the dog say?”, it’s not guaranteed to say “woof”.
Similarly, a model having been trained on Harry Potter only means it has a good statistical sense of how the sentences in those books go.
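To make the "woof" point concrete, here's a toy sketch. It assumes a bigram word-count model, which is vastly simpler than ChatGPT and is not a claim about how OpenAI trains anything. The two extra training sentences and the names `train_bigram` and `next_word` are made up for illustration. What it shows: the trained "model" is a table of which word followed which, and the answer is sampled from that table rather than looked up.

```python
import random
from collections import Counter, defaultdict

def train_bigram(sentences):
    """Count which word follows which. Only the counts are kept."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts

def next_word(counts, prev, rng):
    """Sample a follower of `prev` in proportion to how often it was seen."""
    followers = counts[prev]
    words = list(followers)
    weights = [followers[w] for w in words]
    return rng.choices(words, weights=weights, k=1)[0]

# Hypothetical training text: the sentence from the comment plus two others.
corpus = ["the dog says woof", "the cat says meow", "the dog says hello"]
model = train_bigram(corpus)

# "What does the dog say?" becomes: which word tends to follow "says"?
rng = random.Random(0)
samples = [next_word(model, "says", rng) for _ in range(20)]
print(Counter(samples))
```

Even though “the dog says woof” was in the training text, “woof” is only one of several possible answers here. If that sentence were the only thing the model had ever seen, it would say “woof” every time, so the toy cuts both ways.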
Thus the distinction. Can I train on a comment section discussing the book?
But you and I did NOT. I see a lot of people online who can’t make the distinction.
EDIT: Thanks for the replies, all. Some good conversation here.