I fucked with the title a bit. What I linked to was actually a Mastodon post linking to the actual thing. But in my defense, I found it because Cory Doctorow boosted it, so, in a way, I am providing the original source here.

please argue. please do not remove.

  • Melllvar@startrek.website
    11 months ago

    I think we should have a rule that says if an LLM company invokes fair use on the training inputs, then the outputs are public domain.

    • Steve@communick.news
      11 months ago

      That’s already been ruled on once.

      A recent lawsuit challenged the human-authorship requirement in the context of works purportedly “authored” by AI. In June 2022, Stephen Thaler sued the Copyright Office for denying his application to register a visual artwork that he claims was authored “autonomously” by an AI program called the Creativity Machine. Dr. Thaler argued that human authorship is not required by the Copyright Act. On August 18, 2023, a federal district court granted summary judgment in favor of the Copyright Office. The court held that “human authorship is an essential part of a valid copyright claim,” reasoning that only human authors need copyright as an incentive to create works. Dr. Thaler has stated that he plans to appeal the decision.

      Why would companies care about copyright of the output? The value is in the tool that creates it. To me, the whole issue revolves around the AI company profiting from its service, a service built on a massive library of copyrighted works. It seems clear to me that a large portion of their revenue should go equally to the owners of the works in their database.

  • Cyber Yuki@lemmy.world
    11 months ago

    What constitutes fair use?

    17 U.S.C. § 107

    Notwithstanding the provisions of sections 17 U.S.C. § 106 and 17 U.S.C. § 106A, the fair use of a copyrighted work, including such use by reproduction in copies or phonorecords or by any other means specified by that section, for purposes such as criticism, comment, news reporting, teaching (including multiple copies for classroom use), scholarship, or research, is not an infringement of copyright.

    GenAI training, at least regarding art, is neither criticism, comment, news reporting, scholarship, nor research.

    AI training is done not by scientists but by engineers at a corporate entity with a long-term profit goal.

    So, by elimination, we can conclude that none of the purposes covered by the fair use doctrine apply to Generative AI training.

    Q.E.D.