• vrighter@discuss.tchncs.de
    link
    fedilink
    arrow-up
    2
    ·
    5 months ago

    it is only open source if i can build it myself. Which I can’t if you just give me the weights.

    The weights are the “compiled” version of the dataset. It’s the dataset that’s the source, not the weights

    • delirious_owl@discuss.online
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      So the cover art I made for a friend’s album isn’t open source, even though I released it as CC BY-SA… because you can’t make it yourself?

      • leopold@lemmy.kde.social
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        5 months ago

        I would consider the “source code” for artwork to be the project file, with all of the layers intact and whatnot. The Photoshop PSD, the GIMP XCF or the Krita KRA. The “compiled” version would be the exported PNG/JPG.

        You can license a compiled binary under CC BY if you want. That would allow users to freely decompile/disassemble it or to bundle the binary for their purposes, but it’s different from releasing source code. It’s closed source, but under a free license.

      • sweng@programming.dev
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        It would depend on the format what is counted as source, and what isn’t.

        You can create a picture by hand, using no input data.

        I challenge you to do the same for model weights. If you truly just sit down and type away numbers in a file, then yes, the model would have no further source. But that is not something that can be done in practice.

        • delirious_owl@discuss.online
          link
          fedilink
          arrow-up
          0
          ·
          5 months ago

          I challenge you to recreate the Mona Lisa.

          My point is that these models are so complex that they’re closer to art than anything reproduce

          • sweng@programming.dev
            link
            fedilink
            arrow-up
            0
            ·
            edit-2
            5 months ago

            I don’t see your point? What is the “source” for Mona Lisa I would use? For LLMs I could reproduce them given the original inputs.

            Creating those inputs may be an art, but so could any piece of code. No one claims that code being elegant disqualifies it from being open source.

            • delirious_owl@discuss.online
              link
              fedilink
              arrow-up
              0
              ·
              5 months ago

              Are you sure that you can reproduce the model, given the same inputs? Reproducibility is a difficult property to achieve. I wouldn’t think LLMs are reproduce.

              • sweng@programming.dev
                link
                fedilink
                arrow-up
                0
                ·
                edit-2
                5 months ago

                In theory, if you have the inputs, you have reproducible outputs, modulo perhaps some small deviations due to non-deterministic parallelism. But if those effects are large enough to make your model perform differently you already have big issues, no different than if a piece of software performs differently each time it is compiled.

  • mindbleach@sh.itjust.works
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    4 months ago

    Years ago I found myself explaining to Chinese Room dinguses - in a neural network, the part that does stuff is not the part written by humans.

    I’m not sure it’s meaningful to say this sort of AI has source. You can have open data sets. (Or rather you can be open about your data sets. I don’t give a shit if LLMs list a bunch of commercial book ISBNs.) But rebuilding a network isn’t exactly a matter of hitting “compile” and going out for coffee. It can take months, and the power output of a small city… and it still can’t be exact. There’s so much randomness involved in the process that it’d be iffy whether you get the same weights twice, even if you built everything around that goal.

    Saying “here’s the binary, do whatever” is honestly a lot better for neural networks than for code, because it’s not like the people who made it know how it works either.

  • howrar@lemmy.ca
    link
    fedilink
    arrow-up
    1
    ·
    5 months ago

    This needs to have multiple levels of “openness” to distinguish between having access to the code, the dataset, a documented training procedure, and the final weights. I wouldn’t consider it fully open unless these are all available, but I still appreciate getting something over nothing, and I think that should be encouraged.