The newest open-source concern around AI that is seeing a lot of interest this weekend is when large language models / AI code generators may rewrite large parts of a codebase and then the “developers” claiming an alternative license incompatible with the original source license. This became a real concern this week with a popular Python project experiencing an AI-driven code rewrite and now published under an alternative license that its original author does not agree with and incompatible with the original code.

Chardet as a Python character encoding detector with its v7.0 release last week was a “ground-up, MIT-licensed rewrite of chardet.” This rewrite was largely driven via AI/LLM and claims to be up to 41x faster and offer an array of new features. But with this AI-driven rewrite, the license shifted from the LGPL to MIT.

    • med@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      12 days ago

      It still doesn’t matter.

      • They can coopt all the open source licenses they want, the development work doesn’t need them.
      • They’re not capturing all commits going forward in time, they’ll have to redo it every time they need an updated library.
      • Any legal work done later that legitimizes this relicensing will open the door for the public, open source world, and more importantly, other relicensing companies to do it to them.

      I believe the end game of legitimizing open source relicensing theft is accidentally abolishing software copyright altogether.

      https://nedroidcomics.tumblr.com/image/41879001445