Increasingly, the authors of works being used to train large language models are complaining (and rightfully so) that they never gave permission for such a use. If I were an LLM company, I’d be seriously looking for a Plan B right now, whether that’s engaging publishing companies to come up with new licensing options, paying 1,000,000 grad students to write 1,000,000 lines of prose, or something else entirely.
Keep saying the same about diffusion models as well. I guess we just want Adobe and other wealthy companies to be the only ones with access to proprietary datasets large enough to make futuristic art tools.
Pay subscriptions to your overlords or suffer.