The only text-to-audio model I can think of at the moment is Stable Audio Open, which AFAIK is rather underwhelming for your use case, if it can even handle anything more complex than basic sounds, and it doesn't do lyrics.
It is even under the “new” membership licensing of SAI.
I remember reading about a more recent one, but I currently can’t find it, and I don’t think that one could handle lyrics either.
I suppose the music industry is a lot harder to fight, so not a lot of people want to entangle themselves with it.
I didn’t know about this project, so I took a quick look around.
I didn’t see any mention of telemetry or metrics, but I assume they could use the automatic upgrade check for that:
https://tails.net/doc/upgrade/index.en.html#automatic
Still, I just gave this a few minutes, so there could be more.