Remix.run Logo
RobinL 2 days ago

With apologies for the X link, here is an example from Suno which felt very musical to me: https://x.com/sunomusic/status/1857501332560818342

Here's another example on the Suno website: https://suno.com/song/fc991b95-e4e9-4c8f-87e8-e5e4560755e7

ben_w 2 days ago | parent | next [-]

There are things I like from Suno, but, having used it to make quite a lot, I also get vibes of something subtly wrong that I can't put my finger on, which I assume is somewhere between the audio version of bad kerning and Cronenberg fingers. Too many examples of vocoder/autotune in the training set, perhaps?

That said, I mostly prefer AI over "real" human-made recordings (pop, classical, metal, bardcore, whatever) because I tend to learn the patterns too fast to enjoy, or really even tolerate, any recording more than about 3 times* — I assume I'd like live jazz for longer, but have only been to one place that ever had it so I don't know if it breaks that pattern.

* sole exception: TV theme tunes, though the point of them isn't to listen to them

numpad0 2 days ago | parent | prev | next [-]

I don't find any problems whatsoever in those audio, but I'm not an avid music listener, so out of intuition I'm making a guess that there's same underlying issue as image generation happening: AI makes technically horrible and rage-inducing fillers that lack high level semantic structure, but average people has no words nor experience to assess and describe what's going on.

ahofmann 2 days ago | parent [-]

> I don't find any problems whatsoever in those audio

I think this is why there is no real, powerful protest against all that generated stuff. Only the people, that care, are able to articulate what's wrong with it. To me, all of AI generated content sounds horrible. To almost everyone else, this sounds ok. So we will see and hear more of this generated stuff. We are in the middle of the enshittification of all consumable media.

ben_w 2 days ago | parent | next [-]

I think there's a lot of different reasons all simultaneously going on.

Most human musicians have very little power; that's been going away for a long time, even since "canned music" "robots" pushed live bands out of cinemas a century ago: https://www.smithsonianmag.com/history/musicians-wage-war-ag...

Most popular music already feels, and to an extent is, fake. Not only because mere recording allows repeated takes until it's inhumanly "perfect".

When I played an MP3 of Britney Spears to my mum around the turn of the century, she thought it was a robot singing because of the autotune.

The Monkees was famously an attempt at a manufactured band whose members just happened to not feel like playing that game and did it for real, Gorillaz is even more obviously manufactured. Parasocial relationships are inherently different from "real" relationships, but the performers have to pretend that it's personal when they address a crowd or a camera.

Axis of Awesome demonstrated the similarity of most modern hits with their "4 Chords": https://youtu.be/oOlDewpCfZQ?feature=shared

Those with the power were, possibly still are, the record labels; but if the AI are trained on the works of small musicians that can't afford the copyright cases or the political influence, but also whose works are not under the umbrella of the labels who do have those resources but not the right or short term motivation to intercede on their behalf, the big labels themselves may lose the consumer market to free AI output, while professionals will dismiss both the AI output and the label's output as "just different kinds of slop but both slop" (or whatever the current insult de jour is for AI).

numpad0 2 days ago | parent | prev | next [-]

Agreed. To me AI generated images look horrible, and AI generated audio is still somewhat gut twisting but less painful. AI generated code works for HTML/CSS+JS, but not that great for others. AI generated e-commerce reviews ... on par with human reviews?

I'm starting to think that what AI might be replacing is high ends of consumption, not low ends of generation. Arts has followers that are often less historically significant than genre pioneering works. Doesn't that seem like what AI is doing?

com2kid 2 days ago | parent | prev | next [-]

People were happy with the included wired iPhone earbuds for years, even though they were terrible.

Listening on a laptop speaker Sumo sounds fine. Listening on my wireless earbuds it is... ok. I am too lazy anymore to pull out any of my high quality wired headphones, and if somebody who used to care about sound quality enough to purchase multiple HQ headphones can't be arsed, then the general pubic really is going to think everything is just fine.

numpad0 2 days ago | parent [-]

That kind of quality limitations is not the point. Nor extra digits in images.

Generative AI outputs trigger uncanny valley discomfort that professionals and connoisseurs are better equipped to verbalize.

The quality question is what's the point of stuffs like that, or is it good, even safe, for us to consume.

anonzzzies 2 days ago | parent | prev [-]

I find almost all popular music that's made in the past 20+ years quite terrible. This is not worse. For people who enjoy this chewing gum stuff, which seems most of the population of earth, this is fine. And as such, this will be all popular music in the future; upload your voice, pick a style, generate 13 songs, go on tour to make money.

codedokode 2 days ago | parent [-]

> go on tour

Why go on tour if you can send an AI singer instead and if you cannot sing as good as it anyway?

anonzzzies 2 days ago | parent [-]

They want to believe it is human; however when the robots get good enough... That's further away though maybe.

ahofmann 2 days ago | parent | prev [-]

Suno is by far the best generated AI music I've heard. That said, it is hot garbage.

I've listened to both songs on my Bose QC ultra headphones, which are far from perfect headphones. But even on them, the female voice has unbearable resonances in the higher frequencies. The male voice sounds mostly ok, but has also something that sounds like compression artifacts (like mp3 compression, not loudness compression). All instruments in these songs have these problems. They sound somewhat like the real thing, but really badly recorded. Also, the mixing isn't any good.

It is still very impressive that AI can generate that. But if I would record my band and someone would create such a mix out of it, I would fire them immediately. Heck, I would be furious that they fucked up so bad and would try to get my money back.

So the two links you provided just confirm what I said.

CraftingLinks 2 days ago | parent | next [-]

I use Suno like a producer in a music studio hires musicians to bring ideas to life. I wish more features in Suno would empower music producers. I sample pieces, re-mix doodles, get ideas to continue my tracks... I can see the future, and as an amateur, it's just liberating and a lot of fun.

snapcaster 2 days ago | parent | prev [-]

Really interesting, haven't listened to their output with high quality speakers or anything like that. Do poorly made human recordings have this problem or is this currently a signal of AI generation?