How do you bring back an audio file from compressiong for more quality. - How to make something less bad.

PointOfNoReturnPoint · Nov 14, 2024

This is a song I loved hearing when it came out, but you can hear it is compressed to save space as this was a flash game in the mid 2000s so size was important. Without spending any money how would i bring the quality back that was lost form compression?

Vecr · Nov 14, 2024

Ya don't. Read the initial plane ride scene in Zendegi. You might be able to do a bit of tuning to make it subjectively sound better, maybe do stem separation to take it apart and put it back together, but at some point you're just going to have to pay someone to make a cover.

Pee Cola · Nov 14, 2024

Vecr said:
Ya don't.

Yep, because you can't replace what was taken out.

Why-do-we-need-a-lossy-format-when-we-have-lossless_png.jpg

Source (A)

This visual image of lossless digital music (in this case a CD) vs the 320kbps MP3 file says it all. Note the thinning between 16-20kHz and the brick wall at 20kHz with the lossy file, whereas the highs from the original CD are all intact.

That being said, this particular CD looks like a victim of the Loudness Wars, as evidenced by the brick wall at 22kHz.

The lower the bitrate and/or sample rate, the worse it gets.

Squishie PP · Nov 15, 2024

What was said above used to be true, but now there are neural networks that can attempt to reconstruct a higher-fidelity signal. Here is an example of one: https://kuleshov.github.io/audio-super-res/

Vecr · Nov 15, 2024

Squishie PP said:
What was said above used to be true, but now there are neural networks that can attempt to reconstruct a higher-fidelity signal. Here is an example of one: https://kuleshov.github.io/audio-super-res/

That's what I was talking about with the stem separation, you would stem separate, run something like that, then mix it back together. It might sound better, but the original information is just gone.

PointOfNoReturnPoint · Nov 15, 2024

Squishie PP said:
What was said above used to be true, but now there are neural networks that can attempt to reconstruct a higher-fidelity signal. Here is an example of one: https://kuleshov.github.io/audio-super-res/

I saw that then got linked to github, but then thats when it fell apart idk what i am doing...Thanks for the input but idk how to use this. Most thing's i am finding are plug ins for DAW's or an API, so I am pretty much lost until a GUI comes around.

Squishie PP · Nov 15, 2024

Vecr said:
That's what I was talking about with the stem separation, you would stem separate, run something like that, then mix it back together. It might sound better, but the original information is just gone.

Better than nothing, I guess. There is also a model trained for stem separation here: https://github.com/deezer/spleeter

PointOfNoReturnPoint said:
I saw that then got linked to github, but then thats when it fell apart idk what i am doing...Thanks for the input but idk how to use this. Most thing's i am finding are plug ins for DAW's or an API, so I am pretty much lost until a GUI comes around.

Theres a cloud-based utility here that will let you do something similar: https://replicate.com/nateraw/audio-super-resolution

But of course it costs money

This is sadly one of the crappy parts of the machine learning community's over-reliance on Python; it makes it far too difficult for end users to use most of the software created in this space.

PointOfNoReturnPoint · Nov 15, 2024

Squishie PP said:
Better than nothing, I guess. There is also a model trained for stem separation here: https://github.com/deezer/spleeter

Theres a cloud-based utility here that will let you do something similar: https://replicate.com/nateraw/audio-super-resolution

But of course it costs money

This is sadly one of the crappy parts of the machine learning community's over-reliance on Python; it makes it far too difficult for end users to use most of the software created in this space.

Sweet thanks! I really only need a Gui to help do this more than anything, I don't code anyway.

Squishie PP · Nov 15, 2024

PointOfNoReturnPoint said:
Sweet thanks! I really only need a Gui to help do this more than anything, I don't code anyway.

Its honestly not that hard to use once you set it up. Only the installation process is a pain for those not familiar with python.

Dawdler · Nov 15, 2024

lossless codec actually isn't lossless overall since ADC's aren't perfect. 44kHz is just 2 samples at 20kHz frequency, not a lot at all.
Master audio gets taken in at 192kHz and 44 is just "Peasant grade" music

Vecr · Nov 15, 2024

Dawdler said:
lossless codec actually isn't lossless overall since ADC's aren't perfect. 44kHz is just 2 samples at 20kHz frequency, not a lot at all.
Master audio gets taken in at 192kHz and 44 is just "Peasant grade" music

Unless you plan on slowing it down there's not much loss if any at all. It's not perfect, but it's not anywhere near the problem OP has. If it was just ADC issues, I wouldn't be able to hear anything wrong with his song on my (somewhat terrible) setup. I can hear the problem all right, so it's much worse than that.

https://en.wikipedia.org/wiki/Nyquist_frequency

Dawdler said:
It's actually pretty horrible as 2 samples are just enough for rough approximation and nothing more

It really isn't. Maybe you have a super fancy DAC that really runs at that rate (and won't just smooth everything over with capacitors), but on both of my mid-end DACs I can't tell anything wrong with the quality on a well encoded song. This song, on the other hand, is terrible.

Dawdler · Nov 15, 2024

It's actually pretty horrible as 2 samples are just enough for rough approximation and nothing more

JohnnyG · Nov 15, 2024

Dawdler said:
lossless codec actually isn't lossless overall since ADC's aren't perfect. 44kHz is just 2 samples at 20kHz frequency, not a lot at all.
Master audio gets taken in at 192kHz and 44 is just "Peasant grade" music

Dawdler said:
It's actually pretty horrible as 2 samples are just enough for rough approximation and nothing more

Two points uniquely define a sine wave, which is why 44khz was chosen to begin with. Unlike bit depth, where there's a whole range of quality and "good enough" seldom is, sample rate is much more deterministic.

https://en.m.wikipedia.org/wiki/Nyquist_frequency

PointOfNoReturnPoint · Nov 15, 2024

Squishie PP said:
Its honestly not that hard to use once you set it up. Only the installation process is a pain for those not familiar with python.

Thats...kind of the problem, I do not know how to set it up..This is the whole issue with me right now.

Dawdler · Nov 16, 2024

JohnnyG said:
Two points uniquely define a sine wave, which is why 44khz was chosen to begin with. Unlike bit depth, where there's a whole range of quality and "good enough" seldom is, sample rate is much more deterministic.

They don't define a sine wave but you are approximating it and also defining the input as a mixed sine wave.
For any measurements 10 points per Hz are barely enough, going to 100 for anything meaningful like harmonics hidden in the sine.
With 2 points the result is same if you are imputing a triangle or a sine and you are completely lost on the phase of the sound wave.
Luckily for the sampling, most natural sounds are some sort of sine wave with harmonics on top so a lot of deficiencies are hidden.

Squishie PP · Nov 16, 2024

On second thought, the audio super-resolution model is probably not the best model for this application, since the problem in this case isn't low resolution. I found a paper with several model architectures that actually do solve the problem of reconstructing heavily compressed audio here: https://arxiv.org/pdf/2207.01667

Edit: Here is a page full of examples of what their model can do: https://sonycslparis.github.io/restoration_mdpi_suppl_mat/
It is honestly pretty impressive.

How do you bring back an audio file from compressiong for more quality. - How to make something less bad.

PointOfNoReturnPoint

Kermit at a rave mofucka

Vecr

DM if I don't respond.

Pee Cola

you got the right one baby uh-huh

Squishie PP

Vecr

DM if I don't respond.

PointOfNoReturnPoint

Kermit at a rave mofucka

Squishie PP

PointOfNoReturnPoint

Kermit at a rave mofucka

Squishie PP

Dawdler

Planet is 71% water and you keep calling it earth

Vecr

DM if I don't respond.

Dawdler

Planet is 71% water and you keep calling it earth

JohnnyG

PointOfNoReturnPoint

Kermit at a rave mofucka

Dawdler

Planet is 71% water and you keep calling it earth

Squishie PP