This page is meant to track ideas about low-delay, high-quality audio coding. The work has just started, so don't expect anything in the near future (or at all for that matter).
There are many signal types that can be found:
- A few pure (or nearly pure) tones
- Periodic waveforms (e.g. voice)
- Many (sometimes closely spaced) harmonics
- Shapred noise
- Signals that are (or are indistinguishable) from filtered (coloured) white noise
- Whatever does't fit above I guess
Good when most of the energy is contained in a few sinusoids. May be problematic for very harmonic signals, e.g. a male voice may have close to a hundred harmonics in the full audio band.
Good for harmonic signals. Hard to estimate and code when extra sinusoids and noise are present.
Very general. Can code anything, but not very good at anything. High delay (2x frame size).
LPC + stochastic cb
Codec Structure Ideas
Sinusoidal + wavelet
- Extract as many sinusoids as possible
- Wavelet transform
- Code wavelet coefs using VQ
Sinusoidal, pitch and noise
- Joint pitch + sinusoidal estimation
- LPC analysis
- CELP-like coding of the residual (mainly noise)