From XiphWiki
Revision as of 16:11, 22 December 2005 by Jmspeex (talk | contribs)

Jump to: navigation, search

This page is meant to track ideas about low-delay, high-quality audio coding. The work has just started, so don't expect anything in the near future (or at all for that matter).

Signal types

There are many signal types that can be found:

  • Sinusoids
    • A few pure (or nearly pure) tones
  • Harmonic
    • Periodic waveforms (e.g. voice)
    • Many (sometimes closely spaced) harmonics
  • Shapred noise
    • Signals that are (or are indistinguishable) from filtered (coloured) white noise
  • Transients
    • Whatever does't fit above I guess

Signal analysis


Good when most of the energy is contained in a few sinusoids. May be problematic for very harmonic signals, e.g. a male voice may have close to a hundred harmonics in the full audio band.


Good for harmonic signals. Hard to estimate and code when extra sinusoids and noise are present.


Very general. Can code anything, but not very good at anything. High delay (2x frame size).


LPC + stochastic cb

Codec Structure Ideas

Sinusoidal + wavelet

  • Preemphasis
  • Extract as many sinusoids as possible
  • Wavelet transform
  • Code wavelet coefs using VQ

Sinusoidal, pitch and noise

  • Preemphasis
  • Joint pitch + sinusoidal estimation
  • LPC analysis
  • CELP-like coding of the residual (mainly noise)