Ghost

From XiphWiki
Revision as of 17:11, 22 December 2005 by Jmspeex (talk | contribs)

This page is meant to track ideas about low-delay, high-quality audio coding. The work has just started, so don't expect anything in the near future (or at all for that matter).

Signal types

Audio signals can be roughly divided into the following types:

  • Sinusoids
    • A few pure (or nearly pure) tones
  • Harmonic
    • Periodic waveforms (e.g. voice)
    • Many (sometimes closely spaced) harmonics
  • Shaped noise
    • Signals that are (or are indistinguishable from) filtered (coloured) white noise
  • Transients
    • Whatever doesn't fit above, I guess

Signal analysis

Sinusoidal

Good when most of the energy is contained in a few sinusoids. May be problematic for very harmonic signals: for example, a male voice may have close to a hundred harmonics in the full audio band.
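As a rough illustration of the "few sinusoids" case, here is a minimal sketch that picks the strongest spectral peaks from a plain DFT. The function names (`dft_half`, `strongest_sinusoids`) and the test signal are made up for this example, and peak picking on a DFT is only one of several ways sinusoids could be extracted; nothing here is a decided design.

```python
import cmath
import math

def dft_half(x):
    """Naive DFT, bins 0..N/2 only (enough for a real input signal)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n // 2 + 1)]

def strongest_sinusoids(x, count):
    """Return (bin, amplitude) for the `count` largest local maxima of |DFT|."""
    n = len(x)
    mag = [abs(c) for c in dft_half(x)]
    peaks = [(k, 2.0 * mag[k] / n) for k in range(1, len(mag) - 1)
             if mag[k] > mag[k - 1] and mag[k] >= mag[k + 1]]
    peaks.sort(key=lambda p: p[1], reverse=True)
    return peaks[:count]

# Hypothetical test signal: two pure tones that fall exactly on DFT bins.
N = 256
signal = [math.sin(2 * math.pi * 10 * t / N) + 0.5 * math.sin(2 * math.pi * 37 * t / N)
          for t in range(N)]
peaks = strongest_sinusoids(signal, 2)
```

With a hundred closely spaced harmonics, each peak would need its own frequency, amplitude and phase, which is where this approach starts to look expensive.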

Pitch

Good for harmonic signals. Hard to estimate and code when extra sinusoids and noise are present.
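One common way to estimate pitch is to look for the lag that maximises the normalised autocorrelation. The sketch below assumes that approach; the function name, the lag range and the synthetic five-harmonic signal are all illustrative choices, not part of any Ghost design.

```python
import math

def estimate_pitch_period(x, min_lag, max_lag):
    """Return (lag, score) maximising normalised autocorrelation in [min_lag, max_lag]."""
    best_lag, best_score = min_lag, -1.0
    for lag in range(min_lag, max_lag + 1):
        cur = x[lag:]               # current samples
        past = x[:len(x) - lag]     # same samples, `lag` earlier
        cross = sum(c * p for c, p in zip(cur, past))
        energy = math.sqrt(sum(c * c for c in cur) * sum(p * p for p in past))
        score = cross / energy if energy > 0.0 else 0.0
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score

# Hypothetical harmonic signal: 5 harmonics, period of 80 samples.
PERIOD = 80
signal = [sum(math.sin(2 * math.pi * k * t / PERIOD) / k for k in range(1, 6))
          for t in range(640)]
lag, score = estimate_pitch_period(signal, 40, 120)
```

The search range is deliberately kept below twice the true period, because lags at multiples of the period score just as well on a clean signal. Added sinusoids and noise make this kind of ambiguity worse, which is part of why pitch is hard to estimate in practice.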

MDCT

Very general: it can code anything, but is not particularly good at anything. High delay (2x frame size).
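To make the overlap structure concrete, here is a small direct-form MDCT/IMDCT sketch with a sine window. The sine window, the 2/N scaling and all names are assumptions chosen for the example. It shows that a sample is only reconstructed once both overlapping blocks that cover it have been decoded, which is where the extra delay comes from.

```python
import math

def sine_window(half):
    """Sine window of length 2*half; satisfies w[n]^2 + w[n+half]^2 = 1."""
    return [math.sin(math.pi * (n + 0.5) / (2 * half)) for n in range(2 * half)]

def _basis(n, k, half):
    return math.cos(math.pi / half * (n + 0.5 + half / 2) * (k + 0.5))

def mdct(block, window):
    """2*half windowed input samples -> half coefficients."""
    half = len(block) // 2
    return [sum(window[n] * block[n] * _basis(n, k, half) for n in range(2 * half))
            for k in range(half)]

def imdct(coefs, window):
    """half coefficients -> 2*half windowed, time-aliased output samples."""
    half = len(coefs)
    return [window[n] * (2.0 / half) * sum(coefs[k] * _basis(n, k, half) for k in range(half))
            for n in range(2 * half)]

def overlap_add_roundtrip(signal, half):
    """Code blocks of 2*half samples with hop `half` and overlap-add the results."""
    window = sine_window(half)
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - 2 * half + 1, half):
        rec = imdct(mdct(signal[start:start + 2 * half], window), window)
        for n, value in enumerate(rec):
            out[start + n] += value
    return out

HALF = 8
signal = [math.sin(0.3 * t) + 0.1 * t for t in range(64)]
rebuilt = overlap_add_roundtrip(signal, HALF)
# Only samples covered by two blocks are alias-free (skip the first and last HALF).
max_err = max(abs(a - b) for a, b in zip(signal[HALF:-HALF], rebuilt[HALF:-HALF]))
```

The first and last `HALF` samples are covered by a single block, so they still contain aliasing and are left out of the error check.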

Wavelets

LPC + stochastic codebook

Codec Structure Ideas

Sinusoidal + wavelet

  • Preemphasis
  • Extract as many sinusoids as possible
  • Wavelet transform
  • Code wavelet coefs using VQ
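The last two steps above could look roughly like the sketch below. The Haar wavelet, the nearest-neighbour search and the tiny codebook are placeholders picked for brevity. The page does not say which wavelet or which VQ scheme would be used, so treat every name and value here as hypothetical.

```python
import math

SCALE = math.sqrt(0.5)

def haar_forward(x, levels):
    """Multi-level orthonormal Haar transform; len(x) must be divisible by 2**levels."""
    approx, details = list(x), []
    for _ in range(levels):
        pairs = [(approx[2 * i], approx[2 * i + 1]) for i in range(len(approx) // 2)]
        details.append([(a - b) * SCALE for a, b in pairs])
        approx = [(a + b) * SCALE for a, b in pairs]
    return approx, details

def haar_inverse(approx, details):
    """Invert haar_forward exactly (up to rounding)."""
    current = list(approx)
    for detail in reversed(details):
        rebuilt = []
        for a, d in zip(current, detail):
            rebuilt.append((a + d) * SCALE)
            rebuilt.append((a - d) * SCALE)
        current = rebuilt
    return current

def vq_encode(vector, codebook):
    """Index of the codeword closest to `vector` in squared Euclidean distance."""
    return min(range(len(codebook)),
               key=lambda i: sum((v - c) ** 2 for v, c in zip(vector, codebook[i])))

# Hypothetical residual left after sinusoid extraction.
residual = [0.3, -0.1, 0.4, 0.2, -0.5, 0.0, 0.1, 0.6]
approx, details = haar_forward(residual, 3)
restored = haar_inverse(approx, details)

# Hypothetical 2-dimensional codebook.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]
index = vq_encode([0.9, 1.2], codebook)
```

In a real coder the quantisation loss would come from replacing groups of coefficients by their codebook indices; the transform itself is lossless.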

Sinusoidal, pitch and noise

  • Preemphasis
  • Joint pitch + sinusoidal estimation
  • LPC analysis
  • CELP-like coding of the residual (mainly noise)
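The preemphasis and LPC analysis steps can be sketched as follows, assuming the usual first-order preemphasis filter and the autocorrelation method with the Levinson-Durbin recursion. The coefficient value 0.9, the function names and the test signal are assumptions for illustration only. The residual computed at the end is what a CELP-like stage would then have to code.

```python
def preemphasis(x, coef=0.9):
    """y[n] = x[n] - coef * x[n-1]; coef = 0.9 is an assumed value."""
    return [x[0]] + [x[n] - coef * x[n - 1] for n in range(1, len(x))]

def autocorrelation(x, max_lag):
    return [sum(x[t] * x[t - lag] for t in range(lag, len(x)))
            for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve for predictor a[1..order] with x[n] ~ sum(a[i] * x[n-i]).

    Returns (coefficients, prediction error energy).
    """
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        updated = a[:]
        updated[i] = k
        for j in range(1, i):
            updated[j] = a[j] - k * a[i - j]
        a = updated
        err *= 1.0 - k * k
    return a[1:], err

def lpc_residual(x, coefs):
    """Prediction error e[n] = x[n] - sum(coefs[i-1] * x[n-i])."""
    return [x[n] - sum(c * x[n - i] for i, c in enumerate(coefs, start=1) if n - i >= 0)
            for n in range(len(x))]

# Hypothetical test signal: impulse response of a one-pole filter, x[n] = 0.9**n.
signal = [0.9 ** n for n in range(200)]
coefs, err = levinson_durbin(autocorrelation(signal, 2), 2)
residual = lpc_residual(signal, coefs)
```

For this test signal the analysis recovers a predictor close to `[0.9, 0.0]`, and the residual collapses to roughly a single impulse at the start.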