Ghost: Difference between revisions

From XiphWiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 17: Line 17:


=== Sinusoidal ===
=== Sinusoidal ===
Good when most of the energy is contained in a few sinusoids. May be problematic for very harmonic signals, e.g. a male voice may have close to a hundred harmonics in the full audio band.
=== Pitch ===
=== Pitch ===
Good for harmonic signals. Hard to estimate and code when extra sinusoids and noise are present.
=== MDCT ===
=== MDCT ===
Very general. Can code anything, but not very good at anything. High delay (2x frame size).
=== Wavelets ===
=== Wavelets ===
=== LPC + stochastic cb ===
=== LPC + stochastic cb ===
== Codec Structure Ideas ==
=== Sinusoidal + wavelet ===
* Preemphasis
* Extract as many sinusoids as possible
* Wavelet transform
* Code wavelet coefs using VQ
=== Sinusoidal, pitch and noise ===
* Preemphasis
* Joint pitch + sinusoidal estimation
* LPC analysis
* CELP-like coding of the residual (mainly noise)

Revision as of 16:11, 22 December 2005

This page is meant to track ideas about low-delay, high-quality audio coding. The work has just started, so don't expect anything in the near future (or at all for that matter).

Signal types

There are many signal types that can be found:

  • Sinusoids
    • A few pure (or nearly pure) tones
  • Harmonic
    • Periodic waveforms (e.g. voice)
    • Many (sometimes closely spaced) harmonics
  • Shapred noise
    • Signals that are (or are indistinguishable) from filtered (coloured) white noise
  • Transients
    • Whatever does't fit above I guess

Signal analysis

Sinusoidal

Good when most of the energy is contained in a few sinusoids. May be problematic for very harmonic signals, e.g. a male voice may have close to a hundred harmonics in the full audio band.

Pitch

Good for harmonic signals. Hard to estimate and code when extra sinusoids and noise are present.

MDCT

Very general. Can code anything, but not very good at anything. High delay (2x frame size).

Wavelets

LPC + stochastic cb

Codec Structure Ideas

Sinusoidal + wavelet

  • Preemphasis
  • Extract as many sinusoids as possible
  • Wavelet transform
  • Code wavelet coefs using VQ

Sinusoidal, pitch and noise

  • Preemphasis
  • Joint pitch + sinusoidal estimation
  • LPC analysis
  • CELP-like coding of the residual (mainly noise)