Difference between revisions of "OpusTodo"

From XiphWiki
Line 1: Line 1:
== 1.1-beta ==
Line 10: Line 9:
* Improve variable frame size
* Tune transient detector?
* Use ALLOC in tonality analysis
* LOTS of testing

Revision as of 10:58, 12 March 2013

1.1-beta


  • Tune mode switching decisions
    • Use SILK up to higher rates on voice
    • Adapt stereo SILK/CELT threshold based on stereo width
  • Tune hybrid rate allocation
  • Figure out how to use speech/music detection optimally
    • Find optimal switching time (low energy/tonality)
  • Improve variable frame size
  • Tune transient detector?
  • Use ALLOC in tonality analysis
  • LOTS of testing

Lower priority

  • Handle packets with PLC frames followed by FEC
  • Better handling for the case where FEC has a different bandwidth than the current mode
  • PLC transitions on unprotected SILK-SILK bandwidth changes?



  • De-uglify webpage. Some suggestions:
    • Write about codecs obsoleted by Opus (Speex, CELT, Vorbis(?), and the proprietary ones)
    • Write about implementations (is there only one so far?)
    • Add a feature comparison table (Opus, Vorbis, Speex, ..., MP5) covering channels, frequency, bits per sample, license, language (C89), integer implementation (Vorbis decoder only, Opus yes, ...), future use in video files (Theora? Dirac? WebM? other future codecs...), audio files for storage (like Vorbis; no raw Opus defined, only inside Ogg), ...
  • Promotional material (some nice free or Public domain sounds in Opus format)


  • oggz-validate (should also validate the Opus TOC)
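
As a rough illustration of what validating the TOC involves, the sketch below decodes the first byte of an Opus packet following the layout in RFC 6716, section 3.1: a 5-bit configuration number, a stereo flag, and a 2-bit frame count code. The names OpusToc and parse_opus_toc are hypothetical and are not part of oggz-validate or libopus. This only decodes the byte; a real validator would also have to check the packet length rules that depend on the frame count code.

```python
from typing import NamedTuple


class OpusToc(NamedTuple):
    """Decoded fields of an Opus TOC byte (hypothetical helper type)."""
    mode: str              # "SILK", "Hybrid" or "CELT"
    bandwidth: str         # "NB", "MB", "WB", "SWB" or "FB"
    frame_ms: float        # duration of one frame in milliseconds
    stereo: bool           # True when the stereo flag is set
    frame_count_code: int  # 0..3, how frames are packed in the packet


def parse_opus_toc(packet: bytes) -> OpusToc:
    """Decode the TOC byte at the start of an Opus packet."""
    if not packet:
        raise ValueError("an Opus packet must contain at least the TOC byte")
    toc = packet[0]
    config = toc >> 3          # top five bits: configuration number 0..31
    stereo = bool(toc & 0x04)  # next bit: mono (0) or stereo (1)
    code = toc & 0x03          # low two bits: frame count code

    if config < 12:
        # Configurations 0..11: SILK-only, four frame sizes per bandwidth.
        mode = "SILK"
        bandwidth = ("NB", "MB", "WB")[config // 4]
        frame_ms = (10.0, 20.0, 40.0, 60.0)[config % 4]
    elif config < 16:
        # Configurations 12..15: hybrid, two frame sizes per bandwidth.
        mode = "Hybrid"
        bandwidth = ("SWB", "FB")[(config - 12) // 2]
        frame_ms = (10.0, 20.0)[config % 2]
    else:
        # Configurations 16..31: CELT-only, four frame sizes per bandwidth.
        mode = "CELT"
        bandwidth = ("NB", "WB", "SWB", "FB")[(config - 16) // 4]
        frame_ms = (2.5, 5.0, 10.0, 20.0)[config % 4]

    return OpusToc(mode, bandwidth, frame_ms, stereo, code)
```

For example, parse_opus_toc(bytes([0xFC])) decodes configuration 31 with the stereo flag set and frame count code 0, that is, a single 20 ms fullband CELT frame in stereo.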


  • A simple real time streaming example tool
  • ReplayGain (half done; needs a gain tool)
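
One possible shape for the missing gain tool, sketched under the assumption that it would adjust the output gain field of the OpusHead identification header used by the Ogg Opus mapping (a signed 16-bit little-endian value in Q7.8 dB at byte offset 16). The function names are hypothetical, and the code works on the raw header packet only: a real tool would also have to rewrite the Ogg page that carries the header and recompute its checksum.

```python
import struct

GAIN_OFFSET = 16   # byte offset of the output gain field inside OpusHead
MIN_HEAD_SIZE = 19  # magic (8) + version, channels, pre-skip, rate, gain, mapping


def _check_head(head: bytes) -> None:
    """Reject buffers that cannot be an OpusHead packet."""
    if len(head) < MIN_HEAD_SIZE or head[:8] != b"OpusHead":
        raise ValueError("not an OpusHead packet")


def get_output_gain_db(head: bytes) -> float:
    """Return the header output gain in dB (the field is stored as Q7.8)."""
    _check_head(head)
    (raw,) = struct.unpack_from("<h", head, GAIN_OFFSET)
    return raw / 256.0


def set_output_gain_db(head: bytes, gain_db: float) -> bytes:
    """Return a copy of the header with the output gain set to gain_db."""
    _check_head(head)
    raw = round(gain_db * 256)
    if not -32768 <= raw <= 32767:
        raise ValueError("gain does not fit in a signed Q7.8 field")
    out = bytearray(head)
    struct.pack_into("<h", out, GAIN_OFFSET, raw)
    return bytes(out)
```

Applying set_output_gain_db(head, -3.5) to a header leaves every byte except the two gain bytes unchanged, so the rest of the stream does not need to be re-encoded.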


  • Test exp_analysis and void_my_warranty.patch

Future work

  • Smart automatic mode decision
  • Psymodel-based VBR
  • Remove copy in inverse MDCT
  • Save some float<->int conversions
  • Improvements to LP mode CBR (Greg has some code)