Difference between revisions of "OpusTodo"

From XiphWiki
Jump to: navigation, search
(11 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 +
== 1.1-beta ==
 +
 +
* Tune mode switching decisions
 +
** Use SILK up to higher rates on voice
 +
** Adapt stereo SILK/CELT threshold based on stereo width
 +
* Tune hybrid rate allocation
 +
* Figure out how to use speech/music detection optimally
 +
** find optimal switching time (low energy/tonality)
 +
* Improve variable frame size
 +
* Tune transient detector?
 +
* Use ALLOC in tonality analysis
 +
* LOTS of testing
 +
 +
== Lower priority ==
 +
 +
* Handle packets with PLC frames followed by FEC
 +
* Better handling for the case where FEC has a different bandwidth than the current mode
 +
* PLC transitions on unprotected SILK-SILK bandwidth changes?
 +
 
== Spec ==
 
== Spec ==
* Ogg mapping
+
* Ogg mapping. See [[http://tools.ietf.org/html/draft-ietf-codec-oggopus IETF draft]]
 
* Matroska mapping. See: [[MatroskaOpus]]
 
* Matroska mapping. See: [[MatroskaOpus]]
* RTP payload format
+
* RTP payload format See [[http://tools.ietf.org/html/draft-spittka-payload-rtp-opus IETF draft]]
  
 
== Website ==
 
== Website ==
* De-uglify webpage
+
* De-uglify webpage - some suggestions: write about codecs obsoleted by OPUS (Speex, CELT, Vorbis(?), and the prop. ones), write about implementations (is there only one so far?), comparison table (Opus, Vorbis, Speex, ..., MP5) of features (channels, freq, bits per sample, license, language (C89), integer impl. (Vorbis decoder only, Opus YES, ...), future use in video files (Theora? Dirac? WebM? other future codecs...), audio files for storage (like Vorbis, no raw Opus defined, only inside OGG), ...
* Promotional material
+
* Promotional material (some nice free or Public domain sounds in Opus format)
  
 
== Other ==
 
== Other ==
Line 18: Line 37:
 
== Experiments ==
 
== Experiments ==
  
* Measure pitch post-filter offset (try without if (nbAvailableBytes<25) pf_threshold += QCONST16(.1f,15); )
 
 
* Test exp_analysis and void_my_warranty.patch
 
* Test exp_analysis and void_my_warranty.patch
  

Revision as of 09:58, 12 March 2013

1.1-beta

  • Tune mode switching decisions
    • Use SILK up to higher rates on voice
    • Adapt stereo SILK/CELT threshold based on stereo width
  • Tune hybrid rate allocation
  • Figure out how to use speech/music detection optimally
    • find optimal switching time (low energy/tonality)
  • Improve variable frame size
  • Tune transient detector?
  • Use ALLOC in tonality analysis
  • LOTS of testing

Lower priority

  • Handle packets with PLC frames followed by FEC
  • Better handling for the case where FEC has a different bandwidth than the current mode
  • PLC transitions on unprotected SILK-SILK bandwidth changes?

Spec

Website

  • De-uglify webpage - some suggestions: write about codecs obsoleted by OPUS (Speex, CELT, Vorbis(?), and the prop. ones), write about implementations (is there only one so far?), comparison table (Opus, Vorbis, Speex, ..., MP5) of features (channels, freq, bits per sample, license, language (C89), integer impl. (Vorbis decoder only, Opus YES, ...), future use in video files (Theora? Dirac? WebM? other future codecs...), audio files for storage (like Vorbis, no raw Opus defined, only inside OGG), ...
  • Promotional material (some nice free or Public domain sounds in Opus format)

Other

  • Oggz-validate (should also validate opus toc)

Opus-tools

  • A simple real time streaming example tool
  • Replaygain (half done— needs a gain tool)

Experiments

  • Test exp_analysis and void_my_warranty.patch


Future work

  • Smart automatic mode decision
  • psymodel based VBR
  • Remove copy in inverse MDCT
  • Save some float<->int conversions
  • Improvements to LP mode CBR (greg has some code)