OpusTodo: Difference between revisions

From XiphWiki
Jump to navigation Jump to search
No edit summary
No edit summary
(39 intermediate revisions by 6 users not shown)
Line 1: Line 1:
== Code ==
== 1.1-beta ==


=== For IETF draft ===
* Tune mode switching decisions
* Code cleanup (any left?)
** <s>Use SILK up to higher rates on voice</s>
* Multi-channel signalling (done, needs more testing)
** <s>Adapt stereo SILK/CELT threshold based on stereo width</s>
* Make opus-compare fail for single seriously trashed frames
* Tune hybrid rate allocation?
* Figure out how to use speech/music detection optimally
** find optimal switching time (low energy/tonality)
* Improve variable frame size
* Tune transient detector?
* Use ALLOC in tonality analysis
* LOTS of testing


== SILK issues ==
* <s>CLANG ARITHMETIC UNDEFINED at <silk/decode_core.c, (108:40)> : Op: *, Reason : Signed Multiplication Overflow, BINARY OPERATION: left (int32): 916258817 right (int32): -3 </s>
* <s>CLANG ARITHMETIC UNDEFINED at <silk/decode_core.c, (108:40)> : Op: +, Reason : Signed Addition Overflow, BINARY OPERATION: left (int32): -418535217 right (int32): -1832517634</s>
* silk warning: comparison is always true due to limited range of data type with assert
* <s>Activity interface / silk_toc (currently unreachable)</s>
=== Later ===
* Usable command-line tools (<s>opus is a major regression from libcelt right now</s>)
* Smart automatic mode decision
* psymodel based VBR
* Remove copy in inverse MDCT
* Save some float<->int conversions


== Spec ==
== Spec ==
* Ogg mapping. See [[http://tools.ietf.org/html/draft-ietf-codec-oggopus IETF draft]]
* Matroska mapping. See: [[MatroskaOpus]]
* RTP payload format See [[http://tools.ietf.org/html/draft-spittka-payload-rtp-opus IETF draft]]


* Finish codec draft
== Website ==
* Ogg mapping (including multi-channel). See: [[OggOpus]]
* De-uglify webpage - some suggestions: write about codecs obsoleted by OPUS (Speex, CELT, Vorbis(?), and the prop. ones), write about implementations (is there only one so far?), comparison table (Opus, Vorbis, Speex, ..., MP5) of features (channels, freq, bits per sample, license, language (C89), integer impl. (Vorbis decoder only, Opus YES, ...), future use in video files (Theora? Dirac? WebM? other future codecs...), audio files for storage (like Vorbis, no raw Opus defined, only inside OGG), ...  
* RTP payload format
* Promotional material (some nice free or Public domain sounds in Opus format)


== Other ==
== Other ==


* Logo
* Test vectors
* Listening tests
* Documentation (at a minimum every exported symbol should have complete and accurate documentation)
* Add content to opus-codec.org
* Oggz-validate (should also validate opus toc)
* Oggz-validate (should also validate opus toc)
** The above documentation
** Presentations
** Examples and test results  (hyperlink to Monty's demo, gmaxwell's HA results page, etc)
* Useful example software (e.g. streaming software that works correctly) (opus-tools in xiph git)
** <s>Support for resampling in tools</s>


== Third party software ==
== Opus-tools ==
* Support in ekiga
* A simple real time streaming example tool
* Support in mumble
* Replaygain (half done— needs a gain tool)
* Support in asterisk
 
* Support in firefox (rtcweb and in ogg)
== Experiments ==
* Support in VLC
 
* Support in ogg123
* Test exp_analysis and void_my_warranty.patch
* Support in ffmpeg
 
* Support in rockbox
== Future work ==
* Support in foobar2000
* psymodel based VBR
* Support in gstreamer
* Remove copy in inverse MDCT
* Support in mplayer
* Save some float<->int conversions
* Support in xmms
* Improvements to LP mode CBR (greg has some code)
* Support in oggdsf
* Better handling for the case where FEC has a different bandwidth than the current mode
* Support in xiphqt
* PLC transitions on unprotected SILK-SILK bandwidth changes?

Revision as of 06:20, 10 April 2013

1.1-beta

  • Tune mode switching decisions
    • Use SILK up to higher rates on voice
    • Adapt stereo SILK/CELT threshold based on stereo width
  • Tune hybrid rate allocation?
  • Figure out how to use speech/music detection optimally
    • find optimal switching time (low energy/tonality)
  • Improve variable frame size
  • Tune transient detector?
  • Use ALLOC in tonality analysis
  • LOTS of testing


Spec

Website

  • De-uglify webpage - some suggestions: write about codecs obsoleted by OPUS (Speex, CELT, Vorbis(?), and the prop. ones), write about implementations (is there only one so far?), comparison table (Opus, Vorbis, Speex, ..., MP5) of features (channels, freq, bits per sample, license, language (C89), integer impl. (Vorbis decoder only, Opus YES, ...), future use in video files (Theora? Dirac? WebM? other future codecs...), audio files for storage (like Vorbis, no raw Opus defined, only inside OGG), ...
  • Promotional material (some nice free or Public domain sounds in Opus format)

Other

  • Oggz-validate (should also validate opus toc)

Opus-tools

  • A simple real time streaming example tool
  • Replaygain (half done— needs a gain tool)

Experiments

  • Test exp_analysis and void_my_warranty.patch

Future work

  • psymodel based VBR
  • Remove copy in inverse MDCT
  • Save some float<->int conversions
  • Improvements to LP mode CBR (greg has some code)
  • Better handling for the case where FEC has a different bandwidth than the current mode
  • PLC transitions on unprotected SILK-SILK bandwidth changes?