OpusTodo: Difference between revisions

From XiphWiki
Jump to navigation Jump to search
(45 intermediate revisions by 5 users not shown)
Line 1: Line 1:
== Code ==
== 1.0.2 ==


=== For IETF draft ===
* multi-frame FEC/PLC fix
* Code cleanup (any left?)
* PLC fix
* Multi-channel signalling (done, needs more testing)
* opus_packet_get_duration()
* Make opus-compare fail for single seriously trashed frames
* OPUS_GET_FRAME_SIZE() for decoder??
* <strike>Add license headers to all dist files</strike> DONE
* <strike>Fix remaining build issues with MSVC</strike> DONE
* <strike>Add OPUS_EXPORT override for chrome</strike> DONE


== SILK issues ==
== 1.1-beta ==
* PLC buffer not fully initialized (fix needs verifying)
* Mid and side using different sampling rates (fix needs verifying)
* <s>LLBR stereo issue (has a proposed fix)</s>
** <s>Introduces prefill bug (fixed on greg's tree)</s>
*<s>Artefacts on SILK mono<->stereo switching</s>
* <s>Artefacts on SILK frame size switching (e.g. 960,480 glitches)</s>
* Reduce SILK bandpass switching artefacts
* <s>Use of signed overflow (undefined in C), intentionally and otherwise. </s>
* Encoder triggers DTX randomly (even if not enabled) for 40/60 ms stereo frames
* <s>CLANG ARITHMETIC UNDEFINED at <silk/silk_NSQ_del_dec.c, (652:33)> : Op: *, Reason : Signed Multiplication Overflow, BINARY OPERATION: left (int32): 90005 right (int32): -25578</s>
* CLANG ARITHMETIC UNDEFINED at <silk/decode_core.c, (108:40)> : Op: *, Reason : Signed Multiplication Overflow, BINARY OPERATION: left (int32): 916258817 right (int32): -3
* CLANG ARITHMETIC UNDEFINED at <silk/decode_core.c, (108:40)> : Op: +, Reason : Signed Addition Overflow, BINARY OPERATION: left (int32): -418535217 right (int32): -1832517634
* CLANG ARITHMETIC UNDEFINED at <./silk/Inlines.h, (120:13)> : Op: -=, Reason : Signed Subtraction Overflow, BINARY OPERATION: left (int32): 2147454976 right (int32): -2147467848
<!-- ./test_opus voip 48000 2 32000 -bandwidth WB -framesize 10 /home/gmaxwell/big-fb.sw /dev/null -->
* CLANG ARITHMETIC UNDEFINED at <silk/LPC_analysis_filter.c, (68:25)> : Op: +, Reason : Signed Addition Overflow, BINARY OPERATION: left (int32): -2053682997 right (int32): -96356645
* CLANG ARITHMETIC UNDEFINED at <silk/LPC_analysis_filter.c, (67:25)> : Op: +, Reason : Signed Addition Overflow, BINARY OPERATION: left (int32): 2144927654 right (int32): 9275188
* CLANG ARITHMETIC UNDEFINED at <silk/LPC_analysis_filter.c, (63:21)> : Op: +, Reason : Signed Addition Overflow, BINARY OPERATION: left (int32): -1503335005 right (int32): -978520921
* CLANG ARITHMETIC UNDEFINED at <silk/LPC_analysis_filter.c, (64:21)> : Op: +, Reason : Signed Addition Overflow, BINARY OPERATION: left (int32): 1813111370 right (int32): 545673470
* CLANG ARITHMETIC UNDEFINED at <silk/LPC_analysis_filter.c, (65:21)> : Op: +, Reason : Signed Addition Overflow, BINARY OPERATION: left (int32): 2121005902 right (int32): 274731600
* CLANG ARITHMETIC UNDEFINED at <silk/LPC_analysis_filter.c, (72:21)> : Op: -, Reason : Signed Subtraction Overflow, BINARY OPERATION: left (int32): -79495168 right (int32): 2131398803
<!-- voip 16000 2 16000 -framesize 20 /home/gmaxwell/big-fb.sw /dev/null -->
* Review Tim's "LSB with no pulses" fix
* Fix decoder-side resampling delay for rates other than 48 kHz and document behavior.
* Find_poly improvements
* <s>Rename PLC.c</s>
* silk warning: comparison is always true due to limited range of data type with assert
* Fatal (internal) error in silk/encode_indices.c, line 189: assertion failed: ( psIndices->contourIndex < 34 && psEncC->fs_kHz > 8 && psEncC->nb_subfr == 4 ) || ( psIndices->contourIndex < 11 && psEncC->fs_kHz == 8 && psEncC->nb_subfr == 4 ) || ( psIndices->contourIndex < 12 && psEncC->fs_kHz > 8 && psEncC->nb_subfr == 2 ) || ( psIndices->contourIndex < 3 && psEncC->fs_kHz == 8 && psEncC->nb_subfr == 2 )
* <s>Fatal (internal) error in silk/encode_indices.c, line 130: assertion failed: psEncC->useInterpolatedNLSFs == 1 || psIndices->NLSFInterpCoef_Q2 == ( 1 << 2 )</s>
* Fatal (internal) error in silk/encode_indices.c, line 209: assertion failed: psIndices->LTPIndex[ k ] >= 0 && psIndices->LTPIndex[ k ] < ( 8 << psIndices->PERIndex )


=== Later ===
* tune transient detector
* <strike>Exposed CELT constrained VBR</strike>
* variable frame size?
* <strike>Fixed-point build</strike>
* LOTS of testing
* <strike>Fix build system (right now it fails to build shared libraries, drops .o files all over)</strike>
* re-tune hybrid rate allocation
* <strike>Floating point API</strike>
* re-tune mode switching decisions
* Usable command-line tools (<s>opus is a major regression from libcelt right now</s>)
* figure out how to use speech/music detection optimally
* Smart automatic mode decision
* everything from 1.0.2
* psymodel based VBR
 
* Remove copy in inverse MDCT
== Lower priority ==
* Save some float<->int conversions
 
* Handle packets with PLC frames followed by FEC
* Better handling for the case where FEC has a different bandwidth than the current mode
* PLC transitions on unprotected SILK-SILK bandwidth changes?


== Spec ==
== Spec ==
* Ogg mapping. See [[http://tools.ietf.org/html/draft-ietf-codec-oggopus IETF draft]]
* Matroska mapping. See: [[MatroskaOpus]]
* RTP payload format See [[http://tools.ietf.org/html/draft-spittka-payload-rtp-opus IETF draft]]


* Finish codec draft
== Website ==
* Ogg mapping (including multi-channel). See: [[OggOpus]]
* De-uglify webpage - some suggestions: write about codecs obsoleted by OPUS (Speex, CELT, Vorbis(?), and the prop. ones), write about implementations (is there only one so far?), comparison table (Opus, Vorbis, Speex, ..., MP5) of features (channels, freq, bits per sample, license, language (C89), integer impl. (Vorbis decoder only, Opus YES, ...), future use in video files (Theora? Dirac? WebM? other future codecs...), audio files for storage (like Vorbis, no raw Opus defined, only inside OGG), ...  
* RTP payload format
* Promotional material (some nice free or Public domain sounds in Opus format)


== Other ==
== Other ==


* Logo
* Test vectors
* Listening tests
* Documentation (at a minimum every exported symbol should have complete and accurate documentation)
* Add content to opus-codec.org
* Oggz-validate (should also validate opus toc)
* Oggz-validate (should also validate opus toc)
** The above documentation
** Presentations
** Examples and test results  (hyperlink to Monty's demo, gmaxwell's HA results page, etc)
* Useful example software (e.g. streaming software that works correctly) (opus-tools in xiph git)
** <s>Support for resampling in tools</s>


== Third party software ==
== Opus-tools ==
* Support in ekiga
* A simple real time streaming example tool
* Support in mumble
* Replaygain (half done— needs a gain tool)
* Support in asterisk
 
* Support in firefox (rtcweb and in ogg)
== Experiments ==
* Support in VLC
 
* Support in ogg123
* Test exp_analysis and void_my_warranty.patch
* Support in ffmpeg
 
* Support in rockbox
 
* Support in foobar2000
== Future work ==
* Support in gstreamer
* Smart automatic mode decision
* Support in mplayer
* psymodel based VBR
* Support in xmms
* Remove copy in inverse MDCT
* Support in oggdsf
* Save some float<->int conversions
* Support in xiphqt
* Improvements to LP mode CBR (greg has some code)

Revision as of 09:11, 22 December 2012

1.0.2

  • multi-frame FEC/PLC fix
  • PLC fix
  • opus_packet_get_duration()
  • OPUS_GET_FRAME_SIZE() for decoder??
  • Add license headers to all dist files DONE
  • Fix remaining build issues with MSVC DONE
  • Add OPUS_EXPORT override for chrome DONE

1.1-beta

  • tune transient detector
  • variable frame size?
  • LOTS of testing
  • re-tune hybrid rate allocation
  • re-tune mode switching decisions
  • figure out how to use speech/music detection optimally
  • everything from 1.0.2

Lower priority

  • Handle packets with PLC frames followed by FEC
  • Better handling for the case where FEC has a different bandwidth than the current mode
  • PLC transitions on unprotected SILK-SILK bandwidth changes?

Spec

Website

  • De-uglify webpage - some suggestions: write about codecs obsoleted by OPUS (Speex, CELT, Vorbis(?), and the prop. ones), write about implementations (is there only one so far?), comparison table (Opus, Vorbis, Speex, ..., MP5) of features (channels, freq, bits per sample, license, language (C89), integer impl. (Vorbis decoder only, Opus YES, ...), future use in video files (Theora? Dirac? WebM? other future codecs...), audio files for storage (like Vorbis, no raw Opus defined, only inside OGG), ...
  • Promotional material (some nice free or Public domain sounds in Opus format)

Other

  • Oggz-validate (should also validate opus toc)

Opus-tools

  • A simple real time streaming example tool
  • Replaygain (half done— needs a gain tool)

Experiments

  • Test exp_analysis and void_my_warranty.patch


Future work

  • Smart automatic mode decision
  • psymodel based VBR
  • Remove copy in inverse MDCT
  • Save some float<->int conversions
  • Improvements to LP mode CBR (greg has some code)