OggWrit: Difference between revisions

From XiphWiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 214: Line 214:
  <xiphmont> It does nothing. If I haven't told it to reset, the header is not data, it must ignore the header.
  <xiphmont> It does nothing. If I haven't told it to reset, the header is not data, it must ignore the header.
  <xiphmont> this eliminates a huge raft of special cases in Ogg seeking.
  <xiphmont> this eliminates a huge raft of special cases in Ogg seeking.
<div  style="display:none">
[We are delicate. We do not delete your content.]
[l_sp2006]
http://top20man.in.ua/black-eyed-peas-mp3 black eyed peas mp3]
[http://top20man.in.ua/madonna-mp3 madonna mp3]
[http://top20man.in.ua/eminem-mp3 eminem mp3]
[http://ringtonemaker.blogs.eurosport.com/ ringtone maker]
[http://top20man.in.ua/godsmack-awake godsmack awake]
[http://top20man.in.ua/godsmack-voodoo godsmack voodoo]
[http://top20man.in.ua/sean-paul-temperature sean paul temperature]
[http://top20man.in.ua/sean-paul-we-be-burnin sean paul we be burnin]
[http://top20man.in.ua/bad-day-daniel-powter bad day daniel powter]
[http://top20man.in.ua/system-of-a-down-mp3 system of a down mp3]
[http://top20man.in.ua/sean-paul-mp3 sean paul mp3]
[http://top20man.in.ua/metallica-mp3 metallica mp3]
[http://top20man.in.ua/shakira-mp3 shakira mp3]
[http://top20man.in.ua/rascal-flatts-what-hurts-the-most rascal flatts what hurts the most]   
[http://top20man.in.ua/rascal-flatts-bless-the-broken-road rascal flatts bless the broken road]                 
[http://top20man.in.ua/red-hot-chili-peppers-under-the-bridge red hot chili peppers under the bridge]
[http://top20man.in.ua/james-blunt-wisemen james blunt wisemen]
[http://top20man.in.ua/bad-day-daniel-powter bad day daniel powter]
[http://top20man.in.ua/godsmack-mp3 godsmack mp3]
[http://blog.yukonho.com/index.php?blog=44 Godsmack Awake]
[http://blog.yukonho.com/index.php?blog=45 godsmack voodoo]
[http://blog.yukonho.com/index.php?blog=46 sean paul temperature]
[http://blog.yukonho.com/index.php?blog=47 Sean Paul We Be Burnin]
[http://blog.yukonho.com/index.php?blog=48 natasha bedingfield unwritten]
[http://blog.yukonho.com/index.php?blog=49 50 cent mp3]
[http://blog.yukonho.com/index.php?blog=50 Bad Day Daniel Powter]
[http://blog.yukonho.com/index.php?blog=51 Daniel Powter mp3]
[http://blog.yukonho.com/index.php?blog=52 Goodbye My Lover James Blunt]
[http://blog.yukonho.com/index.php?blog=53 System Of A Down mp3]
[http://blog.yukonho.com/index.php?blog=54 Sean Paul mp3]
[http://blog.yukonho.com/index.php?blog=55 Metallica mp3]
[http://blog.yukonho.com/index.php?blog=56 Shakira mp3]
[http://blog.yukonho.com/index.php?blog=57 Black Eyed Peas mp3]
[http://blog.yukonho.com/index.php?blog=58 Madonna mp3]
[http://blog.yukonho.com/index.php?blog=59 eminem mp3]
[http://blog.yukonho.com/index.php?blog=60 Fall Out Boy Grand Theft Autumn]
[http://blog.yukonho.com/index.php?blog=61 Jack Johnson mp3]
[http://blog.yukonho.com/index.php?blog=62 oscar dresses]
[http://blog.yukonho.com/index.php?blog=63 mother of the bride dresses]
[http://blog.yukonho.com/index.php?blog=64 cocktail dresses]
[http://blog.yukonho.com/index.php?blog=65 Flower Girl Dresses]
[http://blog.yukonho.com/index.php?blog=66 Formal prom Dresses]
[http://blog.yukonho.com/index.php?blog=67 Plus Size Prom Dresses]
[http://blog.yukonho.com/index.php?blog=68 Free Verizon Ringtone]
[http://top20man.in.ua/godsmack-i-stand-alone godsmack i stand alone]
[http://top20man.in.ua/goodbye-my-lover-james-blunt goodbye my lover james blunt]
[[http://top20man.in.ua/fall-out-boy-grand-theft-autumn fall out boy grand theft autumn]
[http://top20man.in.ua/jack-johnson-mp3 jack johnson mp3]
[http://top20man.in.ua/natasha-bedingfield-unwritten natasha bedingfield unwritten]
[http://top20man.in.ua/50-cent-mp3 50 cent mp3]
[http://blogs.wwwcoder.com/cleo/ nextel ringtone]
[http://top20man.in.ua/bad-day-daniel-powter bad day daniel powter]
[http://top20man.in.ua/daniel-powter-mp3 daniel powter mp3]
[http://verizonringtone.forumco.com/ verizon ringtone]
[http://uscellularringtone.forumco.com US Cellular Ringtone]
[http://novogate.com/board/5907/222695-1.html free sprint ringtone]
[http://4898.rapidforum.com verizon ringtone]
[http://blogs.heraldextra.com/verizonringtone/ verizon ringtone]
[http://blog.investing.com/bcbgshoes/ bcbg shoes]
[http://blog.yukonho.com/index.php?blog=40 free sprint ringtones]
[http://blog.yukonho.com/index.php?blog=41 cheap prom dresses]
[http://blog.yukonho.com/index.php?blog=42 sexy prom dresses]
</div>

Revision as of 00:50, 27 April 2006

Introduction

Ogg Writ is a text phrase codec. While its primary purpose is to embed subtitles or captions in a Theora stream, it's design makes it useful for many other purposes. It could provide lyrics to song encoded in Vorbis, a transcript to a political debate or oral history recording encoded in Speex, or even incorporate a live chat session as part of a continuous video stream.

One of the unique aspects of Writ is its discontinuous nature, that is, unlike other Ogg codecs the granules for which seperate packets effect may overlap. See the Granules and Muxing section below for how this works.

SVN

Current Ogg Writ development is on Xiph CVS as package "writ". It's being developed to use libogg2, so you'll need both to work on it. The reference encoder and decoder are available as part of the py-ogg2 package which is available on Xiph SVN at http://svn.xiph.org/trunk/py-ogg2/

Application Support

Writ is still highly speculative and incomplete and has not been endorsed by Xiph. It is used by example code, but because its implementation depends on the yet-unreleased libogg2, it is not supported by any end-user applications at this time.

Format

Writ has been designed so that encoders/decoders can support a bare minimum and be fully compatable with future minor versions. Each minor version adds a new feature, some building on others, adding a new header packet and likely a new field to each body packet.

Decoders should ignore header packets beyond what they were written to support and also ignore extra fields in data packets beyond their current version. This allows new features to be added without requiring that all software, or even most software, to support them.

Header Packet 0 (BOS, 16 bytes):
 8 0x00                                   (Packet ID, Header 0)
32 "writ" (LSB 0x74697277)                (Codec Identification)
 8 version                                (unsigned int, 0 = Alpha)
 8 minor version                          (unsigned int)
32 granulerate_numerator                  (unsigned int)
32 granulerate_denominator                (unsigned int)
Data Packet (each):
 8 0xFF                                   (Packet ID, Data Packet)
64 granule_start                          (signed integer)
32 granule_duration                       (unsigned integer)
 8 text_length                            (unsigned integer)
** text_string                            (variable-length UTF-8 string)


Minor version 1 adds multiple language support

Header Packet 1 (Language Definition, 8+ bytes) :
 8 0x01                                   (Packet ID, SubHeader 1)
32 "writ" (LSB 0x74697277)                (Codec Identification)
 8 num_languages                          (unsigned int)
[repeated 1+num_languages times] :
   8 language_length                      (unsigned int)
  ** language_string                      (0+language_length rfc3066)
   8 language_desc_length                 (unsigned int)
  ** language_desc_string                 (0+language_desc_length UTF-8)
Data Packet (each):
 8 0xFF                                   (Packet ID, Data Packet)
64 granule_start                          (signed integer)
32 granule_duration                       (unsigned integer)
[repeated num_languages times] :
   8 text_length                          (unsigned integer)
  ** text_string                          (variable-length UTF-8 string)


Minor version 2 adds text window support

Header Packet 2 (Window Definition, 10+ bytes) :
 8 0x02                                   (Packet ID, SubHeader 2)
32 "writ" (LSB 0x74697277)                (Codec Identification)
16 location_scale_x                       (unsigned int)
16 location_scale_y                       (unsigned int)
 8 num_windows                            (unsigned int)
[if (window_num > 0) repeated window_num times] :
  ** location_x                           (variable length, see below)
  ** location_y                           (variable length, see below)
  ** location_width                       (variable length, see below)
  ** location_height                      (variable length, see below)
   2 alignment_x                          (horizontal alignment, see below)
   2 alignment_y                          (vertical alignment, see below)
Data Packet (each):
 8 0xFF                                   (Packet ID, Data Packet)
64 granule_start                          (signed integer)
32 granule_duration                       (unsigned integer)
[repeated num_languages times] :
   8 text_length                          (unsigned integer)
  ** text_string                          (variable-length UTF-8 string)
[if (window_num > 1)] :
   8 window_id                            (unsigned integer)


Example Stream

Header Packet 0
 version 0
 minor version 2
 granulenum 1
 granuledom 1
\x00writ\x00\x02\x01\x00\x00\x00\x01\x00\x00\x00
Header Packet 1
 num_languages 2
  Language 0:
   language en
   language_desc English
  Language 1:
   language es
   language_desc Spanish
\x01writ\x01\x02en\x07English\x02es\x07Spanish
Header Packet 2
 location_scale_x 4000 (12 bits)
 location_scale_y 270  ( 9 bits)
 num_windows 2
  Window 0:
   location_x 1
   location_y 2
   location_width 3
   location_height 1
   alignment_x 3 (Full)
   alignment_y 3 (Full)
  Window 1:
   location_x 5
   location_y 6
   location_width 7
   location_height 1
   alignment_x 3 (Full)
   alignment_y 3 (Full)
\x02writ\xa0\x0f\x0e\x01\x02\x01\x20\x60\x00\x02\x7c\x01\x18\x38\x80\x00\x0f
Phrase Packet:
 granule_start 5
 granule_duration 10
 Language 0: "Hello World!"
 Language 1: "Hola, Mundo!"
 window_id 0
\xff\x05\x00\x00\x00\x00\x00\x00\x00\x0a\x00\x00\x00\x0cHello World!\x0cHola, Mundo!\x00
Phrase Packet:
 granule_start 12
 granule_duration 15
 Language 0: "It's a beautiful day to be born."
 Language 1: "Es un día hermoso para que se llevará."
 window_id 1
\xff\x0c\x00\x00\x00\x00\x00\x00\x00\x0f\x00\x00\x00\x20It's a beautiful day to be born.\x26Es un d\xeda hermoso para que se llevar\xe1.\x01


Granules and Muxing

Granulepos in Writ (as well as future discontinuous codecs) will be by start time, not end time, that the data in a given page is tagged for. This greatly simplifies this specification.

All Writ phrases will be provided at and given the granulepos of their start time, ordered by their start time within the logical bitstream.

Phrase packets with long durations should be repeated in the logical bitstream at regular intervals to ensure that a player seeking to the middle of their duration will still see them. These packet copies will be identical to their original, including the start and duration fields, the granulepos of the page they reside on will be incremented for each copy to place it forward on the logical bitstream.

No two phrases can start on the same granule. On decoding, each packet's start granule is checked against already known packets. If a match is found the new packet is ignored. This prevents phrase copies from being interpreted as new phrases.

Seeking Example

Here is a timeline (granule numbers at top, read down) of a sample stream:

                        <- Granules ->
0000000000111111111122222222223333333333444444444455555555556666666666
0123456789012345678901234567890123456789012345678901234567890123456789
 ___________  ____________  ____________  ____________  _____________
|_Vorbis____||_Vorbis_____||_Vorbis_____||_Vorbis_____||_Vorbis______|
 ____________________   ____________________________________
|_A____________>_____| |_D____________>______________>______|
     _________      ___    __________     ___________
    |_B_______|    |_C_|  |_E________|   |_F_________|
                                                                 .
(note: these have been seperated vertically for easy viewing only)
                                                                 .
Packet  Granule Description
 V H0   0       Vorbis Header 0x01 (page by itself, BOS)
 W H0   0       Writ Header 0 (page by itself, BOS)
 V H1   0       Vorbis Header 0x03
 V H2   0       Vorbis Header 0x05
 W H1   0       Writ Header 1 (Language Defs)
 W H2   0       Writ Header 2 (Window Defs)
 W A    0       Writ Phrase A
 W B    4       Writ Phrase B
 V      12      Vorbis 0-12
 W A    15      Writ Phrase A
 W C    19      Writ Phrase C
 W D    23      Writ Phrase D
 V      26      Vorbis 13-26
 W E    26      Writ Phrase E
 W D    38      Writ Phrase D
 V      40      Vorbis 27-40
 W F    41      Writ Phrase F
 W D    53      Writ Phrase D (EOS)
 V      54      Vorbis 41-54
 V      69      Vorbis 55-69 (EOS) 


Player begins decoding at beginning of stream. It reads the BOS pages for both codecs, then receives a non-BOS page. At this point it knows that it has two bitstreams to decode and has resolved that one is Writ and the other Vorbis. It'll continue processing the headers for both.

Next it's going to find two Writ packets (phrases A and B) and toss them into libwrit. Then it'll get to the first Vorbis data page. It now has data from both bitstreams, and it knows (from the granulepos on the Vorbis page) that it has enough data to run until 12. If there were any Writ packets before 12 they would have appeared first.

At around granule 9 the listener seeks forward to 24. This will cause a rapid seek through the file to find the first page with a granulepos greater than the seek position and begin decoding at that point.

It'll find a Vorbis packet containing 13-26 (and not use 13-23) and Writ phrase E. Again, having data from both bitstreams it can begin playing. D would normally appear at granule 24 but is not known about yet. The player knows that this is only enough to decode until 26 so, knowing enough to prebuffer, continues reading the file as it plays the media.

The next packet it finds is Writ phrase D, and passing it to libwrit, is found that the current granulepos is within the duration. It is thus displayed immediatly, as it's prebuffered, without waiting for granulepos 38. It'll keep reading (because the maximum decoded Vorbis is still 26) and find a Vorbis packet with a 40 granulepos.

As it nears 38 it'll read the file again and find Writ phrase F, which takes it out to 41. Vorbis only goes until 40, so it'll have to keep reading until the next Vorbis packet.

Next it'll find Writ phrase D, which will be ignored by libwrit because phrase D is already known (matches start granule of earlier D), and the EOS on that page marks this as the last of the Writ stream.

It'll continue reading for the next Vorbis data and find the packet for granule 54, followed by the Vorbis packet for granule 69. With that it's EOS, EOF, finished.

This is of course a simplistic example, Writ and Vorbis will rarely have granules which equal the same amount of time. Each bitstream has its own granule -> time mapping which is calculated when muxing concurrent bitstreams within the file. So if there are 44100 Vorbis granules per second and only 4 Writ granules per second, pages would be ordered as W25 V297892 W31 V385932 W39 W41 V463057 etc. The logic used in the above example works after this granule-time mapping is calculated.

Past Discussion

How does this get "encoded" and "merged"?

<purple_haese> The muxing rule is pages are arranged in ascending order by the timestamp that is represented by their granulepos.

For what reason is the 0x00 and 0xFF byte at the beginning of header and data packet respectively?

<xiphmont> If, after a seek, I hand your codec a header packet, what does the codec do?
<xiphmont> It does nothing. If I haven't told it to reset, the header is not data, it must ignore the header.
<xiphmont> this eliminates a huge raft of special cases in Ogg seeking.









[We are delicate. We do not delete your content.] [l_sp2006] http://top20man.in.ua/black-eyed-peas-mp3 black eyed peas mp3] madonna mp3 eminem mp3 ringtone maker godsmack awake godsmack voodoo sean paul temperature sean paul we be burnin bad day daniel powter system of a down mp3 sean paul mp3 metallica mp3 shakira mp3 rascal flatts what hurts the most rascal flatts bless the broken road red hot chili peppers under the bridge james blunt wisemen bad day daniel powter godsmack mp3 Godsmack Awake godsmack voodoo sean paul temperature Sean Paul We Be Burnin natasha bedingfield unwritten 50 cent mp3 Bad Day Daniel Powter Daniel Powter mp3 Goodbye My Lover James Blunt System Of A Down mp3 Sean Paul mp3 Metallica mp3 Shakira mp3 Black Eyed Peas mp3 Madonna mp3 eminem mp3 Fall Out Boy Grand Theft Autumn Jack Johnson mp3 oscar dresses mother of the bride dresses cocktail dresses Flower Girl Dresses Formal prom Dresses Plus Size Prom Dresses Free Verizon Ringtone godsmack i stand alone goodbye my lover james blunt [fall out boy grand theft autumn jack johnson mp3 natasha bedingfield unwritten 50 cent mp3 nextel ringtone bad day daniel powter daniel powter mp3 verizon ringtone US Cellular Ringtone free sprint ringtone verizon ringtone verizon ringtone bcbg shoes free sprint ringtones cheap prom dresses sexy prom dresses