<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.xiph.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Relen</id>
	<title>XiphWiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.xiph.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Relen"/>
	<link rel="alternate" type="text/html" href="https://wiki.xiph.org/Special:Contributions/Relen"/>
	<updated>2026-05-01T06:42:12Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.45.1</generator>
	<entry>
		<id>https://wiki.xiph.org/index.php?title=Ambisonics&amp;diff=6242</id>
		<title>Ambisonics</title>
		<link rel="alternate" type="text/html" href="https://wiki.xiph.org/index.php?title=Ambisonics&amp;diff=6242"/>
		<updated>2007-02-11T15:00:46Z</updated>

		<summary type="html">&lt;p&gt;Relen: /* Resources on Ambisonics */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;&#039;Ambisonics&#039;&#039;&#039; is a surround sound system first developed in the 1970s. &lt;br /&gt;
Its main difference from other surround techniques is that it separates &lt;br /&gt;
transmission channels from speaker feeds, the speaker feeds being derived &lt;br /&gt;
using a decoder situated in the living room.  Decoders can be implemented &lt;br /&gt;
in either hardware or software. Typically more speakers are used than &lt;br /&gt;
transmission channels, and the more speakers used then the more stable the &lt;br /&gt;
resulting soundfield. Speakers can be arranged in a number of configurations, &lt;br /&gt;
regular polygons being the most popular.&lt;br /&gt;
&lt;br /&gt;
Ambisonic files can come in a number of different formats. The main one is &lt;br /&gt;
called B-Format, the other formats being derived from this. UHJ format is &lt;br /&gt;
mono- and stereo-compatible. G-Format is a set of speaker feeds, so can be &lt;br /&gt;
enjoyed in surround sound without the need for a decoder in the living room.&lt;br /&gt;
&lt;br /&gt;
== Ambisonics and 5.1 ==&lt;br /&gt;
Ambisonics and conventional 5.1 surround sound are very different. 5.1 is a &lt;br /&gt;
set speaker feeds, the signal only being fully defined for sounds coming &lt;br /&gt;
from a speaker. Phantom images between speakers can be created, but the &lt;br /&gt;
technique to do so is left unspecified. Many 5.1 releases use pair-wise &lt;br /&gt;
mixing to create phantom images. This is understandable as almost all &lt;br /&gt;
stereo recording are mixed using pair-wise mixing.&lt;br /&gt;
&lt;br /&gt;
Pair-wise mixing is also called &amp;quot;pan-potting&amp;quot;, &amp;quot;amplitude mixing&amp;quot; and &lt;br /&gt;
&amp;quot;intensity stereophony&amp;quot;. It mixes signals into the feeds for a pair of &lt;br /&gt;
speakers to create the illusion that a sound is coming from a point &lt;br /&gt;
somewhere between the speakers. During mixing, the apparent location of &lt;br /&gt;
each sound is determined only by the relative amplitude of that sound in &lt;br /&gt;
the two speakers.&lt;br /&gt;
&lt;br /&gt;
Unfortunately, pair-wise mixing works poorly when the speakers are to the &lt;br /&gt;
rear of the listener and not-at-all when they are to one side. You can &lt;br /&gt;
demonstrate this for yourself by performing &lt;br /&gt;
[http://members.tripod.com/martin_leese/Ambisonic/exper.html a very simple experiment].&lt;br /&gt;
Pair-wise mixing did not work in the quadraphonic era and it will not work &lt;br /&gt;
now. Such an absolute statement can be made because the way that humans &lt;br /&gt;
localise sound has not changed.&lt;br /&gt;
&lt;br /&gt;
Ambisonics is fundamentally different from 5.1. What is encoded in &lt;br /&gt;
Ambisonics is not speaker feeds, but &#039;&#039;direction&#039;&#039;. When mixing in &lt;br /&gt;
Ambisonics, the positions of the speakers are unknown &lt;br /&gt;
&#039;&#039;and are of no interest&#039;&#039;. Further, when Ambisonics is decoded to speaker &lt;br /&gt;
feeds, all of the speakers cooperate to localise a sound in its correct &lt;br /&gt;
position. The speakers all contribute to the creation of a single coherent &lt;br /&gt;
soundfield.&lt;br /&gt;
&lt;br /&gt;
=== Ambisonics to 5.1 ===&lt;br /&gt;
Converting Ambisonics to 5.1 is straightforward, and is discussed below &lt;br /&gt;
(see [[#G-Format|G-Format]]).&lt;br /&gt;
&lt;br /&gt;
=== 5.1 to Ambisonics ===&lt;br /&gt;
Converting 5.1 to Ambisonics is more difficult. It is easy to make each &lt;br /&gt;
speaker feed a phantom image (called a &amp;quot;virtual speaker&amp;quot;). The problem &lt;br /&gt;
with this is that even if the Ambisonic rendering is perfect, the result &lt;br /&gt;
will only be as good as the original 5.1 played through &#039;&#039;real&#039;&#039; speakers. &lt;br /&gt;
It will not be an improvement. Nobody has yet come up with a way for &lt;br /&gt;
Ambisonics to improve 5.1; 5.1 is simply too broken.&lt;br /&gt;
&lt;br /&gt;
== B-Format ==&lt;br /&gt;
B-Format is a single coherent soundfield composed of a set of related &lt;br /&gt;
channels.  The number of channels used depends on whether the soundfiled &lt;br /&gt;
is horizontal-only or full-sphere, and on the order. These B-Format &lt;br /&gt;
channels are transmission channels, not speaker feeds. Listening to &lt;br /&gt;
B-Format requires a decoder in your living room. Some numbers of &lt;br /&gt;
channels are tabulated below.&lt;br /&gt;
&lt;br /&gt;
=== Channel correlation ===&lt;br /&gt;
The correlation between B-Format channels depends on the content.  &lt;br /&gt;
Four-channel B-Format (which Ogg supports) consists of an omni-directional &lt;br /&gt;
component, called W, and three figure-of-eight components pointing &lt;br /&gt;
forward, left and up, called X, Y, Z. &lt;br /&gt;
([http://members.tripod.com/martin_leese/Ambisonic/Harmonic.html Pictures are available].) &lt;br /&gt;
Three-channel, horizontal-only B-Format simply omits the Z channel. This &lt;br /&gt;
means that anything in X also appears in W. Same for Y and Z. (W is &lt;br /&gt;
omni-directional; everything appears in W.) Also, if content comes from &lt;br /&gt;
Front-Left then it appears equally in X and Y. Same for content from &lt;br /&gt;
Front-Right, Back-Left, Back-Right; only the relative polarities change. &lt;br /&gt;
So there can be a lot of correlation between B-Format channels, but it is &lt;br /&gt;
content dependent.&lt;br /&gt;
&lt;br /&gt;
There is a file specification in use for downloadable B-Format files &lt;br /&gt;
called the &lt;br /&gt;
[http://www.ambisonicbootlegs.net/Members/mleese/file-format-for-b-format/ &amp;quot;.amb&amp;quot; specification].&lt;br /&gt;
&lt;br /&gt;
=== Limitations of the &amp;quot;.amb&amp;quot; specification ===&lt;br /&gt;
The [http://www.ambisonicbootlegs.net/Members/mleese/file-format-for-b-format/ &amp;quot;.amb&amp;quot; specification] &lt;br /&gt;
for downloadable B-Format files is based on the WAVE-EX format.  There are &lt;br /&gt;
currently over 85 pieces available in this format &lt;br /&gt;
[http://www.ambisonicbootlegs.net for free download]. Most of these are &lt;br /&gt;
first-order full-sphere soundfields. Some of the limitations of the &lt;br /&gt;
specification are:  &lt;br /&gt;
&lt;br /&gt;
#It is limited to 4 GByte files (2 GBytes if somebody screwed up).&lt;br /&gt;
#It is limited to third-order soundfields and below. While third-order looks like a lot (16 channels), there already exists a prototype mic that can record up to fourth-order.&lt;br /&gt;
#No compression (particularly lossless).&lt;br /&gt;
&lt;br /&gt;
The reason that the &amp;quot;.amb&amp;quot; file specification is limited to third-order &lt;br /&gt;
and below is because it uses the number of channels to uniquely define the &lt;br /&gt;
soundfield order. Unfortunately this simple and elegant scheme does not &lt;br /&gt;
work above third-order as ambiguities creep in. (One ambiguity is &lt;br /&gt;
illustrated in the table below.)&lt;br /&gt;
&lt;br /&gt;
A more general file format will have to use something else, such as &lt;br /&gt;
&#039;&#039;Malham notation&#039;&#039;, or storing both the horizontal-order and &lt;br /&gt;
height-order. There is a one-to-one correspondence between Malham notation &lt;br /&gt;
and the pair of orders, and either can generate the number of channels.&lt;br /&gt;
&lt;br /&gt;
==== Malham notation ====&lt;br /&gt;
Malham notation specifies the order of a B-Format soundfield using a &lt;br /&gt;
string of characters, each character being either &#039;&#039;&#039;f&#039;&#039;&#039; (for full-sphere) &lt;br /&gt;
or &#039;&#039;&#039;h&#039;&#039;&#039; (for horizontal). The first character in the string specifies &lt;br /&gt;
the type of the first-order components, the second character the type of &lt;br /&gt;
the second-order components, etc.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
!&amp;lt;span style=&amp;quot;font-size:80%&amp;quot;&amp;gt;Horizontal&amp;lt;br&amp;gt;order&amp;lt;/span&amp;gt;&lt;br /&gt;
!&amp;lt;span style=&amp;quot;font-size:80%&amp;quot;&amp;gt;Height&amp;lt;br&amp;gt;order&amp;lt;/span&amp;gt;&lt;br /&gt;
!&amp;lt;span style=&amp;quot;font-size:80%&amp;quot;&amp;gt;Soundfield_type&amp;lt;/span&amp;gt;&lt;br /&gt;
!&amp;lt;span style=&amp;quot;font-size:80%&amp;quot;&amp;gt;Malham&amp;lt;br&amp;gt;notation&amp;lt;/span&amp;gt;&lt;br /&gt;
!&amp;lt;span style=&amp;quot;font-size:80%&amp;quot;&amp;gt;Number&amp;lt;br&amp;gt;of_channels&amp;lt;/span&amp;gt;&lt;br /&gt;
!&amp;lt;span style=&amp;quot;font-size:80%&amp;quot;&amp;gt;Channels&amp;lt;/span&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| 1|| 0||horizontal||&#039;&#039;&#039;h&#039;&#039;&#039;|| 3||WXY&lt;br /&gt;
|-&lt;br /&gt;
| 1|| 1||full-sphere||&#039;&#039;&#039;f&#039;&#039;&#039;|| 4||WXYZ&lt;br /&gt;
|-&lt;br /&gt;
| 2|| 0||horizontal||&#039;&#039;&#039;hh&#039;&#039;&#039;|| 5||WXYRS&lt;br /&gt;
|-&lt;br /&gt;
| 2|| 1||mixed-order||&#039;&#039;&#039;fh&#039;&#039;&#039;|| 6||WXYZRS&lt;br /&gt;
|-&lt;br /&gt;
| 2|| 2||full-sphere||&#039;&#039;&#039;ff&#039;&#039;&#039;|| 9||WXYZRSTUV&lt;br /&gt;
|-&lt;br /&gt;
| 3|| 0||horizontal||&#039;&#039;&#039;hhh&#039;&#039;&#039;|| 7||WXYRSPQ&lt;br /&gt;
|-&lt;br /&gt;
| 3|| 1||mixed-order||&#039;&#039;&#039;fhh&#039;&#039;&#039;|| 8||WXYZRSPQ&lt;br /&gt;
|-&lt;br /&gt;
| 3|| 2||mixed-order||&#039;&#039;&#039;ffh&#039;&#039;&#039;|| 11||WXYZRSTUVPQ&lt;br /&gt;
|-&lt;br /&gt;
| 3|| 3||full-sphere||&#039;&#039;&#039;fff&#039;&#039;&#039;|| 16||WXYZRSTUVKLMNOPQ&lt;br /&gt;
|-&lt;br /&gt;
| 4|| 0||horizontal||&#039;&#039;&#039;hhhh&#039;&#039;&#039;|| 9||extra channels unlabled&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=== Default channel conversions from B-Format ===&lt;br /&gt;
Converting a B-Format file to a mono file is straightforward.  Use Mono = &lt;br /&gt;
W*sqrt(2).&lt;br /&gt;
&lt;br /&gt;
Converting a B-Format file to a stereo file is more difficult.  The &amp;quot;proper&amp;quot; &lt;br /&gt;
way to do this is to convert the W,X,Y channels to two-channel UHJ.  &lt;br /&gt;
Unfortunately this requires the use of wide-band 90-degree phase shifters.  &lt;br /&gt;
In the digital domain these are usually implemented as convolution filters.&lt;br /&gt;
&lt;br /&gt;
Assuming 90-degree phase shifters are unavailable then the problem is one of &lt;br /&gt;
choice.  Starting from B-Format, it is possible to synthesize &#039;&#039;any&#039;&#039; mic &lt;br /&gt;
response pointing in &#039;&#039;any&#039;&#039; direction.  Hence, it is possible to synthesize &lt;br /&gt;
&#039;&#039;all&#039;&#039; coincident stereo mic techniques.  Two popular stereo techniques are &lt;br /&gt;
&#039;&#039;Blumlein Mid-Side&#039;&#039; and &#039;&#039;Blumlien Crossed Pairs&#039;&#039;.&lt;br /&gt;
  &lt;br /&gt;
==== Blumlein Mid-Side ====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Mid = (W*sqrt(2)) + X  /*This is a cardioid response pointing forward*/&lt;br /&gt;
Left = Mid + Y&lt;br /&gt;
Right = Mid - Y&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Blumlein Crossed Pairs ====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Left = (X + Y)/sqrt(2)&lt;br /&gt;
Right = (X - Y)/sqrt(2)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Which conversion to stereo is better depends on the material and how it was &lt;br /&gt;
recorded.  A good suggestion is to not specify a &#039;&#039;particular&#039;&#039; default &lt;br /&gt;
channel conversion; instead, simply specify that there must be one.  If one &lt;br /&gt;
has to be specified then Blumlein Crossed Pairs is the simpler.&lt;br /&gt;
&lt;br /&gt;
== UHJ format ==&lt;br /&gt;
B-Format is the main format for Ambisonic files. However, B-Format is &lt;br /&gt;
not mono- or stereo-compatible. This is why the UHJ hierarchical system &lt;br /&gt;
was developed. Depending on the number of channels available, the UHJ &lt;br /&gt;
system can carry more or less information, but at all times it is fully &lt;br /&gt;
mono- and stereo-compatible. Up to four channels (Left, Right, T, Q) may &lt;br /&gt;
be used. The T-channel can also be band-limited but, as this &lt;br /&gt;
&amp;quot;2&amp;amp;frac12;-channel UHJ&amp;quot; was only ever used for FM radio transmission, it &lt;br /&gt;
will not be discussed further.&lt;br /&gt;
&lt;br /&gt;
To listen to UHJ files in surround requires a decoder in your living room. &lt;br /&gt;
Also, UHJ is restricted to first-order soundfields, either horizontal (two- &lt;br /&gt;
and three-channel UHJ) or full-sphere (four-channel UHJ).&lt;br /&gt;
&lt;br /&gt;
Converting B-Format channels to UHJ channels, and vice versa, requires the &lt;br /&gt;
use of wide-band 90-degree phase shifters. In the digital domain these &lt;br /&gt;
are usually implemented as convolution filters. Conversion between &lt;br /&gt;
four-channel B-Format (W, X, Y, Z) and four-channel UHJ (Left, Right, T, &lt;br /&gt;
Q) can be accomplished without loss of information. The same with &lt;br /&gt;
three-channel to three-channel (W, X, Y) &amp;lt;=&amp;gt; (Left, Right, T). It is &lt;br /&gt;
possible to recover three-channel B-Format (W, X, Y) from two-channel UHJ &lt;br /&gt;
(Left, Right), but not without loss. It is also important for the Ambisonic &lt;br /&gt;
decoder to be aware that the B-Format channels were recovered from &lt;br /&gt;
two-channel UHJ.&lt;br /&gt;
&lt;br /&gt;
Several hundred two-channel UHJ LPs and CDs have been released. Three- &lt;br /&gt;
and four-channel UHJ recordings have never been commercially released.&lt;br /&gt;
&lt;br /&gt;
There is a file specification for downloadable two-channel UHJ files &lt;br /&gt;
called the &lt;br /&gt;
[http://www.ambisonicbootlegs.net/Members/mleese/file-format-for-uhj/ &amp;quot;.uhj&amp;quot; specification], but it is not currently in use.&lt;br /&gt;
&lt;br /&gt;
=== Limitations of the &amp;quot;.uhj&amp;quot; specification ===&lt;br /&gt;
The [http://www.ambisonicbootlegs.net/Members/mleese/file-format-for-uhj/ &amp;quot;.uhj&amp;quot; specification] &lt;br /&gt;
for downloadable two-channel UHJ files is based on the WAVE or WAVE-EX &lt;br /&gt;
format. A UHJ chunk is added to the file to indicate it is UHJ. As &lt;br /&gt;
unrecognized chunks are always skipped, use of this chunk maintains stereo &lt;br /&gt;
compatibility. Some of the limitations of the specification are:  &lt;br /&gt;
&lt;br /&gt;
#It is limited to 4 GByte files (2 GBytes if somebody screwed up).&lt;br /&gt;
#It is limited to two-channel UHJ files. Three- and four-channel UHJ are not accommodated.&lt;br /&gt;
#No compression.&lt;br /&gt;
&lt;br /&gt;
The &amp;quot;.uhj&amp;quot; spcecification is only defined for two-channel UHJ to maintain &lt;br /&gt;
stereo compatibility. While it would be possible to add the UHJ chunk to &lt;br /&gt;
three- and four-channel WAVE or WAVE-EX files, the recommendations from &lt;br /&gt;
Microsoft for playing such files is that the audio device should render the &lt;br /&gt;
extra channels to output ports not in use. This can happen even when the &lt;br /&gt;
extra channels are masked off. Because of this, three- and four-channel &lt;br /&gt;
WAVE or WAVE-EX files can not be made stereo compatible.&lt;br /&gt;
&lt;br /&gt;
In the Ogg world, it should be possible to use default channel conversions &lt;br /&gt;
to ensure that three- and four-channel UHJ files remain stereo compatible.&lt;br /&gt;
&lt;br /&gt;
=== Default channel conversions from UHJ ===&lt;br /&gt;
Converting a UHJ file to a mono file is straightforward.  Use Mono = &lt;br /&gt;
(Left + Right)/2.&lt;br /&gt;
&lt;br /&gt;
Converting a UHJ file to a stereo file is even easier.  Use Left = Left, Right = Right, and discard T and Q if present.&lt;br /&gt;
&lt;br /&gt;
== G-Format ==&lt;br /&gt;
A G-Format file contains an Ambisonic soundfield pre-decoded to a set of &lt;br /&gt;
speaker feeds. This allows listeners who do not own an Ambisonic decoder &lt;br /&gt;
to enjoy Ambisonics. When producing the G-Format file, the sound engineer &lt;br /&gt;
creates a set of speaker feeds for a particular number and arrangement of &lt;br /&gt;
speakers. This is typically four speakers arranged in a square or five &lt;br /&gt;
speakers arranged in a regular pentagon. However, other speaker arrangements &lt;br /&gt;
are possible.&lt;br /&gt;
&lt;br /&gt;
A problem with playing G-Format recordings (and Ambisonics in general) is &lt;br /&gt;
that the speakers used need to be phase matched. This is because, in &lt;br /&gt;
Ambisonics, all speakers cooperate to localise sounds in any particular &lt;br /&gt;
direction; there are no &amp;quot;surround speakers&amp;quot; as such. The easiest way to &lt;br /&gt;
accomplish this is to use identical speakers. Unfortunately, many home &lt;br /&gt;
theatre systems include a centre-front speaker which is different from the &lt;br /&gt;
other speakers. It is noticeable that the G-Format recordings commercially &lt;br /&gt;
released on DVD-A by [http://www.wyastone.co.uk/nrl/dvd.html Nimbus Records] &lt;br /&gt;
are for four speakers in a square, the centre-front speaker being unused.&lt;br /&gt;
&lt;br /&gt;
The G-Format file also contains conversion coefficients which can be used &lt;br /&gt;
to recover the original B-Format channels. The recovered B-Format channels &lt;br /&gt;
can then be fed to a decoder in the listener&#039;s living room, and so &lt;br /&gt;
accommodate a speaker arrangement different from the one used when the &lt;br /&gt;
G-Format file was produced. Each B-Format channel is recovered using a &lt;br /&gt;
weighted combination of the speaker feeds in the G-Format file.&lt;br /&gt;
&lt;br /&gt;
More details will follow later. A file format for downloadable G-Format &lt;br /&gt;
files is being developed. As soon as it has been critiqued by the &lt;br /&gt;
Ambisonic community, it will be described here (assuming it survives).&lt;br /&gt;
&lt;br /&gt;
== Resources on Ambisonics ==&lt;br /&gt;
*There is a set of [http://en.wikipedia.org/wiki/Ambisonic Wikipedia articles on Ambisonics].&lt;br /&gt;
*Of particular relevance is the [http://www.ambisonicbootlegs.net/Members/mleese/file-format-for-b-format/ &amp;quot;.amb&amp;quot; specification] in use for downloadable B-Format files.  However the &amp;quot;.amb&amp;quot; spec has some limitations which it would be useful to overcome.&lt;br /&gt;
*There is also the [http://www.ambisonicbootlegs.net/Members/mleese/file-format-for-uhj/ &amp;quot;.uhj&amp;quot; specification] for downloadable two-channel UHJ files, but it is not currently in use.  The &amp;quot;.uhj&amp;quot; spec also has some limitations which it would be useful to overcome.&lt;br /&gt;
*[http://members.tripod.com/martin_leese/Ambisonic/ This website] has many pages on Ambisonics (including at the bottom links to other Ambisonic websites).&lt;br /&gt;
*[http://www.ambisonic.net/ Ambisonic.Net] includes a detailed series of descriptive and practical articles on current and past Ambisonic techniques with links to tools, other sites and additional material.&lt;/div&gt;</summary>
		<author><name>Relen</name></author>
	</entry>
</feed>