Jump to content

Draft:Audio Definition Model

From Wikipedia, the free encyclopedia

Audio Definition Model
ITU-R Recommemndation BS.2076
AbbreviationADM
StatusPublished
First publishedJune 2015 (2015-06)
Latest version2
OrganizationInternational Telecommunications Union
Related standardsITU-R Recommendations BS.2125, BS.2094, BS.2127
PredecessorEBU Tech 3364
Websitehttps://www.itu.int/rec/R-REC-BS.2076-2-201910-I/en

The Audio Definition Model (ADM) is a standardised metadata model for describing the technical properties of audio. It was originally specified by the European Broadcasting Union, but since 2015 has specified by the ITU as Recommendation ITU-R BS.2076.

Scope

[edit]

The ADM is intended for the delivery and exchange of audio content from production through to the stage where the audio is encoded for final broadcast. Due to its flexibilty it can be used to describe very simple audio configurations such as traditional mono and stereo content, all the way through to highly complex immersive and interactive productions, including Next Generation Audio (NGA). The ADM does not set any limits on the quantity or complexity of the audio it is describing.

Type of Supported Audio

[edit]

The ADM supports five types of audio:

  • Channel-based - an audio channel is expected to be delivered to a loudspeaker without any need for modification, e.g. stereo.
  • Scene-based - the audio channels represent a speaker-independent representation of a soundfield using spherical harmonics. e.g ambisonics and Higher Order Ambisonics (HOA).
  • Object-based - each audio channel has positional metadata or other properties attached to it.
  • Matrix-based - audio channels are combined via matrix equations to generate other channels.
  • Binaural-based - binaural recording has been used to simulate the acoustic effects of the head and ears, or the audio has been pre-rendered using binural filters; and intended to be played over headphones.

Structure

[edit]

The ADM consists of the set of elements to desribe the format and general content of the associated audio. The elements that describe the format are:

  • audioTrackFormat - describes the essential audio data type in a track (e.g. PCM).
  • audioStreamFormat - a combination of tracks (audioTrackFormat) that must be combined to represent one or more audio channels.
  • audioChannelFormat - describes a single mono channel of audio, including which type of audio it is (e.g. front-left channel)
  • audioPackFormat - a combination of related channels (audioChannelFormat) that must be combined to represent a particular sound (e.g. stereo).

The elements that describe the content are:

  • audioObject - references one or more tracks and associated audioPackFormat to define an audio object.
  • audioContent - describes a set of audio objects with related content, and contains content information such a language and loudness.
  • audioProgramme - decscribes a whole programme and references the relevant audioContents to make that programme.

Rendering

[edit]

As with any metadata the ADM metadata requires parsing and processing to be useful. Apart from channel-based audio, each type of audio requires processing to convert it into audio channels that are able to sent to output devices. This type of processor is called a renderer, which reads in ADM metaadata and its associated audio and outputs audio channels based on an assignement output configuration.

A reference ADM renderer has been standardised in the ITU with Recommendation ITU-R BS.2127. This was derived from EBU work in EBU Tech .

Serial ADM

[edit]

The ADM, as defined in ITU-R BS.2076, is designed for file-based applications, where a complete audio programme will be carried in a single file. However, this is not suitable for streaming or live scenarios where audio and metadata needs to be delivered in real-time. The Serial ADM (S-ADM) is a version of the ADM that carries audio and its associated metadata in a succession of time-limited frames. This is standardised in the ITU as Recommendation ITU-R BS.2125.

Format and Carriage

[edit]

The ADM is primarily represened in XML. It does not carry any audio itself, though it does the reference audio tracks it is describing. All the XML elements and attributes names are in English.

ADM XML can be carried in a number of file formats including:

  • '''BW64''' - the axml chunk carries the ADM XML metadata, and the chna chunk carries the channel allocation look-up table.
  • '''MXF''' -

S-ADM frames can be carried over these transport methods:


See also

[edit]
  • BWF, Broadcast Wave Format
  • BW64 ([[1] Long-form file format for the international exchange of audio programme materials with metadata])

References

[edit]

Audio Definition Model]

Common definitions for the Audio Definition Model]

Audio Definition Model renderer for advanced sound systems]

A serial representation of the Audio Definition Model]

Advanced sound system for programme production]