Cinepedia

The Technology Behind The Big Screen

  • System
    • System Basics
    • DCI & SMPTE
  • Packaging
    • DCP
    • The Composition
    • Title Versions
    • Composition Playlist (CPL)
    • Track Files
    • Track File Encryption
    • Packing List
    • Additional Metadata
    • Interop DCP / SMPTE DCP
  • Security
    • Trust Model
    • Encryption
    • Trusted Device List
    • Digital Certificate
    • KDM
    • Media Block
    • Security Log
  • Picture
    • Picture Introduction
    • Color Distribution & Display
    • Color Gamut
    • Contrast & Dynamic Range
    • Resolution & Aspect Ratio
    • Light
    • Picture Track File & Compression
    • Projector Image Formation
    • Projection Screens
    • Stereoscopic 3D
  • Sound
    • Cinema Sound Basics
    • Sound Formats and Soundfields
    • MainSound Track File
  • Accessibility
    • Accessibility Overview
    • Accessibility & Audio Track File
    • Timed Text Track Files
    • Reel Flexibility for Timed Text
    • Communications for Off-Screen Timed Text
  • FAQs
    • Technology FAQs
    • Business FAQs
    • Accessibility FAQs
  • History
    • Early History
  • Terminology
  • References
  • Interop DCP
  • menuToggle Mobile Menu
  • Back to Top

Cinema Sound Basics

section-1404

Digital cinema sound is unique from every other commercial distribution format in that the audio signal is not compressed.  Audio is delivered to the cinema in full 24-bits per sample, typically 48,000 samples per second.  (It is possible to carry sound at 96KHz sampling rate, but this rate rarely, if ever, is used.)

Cinema sound has a rich history of formats, beginning with the commercial availability of monophonic sound-on-film in the 1920s.  Stereo-matrix optical sound-on-film was popular in the late 1970’s and throughout the 1980’s.  (“Matrix” refers to the phase-encoding of 4 channels of sound into 2.)  Not only did this format fill out the screen channels with Left, Center, and Right, but also mono Surround.  The format also introduced the “horseshoe” array of surround speakers in the auditorium, surrounding the audience.  Bass-enhancement, as it was called, derived a low-frequency signal from the front speaker channels, which drove separate low-frequency speakers.  When digital sound was introduced for film in the 1990’s, the surround array was divided to deliver a stereo surround signal, and a discrete subwoofer channel was added to the distribution format.  This format remains popular today, known as 5.1 sound.  The 5.1 format delivers Left (L), Center (C), Right (R), Left Surround (Ls), Right Surround (Rs), and Low Frequency Effects ( LFE, or subwoofer).  The astute reader will note that 5.1 sound has 6 channels of sound.  In the nomenclature of multi-channel sound, the “.1” indicates the LFE channel.  As a result, when counting channels, 5.1 = 5 + 1 = 6.

7.1 sound is an improvement on 5.1, in that it further divides the surround speaker array into four sections:  Left Side Surround (Lss), Right Side Surround (Rss), Left Rear Surround (Lrs), and Right Rear Surround (Rrs).  This adds up to 8 channels of sound.

The channel labels L, C, R, Ls, Rs, Lss, Rss, Lrs, Rrs, LFE, and more, are defined in SMPTE ST 428-12 DCDM Common Audio Channels and Soundfield Groups.

In digital cinema, the MainSound Track File defines the carriage of 16 channels of sound.  Of course, 16 channels is never enough, and emerging immersive sound formats allow a much more complex array of signals.  Since 5.1/7.1 sound only occupies 8 channels, at most, of the available 16, there remain 8 channels of bitstream carriage for other signals that are to be synchronized to the movie.  It is in these available sound tracks where Assistive Listening and Narrative audio tracks reside, motion seat data resides, as well as compressed video for assistive sign language.

last changed 2019-03-11 in Sound by MK

Sound Formats and Soundfields

section-1449

Sound Formats

While the concept of surround sound has been around since the 1940’s, the horseshoe-shaped surround speaker array commonly found in cinemas today was established in the 1970’s, with the introduction of the “Stereo Optical” format by Dolby Laboratories. The 5.1 and 7.1 sound formats, in common use today, simply divide the surround speaker array into multiple channels. The progression of surround sound formats is illustrated below.

Cinema speaker formats: Stereo Optical, 5.1 Sound, and 7.1 Sound

Figure S1. Cinema speaker formats: Stereo Optical, 5.1 Sound, and 7.1 Sound

Soundfield Groups and Immersive Sound

The concept of Soundfield Groups embodies the idea that a multiplicity of sound channels will continue to evolve.  What has actually evolved, however, is a more generalized concept of cinema sound, where a common set of sound channels, forming a sound bed, is combined with more localized sounds rendered across speakers located above and about the audience.  This is the basis of Immersive Sound formats.  A key characteristic of Immersive Sound is the 3-dimensional positioning of sound objects through the application of metadata, a rendering engine, and an appropriate loudspeaker system.  To further break from the channel concept, sound objects of limited duration can simply be carried as a chunk of audio.  In this manner, a rich soundfield may be created without the need for a large number of sound channels in distribution.  In practice, 5.1 / 7.1 speaker arrays are combined with additional speakers located above and about the audience, in combination receiving audio from the rendering engine.

Commercially available cinema immersive sound formats are available from Dolby Laboratories (Dolby Atmos®), DTS (DTS:X®), and Barco/Auro Technologies (Auro-3D®). Each of these formats consist of a unique distribution format and proprietary rendering engine. As of this writing, a SMPTE standards committee is working towards a common distribution format for immersive sound. The common distribution format is intended to drive competitive and proprietary immersive sound rendering engines.

last changed 2019-03-11 in Sound by MK

MainSound Track File

section-1406

The MainSound Track File audio characteristics of bit depth, sample rate, channel count, and reference level are defined in SMPTE ST428-2 DCDM Audio Characteristics, which are applicable to both Interop DCP and SMPTE DCP. The maximum number of audio channels carried in the Sound Track File is 16.

Audio channels in the Composition map to the audio outputs of the media block or server in a one-to-one manner. An audio signal carried in channel 1 of the track file will output at channel 1 of the media block or server. Audio signals are typically paired in AES3-formatted streams for output. The pairing takes place in numerical order, i.e, audio channels 1 and 2 are output in the 1st AES3 signal, and so on. The one-to-one mapping of channel to output occurs for both Interop DCP and SMPTE DCP, although SMPTE DCP has exceptions discussed further below.

The ordering of audio and data in the MainSound Track File is called out in ISDCF Doc4 – 16 Channel Audio Packaging Guide. The table below summarizes the ordering of the MainSound Track File:

Channel in Package 5.1 7.1 Description
1 L L Left
2 R R Right
3 C C Center
4 LFE LFE Low Frequency Effects
5 Ls Lss Left Surround / Left Side Surround
6 Rs Rss Right Surround / Right Side Surround
7 HI HI Hearing Impaired
8 VI-N VI-N Visually Impaired Narrative
9 not used not used Reserved for SDDS 7.1
10 not used not used Reserved for SDDS 7.1
11 not used Lrs Left Rear Surround
12 not used Rrs Right Rear Surround
13 Motion Data Motion Data Motion Seats
14 Sync Sync FSK Sync Signal for Immersive Sound
15 Sign Language Sign Language Sign Language Video
16 not used not used

 

Table S-1. MainSound Track File Signal Order

It was noted earlier that the signal order described in the table above also describes the ordering of audio outputs from the media block or server. If Left audio is to be placed differently in the track file, then the playback device must route the Left signal to the output defined in the table for interoperability.

SMPTE DCP prescribes several incompatible methods for packaging 5.1 and 7.1 audio. Of these multiple methods, it is recommended that only the channel configuration defined in SMPTE 429-2 DCP Operational Constraints, Annex A, Channel Configuration 4, be used in commercial distributions. In this channel configuration, the ordering of audio and data channels defined in Table S-1 above is recommended.

However, SMPTE 429-2 also offers a few other methods for packaging 5.1 and 7.1 audio. SMPTE 429-2 defines alternative packaging channel configurations for 5.1 and 7.1, as well as configurations for other sound formats. In addition, it is possible to label each audio channel in accordance with the SMPTE ST 377-4 MXF Multichannel Audio Labeling Framework. These alternative methods for packaging multi-channel audio order the channels differently, and as discussed, require signal routing in the media block or server to output the signal to the proper speaker or device. However, the alternative methods in SMPTE DCP are not available in Interop DCP, and not all media blocks and servers are configured to properly route signals. For SMPTE DCP, it is recommended that Channel Configuration 4 and the signal ordering described in the table above be used for compatibility.

last changed 2019-03-11 in Sound by MK

About

About Cinepedia

Interop DCP

The Interop DCP documentation below is provided for those who seek interoperability with older … more

An Early History of Digital Cinema

Public demonstrations of modern day digital cinema began in 1999 ...

copyright © 2016 - 2025 mkpe consulting llc