Digital Audio#

This Guide is still in draft format.

Section 1. Introduction to Digital Audio#

Section 2. Creating Digital Audio#

Section 3. Preserving Digital Audio#

Bibliography and Further Reading#

Knight, G. & McHugh, J. (2005) Preservation Handbook: Digital Audio. AHDS. http://www.ahds.ac.uk/preservation/audio-preservation-handbook.pdf

1.1 What is Digital Audio?#

  • Aims and Objectives of Specific Guide
    • Background to the guide's focus in an archaeological context (uses)
      • what is #
      • applications of #, how is # used in archaeology

As with digital video, digital audio files have become far easier to create over the last ten years. Although digital video has perhaps found more applications in archaeology, digital audio files are often created as a component of projects looking to record oral histories or to recreate 'archaeological sounds', either through the modern reconstruction of archaeological musical instruments or through the recording of sounds within archaeological contexts such as churches, henges or theatres that have been reconstructed physically or virtually. A number of examples of the latter exist in the online journal Internet Archaeology (e.g. Thomas 2011).

As with the Digital Video guide, this guide aims to address the issues involved in the creation and preservation of 'born digital' audio files and will not aim to cover the creation of files from analogue originals (although many of the issues discussed here equally apply). The digitisation of analogue audio files is covered in detail in the JISC Digital Media guide 'Audio: Digitising analogue media' [1]. In addition to the extensive material available on the JISC Digital Media site, this guide also draws heavily from a number of other key guides on preserving digital audio files, namely the AHDS 'Preservation Handbook: Digital Audio' (Knight & McHugh 2005), JISC's 'Significant Properties Testing Report: Audio Recordings' (Knight 2010) and IASA Technical Committee's 'Guidelines on the Production and Preservation of Digital Audio Objects' (2nd ed.) (Bradley 2009).

1.2 Current Issues and Concerns#

Again, as with digital video data, digital audio files may be large when created or stored in uncompressed formats, and informed decisions need to be made about when and how lower quality files are created. Another issue shared with digital video is that digital audio files consist of a mix of container formats and codecs, which again emphasises the importance of detailed technical metadata in successfully identifying and working with audio files. Metadata also plays a key role in documenting an audio file's creation process and contents (e.g. names and dates of interviews, locations, etc.), as these elements may not be as apparent as in similar video files.

[1] http://www.jiscdigitalmedia.ac.uk/audio/docs/category/digitising-analogue-media

Section 2. Creating Digital Audio#

2.1 General Considerations#

As discussed in Section 1, the quality of the file is a key consideration when creating digital audio files.

  • Compressed vs uncompressed and lossy vs lossless: remember that information discarded by lossy compression cannot be recovered later.
  • Many file formats allow metadata to be embedded.
  • Interviews: clear, or at least be aware of, copyright; document the process; and provide transcripts that clarify who is involved in the recording and who is saying what.

2.2 File Formats#

Waveform Audio (.wav)
Any one of the following compressed audio formats (preferably supplied by depositor):
Quicktime .mov
Real Audio .rm, .ra or .ram
Windows Media Audio .wma
MPEG-1 Audio Layer III .mp3
Ogg Vorbis .ogg

Audio Interchange File .aif
Preferred format
(likely from Mac users)
Audio Interchange File .aif | as .wav | as .wav
SUN au .au | Waveform Audio .wav | as .wav | as .wav
ahds guide
Advanced Audio Coding / AAC MPEG-4 Audio (.aac): Playback is not supported by all audio players. There is a slight loss in sound quality if it is not encoded at a high enough bit rate. Not suitable for preservation at the moment. The Advanced Audio Coding format is based on the MPEG-2 and MPEG-4 standards. AAC files are usually ADTS or ADIF containers.
Audio Interchange File Format (.aif, .aiff): Uncompressed audio suitable as a preservation format. However, these files are large and are not seen as a standard. AIFF is the standard audio file format used by Apple and could be considered the Apple equivalent of WAV.
Broadcast Wave Format (BWF) (.bwf, .wav): Uncompressed audio that supplements the basic RIFF WAVE structure with a “broadcast extension chunk” for metadata. Migration from WAVE to BWF may be problematic. Only linear PCM and MPEG-encoded files are supported in the 2002 specification. Suitable for preservation, particularly in the PCM variant.
MIDI (.mid, .midi): Small audio files that contain instructions for recreating musical compositions. Variations in audio playback on different hardware make it unsuitable for preservation.
Ogg Vorbis (.ogg): A non-proprietary open audio format from Xiph.org (http://www.xiph.org) that is constantly being revised. It may rival MP3 in popularity at some stage. Ogg is a free, open source container format supporting a variety of formats, the most popular of which is the audio format Vorbis. Vorbis offers compression similar to MP3 but is less popular.
MP3 / MPEG-1 Audio Layer III (.mp3): A widely accepted format that can be played on most platforms. Several patents cover the format, although a free licence is granted to non-profit organizations. Suitable for distribution, but ill equipped for preservation.
Quicktime (.qt, .mov): Apple proprietary streaming codec. Not suitable for preservation.
Real Audio (.ra, .ram): Real Networks proprietary streaming codec. Not suitable for preservation.

Sun AU (.au): Large, high quality files but not widely supported outside the UNIX community, so not suitable for preservation.
Sun .au is a straightforward UNIX format that is unfortunately not widely supported outside the UNIX community and is therefore not suitable for preservation. We can accept these files and convert them to .wav for preservation. It should be noted that .au files are usually very slightly compressed, so they are often not as high quality as .wav and .aif files. We will accept .au files, but depositors should be encouraged to use .wav or .aif if possible.

WAVEform (.wav): Uncompressed audio suitable as a preservation format. However, these files are large and are not accepted as a standard in some industries.
Windows Media Audio (.wma): Microsoft proprietary streaming codec. Not suitable for preservation.

flac – File format for the Free Lossless Audio Codec, a lossless compression codec.
m4p – A proprietary version of AAC in MP4 with Digital Rights Management developed by Apple for use in music downloaded from their iTunes Music Store.
vox – The vox format most commonly uses the Dialogic ADPCM (Adaptive Differential Pulse Code Modulation) codec. Similar to other ADPCM formats, it compresses to 4 bits. Vox format files are similar to wave files except that vox files contain no information about the file itself, so the codec, sample rate and number of channels must first be specified in order to play a vox file.
wav – Standard audio file container format used mainly on Windows PCs. Commonly used for storing uncompressed (PCM), CD-quality sound files, which means that they can be large in size, around 10 MB per minute. Wave files can also contain data encoded with a variety of (lossy) codecs to reduce the file size (for example the GSM or MP3 formats). Wav files use a RIFF structure.
wma – The popular Windows Media Audio format owned by Microsoft. Designed with Digital Rights Management (DRM) abilities for copy protection.
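The RIFF structure mentioned above, and the “broadcast extension chunk” that BWF adds to it, can be illustrated with a short sketch. The following is a minimal example rather than a full validator: it walks the top-level chunks of a WAVE file held in memory and reports whether a chunk with the identifier 'bext' (the identifier used for the broadcast extension chunk) is present. The function names are hypothetical and are not part of any ADS or AHDS tool.

```python
import struct


def list_riff_chunks(data: bytes) -> list[tuple[str, int]]:
    """Return (chunk_id, size) pairs for the top-level chunks of a RIFF/WAVE file."""
    if len(data) < 12 or data[0:4] != b"RIFF" or data[8:12] != b"WAVE":
        raise ValueError("not a RIFF/WAVE file")
    chunks = []
    pos = 12  # skip 'RIFF', the overall size field and 'WAVE'
    while pos + 8 <= len(data):
        # Each chunk starts with a 4-byte id and a little-endian 32-bit size.
        chunk_id, size = struct.unpack_from("<4sI", data, pos)
        chunks.append((chunk_id.decode("ascii", errors="replace"), size))
        pos += 8 + size + (size % 2)  # chunk bodies are padded to an even length
    return chunks


def has_broadcast_extension(data: bytes) -> bool:
    """True if the file carries a 'bext' chunk, i.e. looks like a BWF file."""
    return any(chunk_id == "bext" for chunk_id, _ in list_riff_chunks(data))
```

A plain .wav file would normally list only chunks such as 'fmt ' and 'data', whereas a BWF file would additionally list 'bext'. Reading a deposited file with `open(path, "rb").read()` and passing the bytes to `has_broadcast_extension` is one way to check this before migration.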


Notes on formats / Future Directions#

.wav and .aif are uncompressed audio files and are therefore the only suitable formats for delivery and preservation. Whereas .wav is the default format for MS Windows, .aif is the equivalent on a Mac. As we work on Windows PCs here at the ADS, .wav is the preferred delivery format.

As well as a raw audio file (.wav, .aif, .au), depositors should also supply a compressed version of their audio files that we can use for web delivery. There are a wide number of different formats that we can accept for delivery. We can create these ourselves if need be but this will obviously require time and money!

In situations where the depositor supplies only the compressed version of their audio files (for example .rm, .wma, .mp3, .ogg), they should be strongly encouraged to send us uncompressed originals as well. It is possible that the device they recorded with saved directly to a compressed format. If for whatever reason the depositor cannot give us uncompressed files, we can use a free conversion utility to convert the files back to an uncompressed .wav format. It should of course be noted that converting back to .wav will not regain any of the information and quality of the uncompressed original; we will just be creating a file that we can refresh alongside other preservation files in the future.

A number of open source audio formats are being developed by Xiph.org (e.g. FLAC, Speex, Ogg Vorbis, etc.) and it may be that these become more widespread and therefore more suitable for us to accept as deposit and/or archival formats. FLAC (Free Lossless Audio Codec), as a file format that uses lossless compression (c. 40-50% reduction of file size), is the most likely of the Xiph.org formats to be of interest to us as a possible archival format for large audio files or collections. In terms of dissemination formats, it is likely that MP3 will be superseded by the AAC/M4A format developed by the MPEG group and currently used by Apple's iPod (see AHDS Moving Pic and Sound, p51 and Wikipedia: Advanced Audio Coding).

Other formats#

If we receive formats other than those listed above, we should contact the depositor and ask if they can supply the data in a format we support. If not, we need to inform them of our current practice, which is that we endeavour to transform the file(s) into an archive format if the software we have to hand can do this in a quick and automated fashion. If this is not the case, we will archive the file(s) as is, but will be unable to migrate them to newer versions of that format.

How to transfer...#

Conversion of audio files can be a complicated process and this is why the best case scenario would be for the depositor to supply both compressed and uncompressed audio for web delivery and preservation.

When compressing audio for web delivery, we have to make a decision about which codec (compressor/decompressor) to use. Lossy compression of audio files will produce some loss of quality. Lossless codecs can be found (for example SHN (Shorten) and Apple Lossless) but they seem only to halve the file size, whereas MP3 can reduce a file to about a tenth of the original size. The loss of quality from lossy compression can be minimised by use of the correct codec. Different compressors can give very different results, and the choice of compressor may depend on the type of audio that has been recorded. MP3 may be a popular option for compressed digital audio but it is designed primarily for music and is not designed to be streamed. Audio files deposited with the ADS are likely to be recordings of the human voice for oral history projects. As the human voice has a relatively small range, it is a good idea to compress it with a codec designed specifically for this purpose (for example the Quicktime codec Qualcomm Pure Voice, or the free and open source Xiph.org Speex).
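To make the size trade-off concrete, the arithmetic can be sketched as follows. This is an illustrative estimate only: the CD-quality defaults (44.1 kHz, 16-bit, stereo) are an assumption, and the ratios of one half for lossless codecs and one tenth for MP3 are simply the rough figures quoted above, not measured results. The function names are hypothetical.

```python
def pcm_size_bytes(seconds, sample_rate_hz=44_100, bit_depth=16, channels=2):
    """Size of the raw audio data in an uncompressed PCM recording."""
    bytes_per_sample = bit_depth // 8
    return int(seconds * sample_rate_hz * bytes_per_sample * channels)


def estimate_sizes_mb(seconds):
    """Rough sizes in MB (10^6 bytes) using the ratios quoted in the text."""
    pcm = pcm_size_bytes(seconds)
    return {
        "uncompressed_wav": pcm / 1_000_000,
        "lossless_approx": pcm * 0.5 / 1_000_000,  # assumed: about half
        "mp3_approx": pcm * 0.1 / 1_000_000,       # assumed: about a tenth
    }


# One minute of CD-quality audio is 44,100 x 2 bytes x 2 channels x 60 s.
one_minute = pcm_size_bytes(60)  # 10,584,000 bytes, i.e. roughly 10 MB
```

On these assumptions a one-hour interview would need roughly 635 MB uncompressed, about 318 MB with a lossless codec and about 64 MB as MP3. A mono voice recording would halve each of these figures.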

Check if there is any embedded metadata in the audio file. Many audio formats such as mp3 and those created by Quicktime and Windows Media Player have the facility to do this. Be aware of any metadata and remember to check the final version to see if it has been preserved. Metadata can be stored in an ASCII text file if all else fails.
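Where embedded metadata has to be kept outside the audio file, a plain text sidecar can be written alongside it. The sketch below assumes the metadata has already been read out of the file by some other tool and is available as simple key/value pairs; the function names, the '.metadata.txt' suffix and the 'key: value' layout are illustrative choices, not an ADS convention.

```python
from pathlib import Path


def format_sidecar(metadata: dict) -> str:
    """Render metadata as one 'key: value' line per entry."""
    return "".join(f"{key}: {value}\n" for key, value in metadata.items())


def write_sidecar(audio_path, metadata: dict) -> Path:
    """Write the metadata to '<audio file name>.metadata.txt' next to the audio file."""
    sidecar = Path(str(audio_path) + ".metadata.txt")
    # ASCII output; any non-ASCII characters are kept as backslash escapes
    # rather than being silently dropped.
    sidecar.write_text(
        format_sidecar(metadata), encoding="ascii", errors="backslashreplace"
    )
    return sidecar
```

For example, `write_sidecar("interview01.mp3", {"title": "Interview 1", "date": "2011-05-03"})` would create 'interview01.mp3.metadata.txt' containing two lines. The file name and values here are made up for illustration.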

Starting Format: .wav, .aif, .au
Procedure:
1. Decide which presentation format to convert to (.ogg recommended, though neither .ogg nor .mp3 is ideal for the human voice!)
2. Open Audacity. Use 'File' => 'Preferences' then the 'File Formats' tab to adjust the settings for the export format.
3. 'File' => 'Open' to open the file
4. 'File' => 'Export as MP3' or 'File' => 'Export as Ogg Vorbis' to export files to desktop (an encoder needs to be downloaded to export as MP3; alternatively use the Lame option in Mediacoder)
5. Listen to the file, check its length and carry out other checks as documented in the AHDS Audio Preservation Handbook
End Format: .ogg or .mp3

Starting Format: .au
Procedure:
1. Open Audacity. Use 'File' => 'Preferences' then the 'File Formats' tab to adjust the settings for the export format.
2. 'File' => 'Open' to open the file
3. 'File' => 'Export as WAV' to export files to desktop
4. Listen to the file, check its length and carry out other checks as documented in the AHDS Audio Preservation Handbook
End Format: .wav
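The 'check length of file' step can be partly automated for .wav output. The following sketch uses Python's standard `wave` module, which reads uncompressed WAV files only, so it applies to the exported .wav and not to .ogg or .mp3 files. The expected length would come from the original recording (for example as shown in Audacity or noted in the depositor's documentation); the function names and the tolerance value are assumptions chosen for illustration.

```python
import wave


def wav_duration_seconds(source) -> float:
    """Length of a WAV file in seconds; source is a path or a binary file object."""
    with wave.open(source, "rb") as w:
        return w.getnframes() / w.getframerate()


def duration_matches(source, expected_seconds, tolerance_s=0.05) -> bool:
    """True if the WAV length is within tolerance_s of the expected length."""
    return abs(wav_duration_seconds(source) - expected_seconds) <= tolerance_s
```

A mismatch beyond the tolerance would suggest a truncated or otherwise faulty export. This check does not replace listening to the file.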

Maximum (best) requirements for a deposited archive#

  • Both compressed and uncompressed audio files
  • All copyright issues clarified and documented
  • Full data documentation and transcriptions if appropriate
  • If depositor has used a file format that contains metadata, they should inform us of this so we can extract it and preserve, or ensure it's preserved in any future versions
  • Extract metadata from files if appropriate

Section 3. Preserving Digital Audio#

  • Archiving # data
    • deciding what to archive
      • Selection and retention
      • preservation intervention points / file and data lifecycles (specific to guide, will also be covered generically)
    • deciding how to archive
      • archiving strategies (migration (to new format, to 'basic' format), emulation, refreshment)?
      • significant properties
      • file types
    • Metadata and Documentation
      • project level
      • file level
      • standards specific to #
    • Structuring your archive

3.3 Metadata and Documentation#

Software used to create/encode (if applicable)
Bit rate (kbps)
Sampling frequency (kHz), if applicable
Codec used (where appropriate)
Length of recording (mins and secs).
Copyright clearances (very important for audio files, especially oral history).
Transcriptions of interviews etc (where appropriate).
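Several of the technical items listed above can be read directly from an uncompressed .wav file. The sketch below uses Python's standard `wave` module, which handles PCM WAV files only, and derives the bit rate from the sampling frequency, bit depth and number of channels. The function name and the dictionary keys are hypothetical and do not follow any particular metadata standard.

```python
import wave


def describe_wav(source) -> dict:
    """Collect basic technical metadata from a PCM WAV file (path or file object)."""
    with wave.open(source, "rb") as w:
        channels = w.getnchannels()
        bit_depth = w.getsampwidth() * 8  # sample width is reported in bytes
        rate_hz = w.getframerate()
        seconds = w.getnframes() / rate_hz
        compression = w.getcomptype()     # 'NONE' for uncompressed PCM
    minutes, secs = divmod(round(seconds), 60)
    return {
        "codec": "PCM" if compression == "NONE" else compression,
        "sampling_frequency_khz": rate_hz / 1000,
        "bit_depth": bit_depth,
        "channels": channels,
        # Bit rate of uncompressed audio: samples/s x bits per sample x channels.
        "bit_rate_kbps": rate_hz * bit_depth * channels / 1000,
        "length": f"{minutes} min {secs:02d} s",
    }
```

Items that cannot be derived from the file itself, such as the software used, copyright clearances and transcriptions, still need to be supplied by the depositor.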


  • Copyright
    • specific copyright considerations for each guide.

  • Case study/studies

Digital Audio: Bibliography#

Bradley, K. (ed.) (2009) Guidelines on the Production and Preservation of Digital Audio Objects. Second edition. IASA Technical Committee. www.iasa-web.org/tc04/audio-preservation

Thomas, D. (2011) 'An Investigation of Aural Space inside Mousa Broch by Observation and Analysis of Sound and Light'. Internet Archaeology 30. http://intarch.ac.uk/journal/issue30/thomas_index.html

Knight, G. & McHugh, J. (2005) Preservation Handbook: Digital Audio. AHDS. http://www.ahds.ac.uk/preservation/audio-preservation-handbook.pdf

Knight, G. (2010) Significant Properties Testing Report: Audio Recordings. JISC. http://www.significantproperties.org.uk/testingreports.html