Survey of Audio Digitization

Introduction

I'm J, and this is the page of audio information I've gathered for the Spring '06 Survey of Digitization (SoD) class at the School of Information at UT-Austin. I am not an expert, but a musician, music collector, and audio enthusiast. This site will focus, for the sake of the SoD students, on questions surrounding the digitization of audio at libraries, collections, and archives. This site is for those who may be involved in some way with the care of audio and its migration from analog to digital.

For the first of my two, three hour classes I've decided to concentrate on covering all the concepts and issues involved in working with analog/digital (a/d) conversion. The second class can then focus on the dilemmas involved with actual projects, particularly our own final project: the digitization of cassettes from the Benson Library at UT-Austin.

Overview of Sound and Recording

Sound Collections and Guides

Our Digitization Project

A few basic points of my own:

° More than with the ears we hear with the mind.

° The audio preservationist and restorationist needs to understand/hear every stage of a sound's life-cycle. (Ideally, this requires knowledge of genres and traditions, ethnography and musicology, vocalization and instrumentation, score and arrangement, players, acoustics, analog recording technology, formats and playback technologies, sound engineering, digital technologies, etc.)

° Though audio archives need preservation standards and fixed playback options, formats and players constantly evolve in a race to their own extinction.

Andrew Dylan's points from his commencement of the Sound Savings Symposium on why sound matters:

° Sound is a medium that is intimately tied to tests of copyright limits in our society. And this is most blatantly so since sound is a medium in high demand by consumers.

° Music and sound are transcultural in a manner that is not so for text. Whether white men can play the blues may never be resolved in some purists' minds, but there is no doubting that the representations of history and culture that are captured in music can be processed and enjoyed by people outside that culture. The rise of world music, the merging of cultural styles, and the worldwide love of opera by people who cannot speak a word of Italian are testimony to the emotional response people have to music.

° The next tidal wave of digital content is rich media, a seamless convergence of audio, video, and text. As yet, true hypermedia of the kind envisaged decades ago has yet to emerge and even the Web, in all its glory, is (with some noticeable exceptions) a text-heavy medium. Audio is the great underutilized resource. Hypermedia in popular use is a visual medium, with audio seen as ‘extra’, but there are signs that this will change.


Sound Basics

What is sound? "Sound can be defined as the change in air pressure above and below an equilibrium (usually the barometric pressure). For example, when a bass drum is struck, the skin vibrates back and forth. As the skin travels outwards, away from the center of the drum, the air pressure surrounding the drum rises above the barometric pressure; conversely as the drum skin travels inwards, the air pressure lowers. This to-and-fro action occurs numerous times per second creating waves of compression and decompression in surrounding air. As air pressure increases by the outward motion of the bass drum skin, the eardrum is pushed towards the center of the head; conversely, as pressure decreases, the eardrum travels away from the center of the head. The higher the vibration speed, the higher the pitch; the larger the change in air pressure, the louder the sound." By Gilles St-Laurent, Music Division, National Library Of Canada, from here.

There is a good animated gif of a sound wave's low/high pressure oscillation against a membrane here.

Check out this old black and white short, "How the Ear Functions."

Sound Perception

Human's generally perceive sound in the pitch range between 20Hz to 20,000kHz. "For those that are curious, the lowest note on the piano is A sounding at 27.500 hz and the highest is the top C sounding at 4186.009 hz (cycles per second). A440 concert pitch refers to tuning the A above middle C to 440.000 hz." from here.

Humans generally perceive sound in the noise intensity range as quiet/low as about 4dBA. On the upper end, noise generally becomes painful around 120dBA. Hearing loss is a function of noise level over time. Maximum exposure should be limited to about 85dBA for 8hrs. 110dBA can damage your ear after a minute and a half, 140dBA can cause damage in an instant, and perforation of the ear drum could occur around 160dBA.

Here is a scale to give you an idea of decibel levels:

10 normal breathing
20 whispering at 5 feet
30 soft whisper
50 rainfall
60 normal conversation
70 TV audio
85 handsaw
110 shouting in ear
120 thunder

For a more exhaustive list of common sounds and their decibel levels go here.

 

Can You Trust Your Ears?

Equal-loudness contour: this graph represents the change in loudness necessary across the audible pitch range for the pitches to be perceived at equal loudness.

Effect of Loudness Changes on Perceived Pitch - A high pitch (>2kHz) will be perceived to be getting higher if its loudness is increased, whereas a low pitch (<2kHz) will be perceived to be going lower with increased loudness. -from here.

Psycho-acoustics (from Wikipedia):It is important to note that the question of what humans hear is not only a physiological question of features of the ear but very much also a psychological issue.

Say this: Maresy-doats and dozy-doats, and liddle lamzy divey. A kiddley divey too, wouldn't you-oo? The way you hear it depends on whether you know what you're saying, which greatly affects interpretation. It's about animals eating oats. How do you hear it now? Sound is its own language. The more you know about what you're listening to, the more you hear. Recording, listening, making sound, dubbing recordings, making conversions, are all a psycho-acoustical art and science.


Brief History of Recording

Upon hearing an early sound recording device in 1888, Sir Arthur Sullivan stated that he was "astonished and somewhat terrified at the result of this evening's experiments--astonished at the wonderful power you have developed, and terrified at the thought that so much hideous and bad music may be put on record forever. - Edward B Moogk

Sound Timelines

A Few Major Events in Early Audio Technology

1876 Telephone - Alexander Graham Bell patents his telephone, built with the assistance of young self-trained engineer Thomas A. Watson. Elisha Gray, who developed a similar device at about the same time, will unsuccessfully

1877 Phonograph - Working with a team of engineers at his Menlo Park, New Jersey, laboratories, Thomas Alva Edison perfects a system of sound recording and transmission. The first recording replayed is a voice saying "Mary had a little lamb its fleece was white as snow." from here.

1916 Microphone - E.C. Wente at Bell Labs developed the condenser microphone to translate soundwaves into electrical waves that could be transmitted by the vacuum tube amplifier.

1918 Speaker - Henry Egerton at Bell Labs patented on Jan. 8 the first balanced-armature loudspeaker driver.

1921 P.A. System - The amplifier, microphone, loudspeaker innovations were combined at Bell Labs to create the first public address systems.

1928 - Nyquist presented his theorem stating that an analog signal should be sampled at regular intervals of time and twice the frequency of the signal's bandwidth to reproduce it with good fidelity.

1931 Hi-Fi Recording - in April, Leopold Stokowski invited Bell Labs to begin sound recording experiments with his Philadelphia Orchestra. In December, the first electrical recordings were made and continued throughout the 1931-32 concert season. 125 of these test recordings have been preserved (a limited edition album of these masters was released in 1980 by Bell Labs).

1932 Stereophonic Recordings - in March, several test recordings were made at the Academy of Music using two microphones connected to two styli cutting two tracks on the same wax disk. This recording is the earliest example of stereophonic recording that has survived.

1957 A/D Conversion - "New forms of playback, file formats, compression and storage of data are all changing on a seemingly daily basis, but the underlying mechanisms for converting real-world sound into digital values, manipulating those data and finally converting them back into real-world sound has not varied much since Max Mathews developed MUSIC I in 1957 at Bell Labs...The importance of their work to information theory, computing, networks and digital audio cannot be understated. For example, the same data stream theory used in high-speed networking, known as T-1 lines is the same technology used in higher-end Digidesign Pro Tools systems today (TDM or time division multiplexing). A widely used method for encoding and decoding binary data, such as that used in digital audio, pulse code modulation or PCM was also developed early on at Bell Labs, attributed to John R. Pierce..." Jeffrey Hass


Analog Formats

We've read "How to Care for your Audio," from the National Film and Sound Archive about basic care for your audio. Let's take a look at a few formats you might find in the basement. Most info gathered from here.

Cylinders - The first and first mass-produced recording format.

Disks - Many types of disks have been produced over the last 100 years. Here are a two common ones.

Acetate

Vynil

Vynil Bowls - make your own!

All Disks

Magnetic Tape - Magnetic tape first appeared in North America just after World War II.

General Handling and Storage Rules

Good Resources


Analog/Digital Conversion and Digital Formats

I found these sites to be good at explaining the process:

Digital Formats


Library of Congress

The Save Our Sounds Project - Dr. Michael Taft (Library of Congress), Head, Archive of Folk Culture, American Folklife Center

Factors considered in choosing which sounds to save for the Save Our Sounds project:

° Content.
° Historical or cultural significance.
° Present state of accessibility.
° Fragility and deterioration.
° Variety of sound recording formats.
° Complexity of collections.
° Diversity of material.
° Other political considerations.

Review of Audio Collection Preservation Trends and Challenges - Sam Brylawski (Library of Congress), Head, Recorded Sound Section Motion Picture, Broadcasting and Recorded Sound Division

"The Library's collections include over 100,000 audio cassettes and 170,000 open-reel tapes."

"Indeed, there is no permanent digital format."

"Following the Open Archival Information System (OAIS) model established by NASA,6 digital objects for sound recordings in the repository will include digital images of record labels or tape boxes and other graphics or accompanying text, in addition to the audio files. The audio files themselves will be very large, recorded at a sampling rates of 96kHz or 192 kHz, with 24-bit word lengths."

LOC uses Metadata Encoding and Transmission Standard (METS):

"METS is complicated. Because it requires populating a very large number of fields, at the present, it is time-consuming to create a full METS record. Officials at the Library of Congress hope and expect to "develop tools for automatically creating metadata," as recommended in a study of challenges related to the preservation of digital content."


The Macaulay Library of Natural Sounds at Cornell University

Info gleaned from this article at RLG DigiNews.

Audio control issues


More Metadata!: iTunes v. Preservation

When metadata is lost in the transfer from analog to digital, we lose more than we realize. This article makes the point that, "Online music stores should facilitate rather than hinder access to this information before, during and after a song or album is purchased."


Best Practice Guides


Resources of Resources

These are sites with lots of great links to information about audio archiving.


Audio Assignment

Thanks to Uri, we have a box full of cassettes from the Benson Library in need of digitization. Each of you will get a cassette to digitize and a spreadsheet to fill in for your audio project. The purpose of this assignment is to give you some experience with audio digitization and to help the Benson out. This is not a difficult project; I will lead you through it step-by-step and hopefully, we will produce digital copies of these tapes just as though we were the digitization lab hired to do the job.

Assignment: Digitize both sides of the tape assigned to you, filling out all the metadata fields given to you on the spreadsheet for documentation and quality control. Produce the raw capture .wav files, archival access .wav copies, and network/internet access .mp3 files for each side of the cassette. Burn a CD of the archival access copies with a track for each side.

Deliverables:

Where to deliver them:

The audio files will go in our project folder on the network, 2007-spring\projects\ischool-blac-aud-... Inside this folder, create a folder with your UTEID for its name, in lowercase. Inside your folder, create three folders:

Gradable Components

Quality Control

You will be performing three quality control measures and reporting them on the spreadsheet:

  1. Are there strange noises? - I would like you to randomly select 10% of your audio to listen to for noises that may have entered the audio file during digitization. These are things that might occur because of a malfunction in the A/D converter, the processor, power, etc. Note: using the computer you are doing the digitization on for anything else while digitizing could cause such noise. Don't have any other application running while digitizing. A momentary lapse in processor resources could mar your audio. Since we are digitizing 90 minutes, 10% comes out to about 10 minutes to listen back to. Select any 5 minutes from each track to listen for anything "digital sounding" that doesn't sound like it came from the tape. Once you have listened to 5 minutes of each, and are satisfied there are no strange digital noises, circle the "no" under "strange noises," and move on. If you do hear something funny, circle"yes" and make note of where it occurs and what it sounds like.


  2. Does the digital dub sound the same as the original? This is a listening test I would like you to perform on the first ten seconds and the last ten second of each side. I've chosen such a small amount of time so that you can effectively listen to the tape and then listen to the digital with the sound of the tape in mind. You will need to unplug the headphones from the Audiophile box and plug them into the tape player directly, going back and forth to compare. Do it a few times. Do they sound the same? If yes, circle"yes" under "sound the same?" If no, try to describe what the difference is in the notes. I'd like you to do this at the beginning and end to confirm that nothing dramatic changed during the process that you might not have been aware of, but which colored the audio somehow.


  3. Was there any clipping? Clipping occurs especially easily in digital recording when the signal is too loud to be processed resulting in distortion. It can be detected visibly when the audio signal fills the audio wave window so much that it hits the ceiling/floor and is simply flat. Visually scroll through the capture audio for signs of clipping. If you see any, listen to it to see if it distorts, causing unwanted noise. If you find any, circle "yes" under "any clipping" and note where is occurs and how bad it interrupts the audio track. If you don't see any, circle"no." In the following picture clipping is occurring to the red and orange waves, not the blue ones:


  4. If you answered "yes" to "strange noise?", "no" to "sounds the same", or "yes" to "any clipping," circle "yes" under "needs redub," and explain how serious you thing the problem interrupts the audio, or if perhaps it only need redubing if we are going to be perfectionists about the audio.