Fundamentals of Audio Signals - PowerPoint PPT Presentation

1 / 34

About This Presentation

Title:

Fundamentals of Audio Signals

Description:

Fundamentals of Audio Signals. Two signals of different amplitudes ... Professional audio uses 16 bits ... We may also have audio data coming from more than ... – PowerPoint PPT presentation

Number of Views:481

Avg rating:3.0/5.0

Slides: 35

Provided by: timoth73

Category:

more less

Transcript and Presenter's Notes

Title: Fundamentals of Audio Signals

1
Fundamentals of Audio Signals

Two signals of different amplitudes
A greater amplitude represents a louder sound.

2
Fundamentals of Audio Signals

Two signals of different frequencies
A greater frequency represents a higher pitched
sound.

3
Fundamentals of Audio Signals

Any sound, no matter how complex, can be
represented by a waveform.
For complex sounds, the waveform is built up by
the superposition of less complex waveforms
The component waveforms can be discovered by
applying the Fourier Transform
Converts the signal to the frequency domain
Inverse Fourier Transform converts back to the
time domain

4
Sampling

Sounds can be thought of as functions of a single
variable (t) which must be sampled and quantized
The sampling rate is given in terms of samples
per second, or, kHz
During the sampling process, an analog signal is
sampled at discrete intervals
At each interval, the signal is momentarily
held and represents a measurable voltage rate

5
Quantization

Audio is usually quantized at between 8 and 20
bits
Voice data is usually quantized at 8 bits
Professional audio uses 16 bits
Digital signal processors will often use a 24 or
32 bit structure internally

6
Quantization

The accuracy of the digital encoding can be
approximated by considering the word length per
sample
This accuracy is known as the signal-to-error
ratio (S/E) and is given by
S/E 6n 1.8 dB
n is the number of bits per sample

7
Quantization

When a coarse quantization is used, it may be
useful to add a high-frequency signal (analog
white noise) to the signal before it is quantized
This will make the coarse quantization less
perceptible when the signal is played back
This technique is known as dithering
During the sampling process, an analog signal is
sampled at discrete intervals
At each interval, the signal is momentarily
held and represents a measurable voltage rate

8
Channels

We may also have audio data coming from more than
one channels
Data from a multichannel source is usually
interleaved
Sampling rates are always measured per channel
Stereo data recorded at 8000 samples/second will
actually generate 16,000 samples every second

9
Digital Audio Data

A complete description of digital audio data
includes (at least)
sampling rate
number of bits per sample
number of channels (1 for mono, 2 for stereo,
etc.)

10
Analog to Digital Conversion

Nyquists theorem states that if an arbitrary
signal has been run through a low-pass filter of
bandwidth H, the filtered signal can be
completely reconstructed by making only 2H
(exact) samples per second.
So, a low-pass filter is placed before the
sampling circuitry of the analog-to-digital (A/D)
converter.

11
Analog to Digital Conversion

If frequencies greater than the Nyquist limit
enter the digitization process, an unwanted
condition called aliasing occurs
The low-pass filter used will require the use of
a gradual high-frequency roll-off, thus a
sampling rate somewhat higher than twice the
Nyquist limit is often used
A/D conversion may make use of a successive
approximation register (SAR)

12
Analog to Digital Conversion

The low-pass filter can cause side effects.
One way that these side effects can be overcome
is through the use of oversampling - a
signal-processing function that raises the sample
rate of a digitally encoded signal.
Consumer and professional 16-bit D/A converters
often use up to 8- and 12-times oversampling,
raising the sampling rate of a CD (for example)
from 44.1 kHz to 352.8 kHz or 529.2 kHz.
By altering the signals noise characteristics,
it is possible to shift much of the overall
bandwidth noise out of the range of human hearing.

13
Pulse Code Modulation

The method that has been discussed for storing
audio is known as pulse code modulation (PCM).

14
Pulse Code Modulation

PCM is common in long-distance telephone lines.
The analog signal (voice) is sampled at 8000
samples/second with 7 or 8 bits per sample
A T1 carrier handles 24 voice channels
multiplexed together
The bandwidth of this type of carrier can be
calculated as follows
8 bits x 8000 samples/second x 24 channels
1.544 Mbps
Note that one out of 8 bits is for control, not
data.

15
Pulse Code Modulation

D/A conversion process
parallelize the serial bit stream
generate an analog voltage analogous to the
voltage level at the original time of sampling
An output sample and hold circuit is used to
minimize spurious signal glitches
a final low-pass filter is inserted into the path
Smooths out the non-linear steps introduced by
digital sampling

16
Pulse Code Modulation

Other PCM topics
mu-law and A-law companding
DPCM
DM
ADPCM

17
Digital Signal Processing

Processing of a digital signal to achieve special
effects may generally be described in terms of
some simple functions
Addition
Multiplication
Delay
Resampling

18
Digital Signal Processing

Addition of two signals is accomplished by adding
the sample values of the signals at each sampling
point h(t)f(t)g(t)
We can add as many signals as desired together
Multiplication of a given signal is represented
as g(t)mf(t), where m is the multiplication
factor.
Multiplication is used to increase or decrease
the gain (loudness) of a signal. If m1, g is
louder than f. If m
Note that when adding signals together or
multiplying by a number greater than one, care
must be taken when the signal reaches the upper
limit of the sample size

19
Digital Signal Processing

Delay is an important effect described as
follows g(t)f(td), where d is a delay time
Use delay and addition to model echo
f(t) HELLO
g(t) f(t d1) , where 0
g(t) HELLO
h(t) f(t d2) , where 0
h(t) HELLO
F(t) f(t) g(t) h(t)
HELLO HELLO HELLO

20
Digital Signal Processing

Now consider a more realistic echo effect. We
need to make each succeeding echo softer. We can
do this with multiplication.
g(t) mg(t) h(t) nh(t), 0
F(t) f(t) g(t) h(t)
HELLO HELLO HELLO

21
Digital Signal Processing

When delays of 35-40 ms and greater are used, the
listener perceives them as discrete delays
Reducing the delay to the 15-35 ms range will
create delays that are too closely spaced to be
perceived as discrete delays
When used with instruments, the brain is fooled
into thinking that more instruments are playing
than there actually are
combining several short term delay modules that
are slightly detuned in time, an effect known as
chorusing can be achieved (used by guitarists,
e.g.)

22
Pitch-Related Effects

DSP functions are available that can alter the
speed and pitch of an audio program. These can
Change pitch without changing duration
Change duration without changing pitch
Change both duration and pitch
The process for raising and lowering the pitch of
a sample is shown on the next slides

23
Pitch-Related Effects
24
Pitch-Related Effects
25
Noise Elimination

The noise elimination process can be seen to
consist of three steps
Visual analysis
De-clicking
De-noising
Use visual analysis to determine the type of
noise and to guide the next two steps

26
Noise Elimination

De-clicking involves the removal of noise
generated by analog side effects such as tape
hiss, needle ticks, pops, etc.
This is similar to snow removal in image
processing
(the noise manifests itself as large
discontinuities in the sample waveform)
The noise is likely to have affected more sample
data in the audio file than in the corresponding
image file
A needle skip which affects 1/4 second of the
file affects 11000 samples at the audio CD
sampling rate
Therefore, reconstruction of the affected area is
not the straightforward linear interpolation
process used in images
Must examine a large portion of the waveform to
reconstruct

27
Noise Elimination

De-noising involves the removal of background
noise such as hum, buzzes, air-conditioner
noises, etc
The waveform is analyzed to determine if louder
sounds will mask the softer sound
This involves breaking down the audio spectrum
into a large number of frequency bands
The signal is compared with a signature which
represents the background noise. This is taken
from a silent moment in the samplefile. It must
be determined which portion of a signal is noise
and whether the noise can be deleted without
distorting the program

28
Digital Signal Processing

Other DSP functions include digital mixing and
sample rate conversion
Digital mixing is the integration of a number of
digital audio signals into a single ouput signal
Sample rate conversion is necessary when a signal
sampled at one rate must be played back on or
transferred to equipment which uses another rate
An example is the use of digital audio as the
sound track for video. The incoming rate of 44.1
kHz must be pulled-down to 44.056 kHz

29
Fading

Fading is another important DSP function
During a fade, the calculated sample amplitudes
are either proportionately reduced or
proportionately increased in level, according to
a defined curve ramp
For example, usually when performing a fade out,
the signal will begin at a level that is 100
percent of its current value and will reduce over
the defined time to 0 percent
Examples of various fade curves are shown in the
following slides

30
Fading
31
Fading
32
Fading

To find the linearly faded value of a sample at
time tx, t0txt1, we use the following equation
s(tx) s(tx) (tx - t0) / (t1 - t0)
We can also combine the fade in of one soundfile
with the fade out of another soundfile to produce
the effect known as crossfade

33
Fading
34
Fading

Note that the two curves intersect at 50
attenuation and that the sum of the two values at
any point in time is always 100
Thus, we can add together the two signals to form
our crossfaded signal and the amplitude of the
waveform will never be greater than the maximum
possible amplitude

Write a Comment

User Comments (0)