Title: LAM: Musical Audio Similarity
1LAM Musical Audio Similarity
- Michael Casey
- Centre for Cognition, Computation and Culture
- Department of Computing
- Goldsmiths College, University of London
2Overview
- Machine Music Understanding
- Features / Classes / Clusters
- Real-Time Audio Matching
- Feature Extraction
- Feature Similarity (Indexing / Retrieval)
- PD/MSP Tools
- Music Similarity Applications
- Sound object matching
- Texture matching
3Sound Understanding
Signal Processing
Sound Understanding
4Feature Extraction
5Feature Extraction
6Feature Extraction
7Feature Extraction
8Feature Extraction
9Feature Extraction
10 Statistical Learning for Decision Making
Partitioning of feature space
Figure labels: likelihood p( ) and prior P( ) for each class (Music, Speech); decision boundary
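The figure labels above suggest a Bayes decision rule: assign a feature value to the class with the largest likelihood times prior. A minimal sketch, assuming one-dimensional Gaussian class models; the means, standard deviations, and priors in `CLASSES` are invented for illustration and do not come from the slides.

```python
import math

def gaussian_pdf(x, mean, std):
    """Likelihood p(x | class) under a 1-D Gaussian (an assumed model)."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

# Hypothetical class models: (mean, std, prior P(class)).
CLASSES = {
    "music": (2.0, 1.0, 0.5),
    "speech": (-2.0, 1.0, 0.5),
}

def classify(x, classes=CLASSES):
    """Return the class that maximizes p(x | class) * P(class)."""
    def posterior_score(name):
        mean, std, prior = classes[name]
        return gaussian_pdf(x, mean, std) * prior
    return max(classes, key=posterior_score)
```

With equal priors and equal variances, the decision boundary of this sketch sits midway between the two means.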
11MPEG-7 Audio Tools
Audio
12MPEG-7 Audio Tools
Log Frequency Spectrogram
Audio
AudioSpectrumEnvelopeD
13MPEG-7 Audio Tools
Decorrelating Transform / Dimension Reduction
Log Frequency Spectrogram
Log Amplitude
Audio
AudioSpectrumEnvelopeD
AudioSpectrumProjectionD
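The three stages named on this slide (log-frequency spectrogram, log amplitude, decorrelating transform with dimension reduction) can be sketched with NumPy. This is an illustration under stated assumptions, not the normative MPEG-7 extraction: the frame size, hop, band edges, and the use of an SVD basis are all choices made here, and the function names are hypothetical.

```python
import numpy as np

def log_frequency_spectrogram(signal, sample_rate, frame=1024, hop=512,
                              f_lo=62.5, f_hi=8000.0, bands_per_octave=1):
    """Sum FFT power into logarithmically spaced bands, one row per frame."""
    n_bands = int(round(np.log2(f_hi / f_lo) * bands_per_octave))
    edges = f_lo * 2.0 ** (np.arange(n_bands + 1) / bands_per_octave)
    freqs = np.fft.rfftfreq(frame, d=1.0 / sample_rate)
    window = np.hanning(frame)
    rows = []
    for start in range(0, len(signal) - frame + 1, hop):
        power = np.abs(np.fft.rfft(signal[start:start + frame] * window)) ** 2
        rows.append([power[(freqs >= lo) & (freqs < hi)].sum()
                     for lo, hi in zip(edges[:-1], edges[1:])])
    return np.array(rows)

def log_amplitude(spectrogram, floor=1e-10):
    """Convert band power to decibels, flooring to avoid log(0)."""
    return 10.0 * np.log10(np.maximum(spectrogram, floor))

def project(features, n_components):
    """Decorrelate and reduce dimension with an SVD basis (an assumed choice)."""
    centered = features - features.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    basis = vt[:n_components].T
    return centered @ basis, basis
```

In this sketch the decibel-scaled band energies play the role of AudioSpectrumEnvelopeD and the projected coefficients the role of AudioSpectrumProjectionD.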
14SoundModelStatePathD
Use estimated state sequence as a feature
State Path
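One common way to estimate a state sequence from a hidden Markov model is Viterbi decoding; the slides do not say which estimator is used, so treat this as an assumed technique. The sketch works in the log domain and takes per-frame observation log-likelihoods as input, which keeps it independent of the observation model.

```python
import math

def to_log(rows):
    """Convert a table of probabilities to log-probabilities."""
    return [[math.log(p) for p in row] for row in rows]

def viterbi(log_start, log_trans, log_obs):
    """Most likely state path.

    log_start[s]    : log P(first state = s)
    log_trans[r][s] : log P(next state = s | current state = r)
    log_obs[t][s]   : log p(observation at frame t | state s)
    """
    n_states = len(log_start)
    score = [log_start[s] + log_obs[0][s] for s in range(n_states)]
    back = []
    for t in range(1, len(log_obs)):
        prev = score
        pointers, score = [], []
        for s in range(n_states):
            best = max(range(n_states), key=lambda r: prev[r] + log_trans[r][s])
            pointers.append(best)
            score.append(prev[best] + log_trans[best][s] + log_obs[t][s])
        back.append(pointers)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]
```

The returned list of state indices, one per frame, is what the later slides treat as a feature in its own right.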
15MPEG-7 Audio Tools
Decorrelating Transform / Dimension Reduction
Log Frequency Spectrogram
Hidden Markov Model
Log Amplitude
Audio
AudioSpectrumEnvelopeD
SoundModelDS
AudioSpectrumProjectionD
16MPEG-7 Audio Strings: Acoustic Lexicons
Decorrelating Transform / Dimension Reduction
Log Frequency Spectrogram
Hidden Markov Model
Log Amplitude
Audio
AudioSpectrumEnvelopeD
SoundModelDS
State Path
AudioSpectrumProjectionD
SoundModelStatePathD
? 7 1 V 7 1 0 1 ...
SYMBOL STRING
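A state path can be turned into a symbol string by mapping each state index to one printable character. The slides do not state the mapping; the sketch below assumes a fixed offset from the character "0", which happens to fit a 40-state model into the range "0" to "W" and reproduces the example string shown above.

```python
SYMBOL_BASE = ord("0")  # assumed offset; the source does not state the mapping

def states_to_string(state_path):
    """Encode a list of state indices as one character per frame."""
    return "".join(chr(SYMBOL_BASE + s) for s in state_path)

def string_to_states(symbols):
    """Decode a symbol string back into state indices."""
    return [ord(ch) - SYMBOL_BASE for ch in symbols]
```

Under this assumed mapping, `states_to_string([15, 7, 1, 38, 7, 1, 0, 1])` yields `?71V7101`.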
18State Symbol Sequence (40 State Model)
?71V7101 ...
19State Symbol Sequence (40 State Model)
?71V7101 ...
20State Symbol Sequence (40 State Model)
?71V7101 ...
21State Symbol Sequence (40 State Model)
?71V7101 ...
22SoundModelStateHistogramD
Figure axes: state index vs. 0.01 s frames; state index vs. seconds
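A state histogram summarizes how often each state is occupied over a stretch of the state path. A minimal sketch, assuming normalized counts and fixed-length windows; the function names and the windowing scheme are illustrative, not taken from the MPEG-7 descriptor definition.

```python
from collections import Counter

def state_histogram(state_path, n_states):
    """Relative frequency of each state index in the path."""
    if not state_path:
        raise ValueError("state_path must not be empty")
    counts = Counter(state_path)
    total = len(state_path)
    return [counts[s] / total for s in range(n_states)]

def windowed_histograms(state_path, n_states, window, hop):
    """One histogram per window, stepping through the path by `hop` frames."""
    return [state_histogram(state_path[i:i + window], n_states)
            for i in range(0, len(state_path) - window + 1, hop)]
```

With 0.01 s frames, `window=100` would give one histogram per second of audio.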
23Self-Similarity Matrix
24Self-Similarity Matrix
25Self-Similarity Matrix
26Self-Similarity Matrix
a
27Self-Similarity Matrix
a
b
28Self-Similarity Matrix
a
b
29Self-Similarity Matrix
30S-Matrix
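A self-similarity matrix compares every frame (or window) of a recording with every other one. The sketch below assumes cosine similarity between feature vectors such as state histograms; the slides do not name the similarity measure, so this is one plausible choice.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0

def self_similarity_matrix(frames):
    """S[i][j] is the similarity between frame i and frame j."""
    return [[cosine_similarity(u, v) for v in frames] for u in frames]
```

Material that recurs later in the recording produces high values away from the main diagonal, which is how repeated sections become visible in the matrix.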
31Efficient Storage / Retrieval
- Real-Time Access
- Large Databases
- Distributed Databases
32PostgreSQL Database Representation of State Path
Strings and Histograms
33Similarity
- Compute distance between feature pairs
- Features: SoundModelStateHistogramD
- Similarity Metric
- dist(a,b) >= 0
- dist(a,b) = 0 iff a = b
- dist(a,b) + dist(b,c) >= dist(a,c)
- Vector Dot Product
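The dot product by itself is a similarity score rather than a distance, but for unit-length vectors it determines the Euclidean distance through ||a - b||^2 = 2 - 2 (a . b). A minimal sketch of a distance built that way, assuming the histograms are normalized to unit length first; whether the original system does this is not stated.

```python
import math

def dot(u, v):
    """Vector dot product."""
    return sum(a * b for a, b in zip(u, v))

def normalize(u):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(dot(u, u))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [a / norm for a in u]

def distance(u, v):
    """Euclidean distance between unit-normalized vectors, via the dot product."""
    a, b = normalize(u), normalize(v)
    # max() guards against tiny negative values from floating-point rounding.
    return math.sqrt(max(0.0, 2.0 - 2.0 * dot(a, b)))
```

Because of the normalization, this distance is zero for any two vectors that point in the same direction, so the identity condition holds for histograms up to scaling.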
34Similarity of Feature Trajectories
35Dynamic Time Warping
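Dynamic time warping aligns two sequences that may unfold at different speeds and returns the cost of the best alignment. A minimal sketch of the standard dynamic-programming recurrence; the default cost (absolute difference of scalars) is an assumption, and a vector distance can be passed in for feature trajectories.

```python
def dtw_distance(seq_a, seq_b, cost=lambda x, y: abs(x - y)):
    """Cost of the cheapest monotonic alignment between two sequences."""
    inf = float("inf")
    n, m = len(seq_a), len(seq_b)
    # d[i][j] = best cost of aligning the first i items of A with the first j of B.
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = cost(seq_a[i - 1], seq_b[j - 1])
            d[i][j] = step + min(d[i - 1][j],      # A advances, B holds
                                 d[i][j - 1],      # B advances, A holds
                                 d[i - 1][j - 1])  # both advance
    return d[n][m]
```

The table has n x m entries, so the cost grows with the product of the two sequence lengths.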
36Acousticon Strings
- Distance Metric
- String Edit Distance (Levenshtein)
- Scalable to Large Databases
- PostgreSQL Implementation
- Can use built-in Index Structures
- Scalable to Real-Time Implementation
- matching and audio streaming (< 20 ms)
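The Levenshtein distance counts the fewest single-character insertions, deletions, and substitutions needed to turn one string into another. A minimal sketch of the standard two-row dynamic program, applied here to state symbol strings; it illustrates the metric only and says nothing about how the PostgreSQL implementation is built.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,               # delete ca
                            curr[j - 1] + 1,           # insert cb
                            prev[j - 1] + (ca != cb))) # substitute or match
        prev = curr
    return prev[-1]
```

For example, dropping one symbol from `?71V7101` to get `?71V701` gives a distance of 1.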
37Information Retrieval for Creativity
- Utilize an extant sound database for new material
- Take the structure of a music clip but replace the content
- New interfaces for music creativity
38Audio Information Retrieval
MPEG-7 Database
A pre-indexed Collection of Sounds
39Audio Information Retrieval
MPEG-7 Database
Extract
Segment
Match
Audio Query
A Sound or Scene or List of Sounds
Result List
40Audio Information Retrieval
MPEG-7 Database
Extract
Segment
Match
Audio Query
Feature extraction from audio.
Result List
41Audio Information Retrieval
MPEG-7 Database
Extract
Segment
Match
Audio Query
Partitioning of audio into chunks.
Result List
42Audio Information Retrieval
MPEG-7 Database
Extract
Segment
Match
Audio Query
Result List
Find similar chunks of Audio
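The Segment and Match steps can be sketched on top of already-extracted state paths. Everything here is an assumption made for illustration: the database is a plain dictionary from hypothetical sound names to state paths, chunks have a fixed length, and matching ranks entries by squared Euclidean distance between state histograms.

```python
def segment(state_path, length):
    """Partition a state path into consecutive fixed-length chunks."""
    return [state_path[i:i + length]
            for i in range(0, len(state_path) - length + 1, length)]

def histogram(chunk, n_states):
    """Relative frequency of each state in a non-empty chunk."""
    return [chunk.count(s) / len(chunk) for s in range(n_states)]

def match(query_chunk, database, n_states, top_k=3):
    """Return the names of the top_k database entries closest to the query."""
    q = histogram(query_chunk, n_states)
    scored = []
    for name, path in database.items():
        h = histogram(path, n_states)
        scored.append((sum((a - b) ** 2 for a, b in zip(q, h)), name))
    return [name for _, name in sorted(scored)[:top_k]]
```

A query is handled by segmenting its state path and calling `match` once per chunk, which yields one result list per chunk.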
43Real-Time Matching
44Musaics
Real-Time Matching
45Musaics
Real-Time Matching
46Musaics
Real-Time Matching
47Musaics
Real-Time Matching
48Musaics
Real-Time Matching
49Musaics
Real-Time Matching
50Musaics
Real-Time Matching
51Musaics
Real-Time Matching
52Musaics
Real-Time Matching
53Musaics
Real-Time Matching