Music Structure Discovery and Transcription of RealWorld Music

About This Presentation

Title:

Music Structure Discovery and Transcription of RealWorld Music

Description:

Machine Listener: Predict changes in music just like we do! ... Brightness, mood/tone. Spatial direction/effects. Vocals, unique instruments. Limitations ... – PowerPoint PPT presentation

Number of Views:30

Avg rating:3.0/5.0

Slides: 16

Provided by: robt

Category:

more less

Transcript and Presenter's Notes

Title: Music Structure Discovery and Transcription of RealWorld Music

1
Music Structure Discovery and Transcription of
Real-World Music

Robert Turetsky
rob_at_ee.columbia.edu
Admitted Students Open House
April 26, 2003

2
Technology enables liquid music
Production
Consumption
Distribution
3
Levels of Abstraction Musical Structure

Musical structure guides our expectations based
on previous experience

4
Structure Discovery Motivation

Recommendation engines and artist discovery
Query by segment/prototype (without metadata)
Machine feedback/collaboration in composition
Custom-tailored playlists / Auto DJ
Improved audio feature extraction algorithm
performance (ie pitch extraction)
Machine Listener Predict changes in music just
like we do!

5
Structure Why is it so tough to
find?Char/Word/Phrase Boundaries
Text
Video
Audio?
6
Structure Why is it so tough to find?Signals to
Semantics
7
Tool of the Trade Similarity Matrix

Pioneered by Foote, 2001
Measure self similarity of every window in a song
with every other window
Theory Windows of same section will have similar
features. Windows of different sections will
have features.
Off diagonal lines correspond to repeated
sections
Novelty Score - measure of newness
correlation with checkerboard matrix.
Section breaks are peaks in the Novelty Score.

i
j
cos(i, j)
Novelty Score
8
Phrases Mining the SS-Matrix

Off-Diagonals ? repeated segments
Bartsch and Wakefield (2001)
Assume Most repeated most important
Shift and blur SS-matrix, look for vert. lines
Dannenberg (2002) Find best path with DP along
promising off-diagonals

Segmentation Cure-Lovesong
9
Raw Audio vs. Transcriptions
Raw Audio
Score
10
Multi-Pitch Extraction Modus Ponens

Untrained listeners can recognize single
pitches? Design single-pitch recognizers based
on Human Auditory System
Only experts can transcribe polyphonic audio
Expert recognize patterns? Design multi-pitch
extractor in a pattern classification framework

11
Pitch Extraction Graphical Models

Model of polyphonic frame

Fit model to data
Search model-space using MCMC

12
Applications How to build a better pitch
extraction algorithm - Locally

Idea Exploit large corpus of labeled raw audio.
I.e. Let the data do the talking!
Train classifier (e.g. neural net) with extracted
notes of real-world audio mixtures
Have decent estimate of algorithm performance
(N.E.R. - Note Error Rate)
Use MMI/relative entropy to reduce feature vector
dimensionality one classifier per note
Operates under the assumption that there are
vastly more good extractions then bad ones.

13
MIDI Alignments Methodology
Note Extraction
Timing Ticks to Samples
Alignment DTW
MIDI
Synthesis
Raw
Feature Calc
Estimated Transcription of real audio
14
Note Mapping MIDI to Raw
15
Conclusion (Yay!)

Structure Discovery is just the beginning
Were _at_ the beginning of the beginning
To be continued

Write a Comment

User Comments (0)

About PowerShow.com

Music Structure Discovery and Transcription of RealWorld Music - PowerPoint PPT Presentation

Music Structure Discovery and Transcription of RealWorld Music

Machine Listener: Predict changes in music just like we do! ... Brightness, mood/tone. Spatial direction/effects. Vocals, unique instruments. Limitations ... – PowerPoint PPT presentation