2007 multimedia system final paper presentation music recognition
![Page 1: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/1.jpg)
2007 Multimedia System Final Paper Presentation
Music Recognition 492410021 蘇冠年
492410070 蔡尚穎
![Page 2: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/2.jpg)
Introduction
• In the future, the problem will no longer be how to get access to multimedia content; the task will be how to find what I'm looking for…
![Page 3: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/3.jpg)
Music Recognition System
• Block diagram components: Training, Database, Input Data, Recognition, Result
![Page 4: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/4.jpg)
Before the Algorithm
• Practical Problems
- Interference from noise
- Interference from harmonics
- Differences between singers and instruments
- …
![Page 5: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/5.jpg)
Algorithm I
• Pitch detection - notes, chords …
• Based on frequency-domain analysis
- following the characteristics of music, it analyzes the spectrum at the musical pitch frequencies
- uses the Wavelet Transform and the DTFT (Discrete-Time Fourier Transform)
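As an illustration of the wavelet part, the following is a minimal sketch of a pyramid (Mallat) decomposition with the Daubechies-4 filter pair that the Frequency Analysis slide names. The periodic boundary handling, the function names `daub4_step` and `daub4_decompose`, and the number of levels are assumptions made for this example; the slides do not specify them.

```python
import math

SQRT3 = math.sqrt(3.0)
NORM = 4.0 * math.sqrt(2.0)

# Daubechies-4 scaling (low-pass) filter coefficients.
H = [(1 + SQRT3) / NORM, (3 + SQRT3) / NORM,
     (3 - SQRT3) / NORM, (1 - SQRT3) / NORM]
# Wavelet (high-pass) filter from the quadrature-mirror relation.
G = [H[3], -H[2], H[1], -H[0]]


def daub4_step(signal):
    """One decomposition level with periodic extension.

    Returns (approximation, detail), each half the input length.
    """
    n = len(signal)
    if n < 4 or n % 2:
        raise ValueError("signal length must be even and at least 4")
    approx, detail = [], []
    for i in range(0, n, 2):
        # Four consecutive samples, wrapping around at the end.
        window = [signal[(i + k) % n] for k in range(4)]
        approx.append(sum(h * x for h, x in zip(H, window)))
        detail.append(sum(g * x for g, x in zip(G, window)))
    return approx, detail


def daub4_decompose(signal, levels):
    """Repeatedly split the approximation (pyramid scheme).

    Returns (final approximation, list of detail bands, finest first).
    """
    approx = list(signal)
    details = []
    for _ in range(levels):
        approx, detail = daub4_step(approx)
        details.append(detail)
    return approx, details
```

Each level halves the time resolution and isolates one octave-wide band of detail coefficients, which is one reason a wavelet pyramid is a natural fit for octave-structured music signals.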
![Page 6: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/6.jpg)
Frequency Analysis
• A music signal has a characteristic time-frequency distribution
and is stationary over short time intervals
![Page 7: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/7.jpg)
Frequency Analysis
• Wavelet Transform
- Daub4 wavelet basis, computed with the Mallat algorithm
• DTFT to calculate amplitude
- pitch frequency as parameter ω
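A minimal sketch of the DTFT step, assuming equal-tempered pitches with A4 = 440 Hz and MIDI note numbering (neither is stated on the slide). The DTFT is evaluated only at ω = 2πf/fs for each pitch frequency f, rather than on a uniform frequency grid. The function names and the test tone are illustrative.

```python
import cmath
import math


def note_frequency(midi_note, a4_hz=440.0):
    """Equal-tempered frequency in Hz of a MIDI note number (A4 = 69)."""
    return a4_hz * 2.0 ** ((midi_note - 69) / 12.0)


def dtft_amplitude(frame, freq_hz, fs):
    """Magnitude of the DTFT of `frame` at one frequency, divided by the frame length."""
    omega = 2.0 * math.pi * freq_hz / fs  # radians per sample
    total = sum(x * cmath.exp(-1j * omega * n) for n, x in enumerate(frame))
    return abs(total) / len(frame)


def pitch_spectrum(frame, fs, midi_notes=range(48, 84)):
    """Amplitude at each pitch frequency: {midi_note: amplitude}."""
    return {m: dtft_amplitude(frame, note_frequency(m), fs) for m in midi_notes}


# Synthetic test tone: a pure sine at A4, sampled at 8 kHz.
fs = 8000
frame = [math.sin(2 * math.pi * note_frequency(69) * n / fs) for n in range(2048)]
spectrum = pitch_spectrum(frame, fs)
strongest = max(spectrum, key=spectrum.get)  # MIDI note with the largest amplitude
```

Evaluating only at the pitch frequencies keeps the cost proportional to the number of notes of interest, at the price of ignoring energy that falls between pitches.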
![Page 8: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/8.jpg)
Frequency Analysis
• Analyzed result
![Page 9: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/9.jpg)
Notes Recognition
• Step 1: Note voting
- 1. Analyze each data frame with the wavelet transform in the frequency domain.
- 2. Pick the notes with the largest amplitudes in each frame as candidate notes.
- 3. Count how often each candidate note appears across several neighboring frames.
• Step 2: Mark off the different segments of the music.
• Step 3: Select the note that appears most often and has the largest average amplitude.
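The voting procedure can be sketched as follows. This is a sketch under stated assumptions: the frames passed in are taken to belong to one segment already (Step 2 is not implemented), each frame is a hypothetical `{note: amplitude}` mapping, and the two criteria of Step 3 are combined by using the vote count first and the average amplitude as a tie-breaker. The slides do not say how the two criteria are weighed.

```python
from collections import defaultdict


def candidate_notes(spectrum, top_n=3):
    """Notes with the top_n largest amplitudes in one frame."""
    return sorted(spectrum, key=spectrum.get, reverse=True)[:top_n]


def vote_note(frames, top_n=3):
    """Choose one note for a segment from its neighboring frames.

    frames: list of {note: amplitude} dicts (hypothetical input format).
    Returns the note with the most votes; ties go to the note with the
    larger average amplitude over the frames where it was a candidate.
    """
    if not frames:
        raise ValueError("need at least one frame")
    votes = defaultdict(int)
    amp_sum = defaultdict(float)
    for spectrum in frames:
        for note in candidate_notes(spectrum, top_n):
            votes[note] += 1
            amp_sum[note] += spectrum[note]
    return max(votes, key=lambda n: (votes[n], amp_sum[n] / votes[n]))


# Illustrative amplitudes only, not measured data.
frames = [
    {"C4": 0.9, "E4": 0.3, "G4": 0.2, "A4": 0.1},
    {"C4": 0.8, "E4": 0.4, "G4": 0.1, "A4": 0.3},
    {"C4": 0.7, "E4": 0.2, "G4": 0.3, "A4": 0.1},
]
winner = vote_note(frames, top_n=2)
```

Voting across neighboring frames makes the decision less sensitive to a single noisy frame in which a harmonic briefly outweighs the fundamental.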
![Page 10: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/10.jpg)
- A piece of music
- Waveform of the data
![Page 11: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/11.jpg)
- the spectrogram of segment 1
![Page 12: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/12.jpg)
- determine the note
![Page 13: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/13.jpg)
Chords Recognition
• What is a chord?
• The components of a chord always have similar amplitudes in the frequency domain
![Page 14: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/14.jpg)
Chords Recognition
• Step 1: define the set of candidate notes and the amplitude of each note p
• Step 2: calculate the likelihood coefficient of each note
• Step 3: the coefficient L is the average of the likelihood coefficients of the notes in a candidate chord
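A rough sketch of these three steps. The slide text does not preserve the formula for the per-note likelihood coefficient, so this example assumes one: the amplitude of a note divided by the largest candidate amplitude. That choice, the chord table, and all names below are hypothetical; only the averaging in Step 3 comes from the slide.

```python
def note_likelihoods(amplitudes):
    """Assumed per-note likelihood: amplitude relative to the strongest candidate."""
    peak = max(amplitudes.values())
    if peak <= 0:
        raise ValueError("at least one amplitude must be positive")
    return {note: amp / peak for note, amp in amplitudes.items()}


def chord_coefficient(chord_notes, likelihoods):
    """Coefficient L: average likelihood over the notes of a candidate chord.

    Notes missing from the candidate set contribute zero.
    """
    return sum(likelihoods.get(n, 0.0) for n in chord_notes) / len(chord_notes)


def best_chord(amplitudes, chord_table):
    """Name of the candidate chord with the largest coefficient L."""
    lik = note_likelihoods(amplitudes)
    return max(chord_table, key=lambda name: chord_coefficient(chord_table[name], lik))


# Illustrative values only: candidate note amplitudes and a tiny chord table.
amplitudes = {"C": 0.9, "E": 0.8, "G": 0.85, "A": 0.2}
chord_table = {"C major": ("C", "E", "G"), "A minor": ("A", "C", "E")}
chosen = best_chord(amplitudes, chord_table)
```

Because chord components are expected to have similar amplitudes, a chord containing one weak note (here A) gets a noticeably lower average than a chord whose notes are all strong.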
![Page 15: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/15.jpg)
- A piece of music
- Waveform of the data
![Page 16: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/16.jpg)
- the spectrogram of segment 1
![Page 17: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/17.jpg)
- determine the chords
![Page 18: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/18.jpg)
Algorithm II
• Recognition targets
• Single-pitched melody
• Multiple-instrument melody
![Page 19: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/19.jpg)
![Page 20: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/20.jpg)
Pre-Processing
![Page 21: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/21.jpg)
Adaptive Template-matching
• Phase Tracking
• Template Filtering
![Page 22: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/22.jpg)
Phase Tracking
z : input signal
r , i : possible sound
p : narrow-band filter
![Page 23: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/23.jpg)
Phase Tracking
• fs : sampling frequency
• fc : center frequency of the band-pass filter
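The phase-tracking equations are not preserved in this transcript, so the following is only a sketch of the general idea: estimate the phase of the input at the center frequency fc, then generate a waveform with matching phase. It swaps in a simple quadrature correlation at fc for the narrow-band filter p, and a pure cosine stands in for a real template. Both substitutions, and all function names, are assumptions.

```python
import cmath
import math


def estimate_phase(z, fc, fs):
    """Phase in radians of the fc component of z, relative to cos(2*pi*fc*k/fs).

    Correlating with a complex exponential acts as a very narrow
    band-pass around fc (an assumed stand-in for the filter p).
    """
    acc = sum(x * cmath.exp(-2j * math.pi * fc * k / fs) for k, x in enumerate(z))
    return cmath.phase(acc)


def phase_adjusted_sinusoid(fc, fs, length, phase):
    """Unit-amplitude cosine at fc with the given phase (a stand-in template)."""
    return [math.cos(2 * math.pi * fc * k / fs + phase) for k in range(length)]


fs = 8000.0  # sampling frequency in Hz
fc = 500.0   # center frequency in Hz
# Synthetic input: 100 full cycles of a cosine with phase 0.7 rad.
z = [0.8 * math.cos(2 * math.pi * fc * k / fs + 0.7) for k in range(1600)]
phi = estimate_phase(z, fc, fs)
r = phase_adjusted_sinusoid(fc, fs, len(z), phi)
```

Once the template is phase-aligned with the input, the two differ only in amplitude and spectral shape, which is what the subsequent filtering stage has to account for.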
![Page 24: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/24.jpg)
Template Filtering
• minimization of J
z(k) : input sum of template waveforms
hn(m) : convolution of the filter coefficients
rn(k) : phase-adjusted waveform
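The expression for J is not preserved in this transcript. A common reading of the symbols above, assumed here, is the squared error between the input and the sum of filtered templates, J = Σ_k ( z(k) − Σ_n Σ_m hn(m) · rn(k − m) )². Under that assumption, minimizing J is a linear least-squares problem, sketched below with the normal equations. The function names and the test signal are illustrative.

```python
import math


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("regressors are linearly dependent")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x


def fit_template_filters(z, templates, taps):
    """FIR coefficients h_n(m), m = 0..taps-1, minimizing the assumed J.

    z and every template must have the same length. Returns one list of
    coefficients per template.
    """
    # One regressor column per (template n, delay m): r_n(k - m), zero before k = m.
    columns = []
    for r in templates:
        for m in range(taps):
            columns.append([r[k - m] if k >= m else 0.0 for k in range(len(z))])
    # Normal equations: (A^T A) h = A^T z.
    ata = [[sum(x * y for x, y in zip(ci, cj)) for cj in columns] for ci in columns]
    atz = [sum(x * y for x, y in zip(ci, z)) for ci in columns]
    h = solve_linear(ata, atz)
    return [h[i * taps:(i + 1) * taps] for i in range(len(templates))]


# Synthetic check: the input is an exact mix of two sinusoidal templates.
n = 64
r1 = [math.sin(2 * math.pi * 5 * k / n) for k in range(n)]
r2 = [math.sin(2 * math.pi * 9 * k / n) for k in range(n)]
z = [0.5 * a + 2.0 * b for a, b in zip(r1, r2)]
filters = fit_template_filters(z, [r1, r2], taps=1)
```

With a single tap per template the filters reduce to gains; more taps let each template's spectral shape adapt to the input, at the cost of a larger and potentially less well-conditioned system.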
![Page 25: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/25.jpg)
Template Filtering
![Page 26: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/26.jpg)
Music Stream Networks
• Problem of local information
• Bayesian probabilistic network
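To illustrate why context helps when local information is ambiguous, here is a single Bayesian update, far simpler than a full probabilistic network: the local evidence for each note is multiplied by a note-transition probability given the previous note, then normalized. The function name, the notes, and every probability below are hypothetical values chosen for the example.

```python
def note_posterior(local_likelihood, transition_prob, previous_note):
    """P(note | evidence, previous note), proportional to
    P(evidence | note) * P(note | previous note)."""
    prior = transition_prob[previous_note]
    scores = {note: lik * prior.get(note, 0.0)
              for note, lik in local_likelihood.items()}
    total = sum(scores.values())
    if total == 0:
        raise ValueError("no note is consistent with both evidence and context")
    return {note: s / total for note, s in scores.items()}


# Local evidence cannot separate the two octaves (for example, shared harmonics).
local_likelihood = {"C4": 0.5, "C5": 0.5}
# Hypothetical transition probabilities: after D4, a step to C4 is more
# likely than a leap to C5.
transition_prob = {"D4": {"C4": 0.4, "C5": 0.1}}
posterior = note_posterior(local_likelihood, transition_prob, "D4")
```

Here the frame alone gives a 50/50 split, while the transition context shifts the posterior toward C4.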
![Page 27: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/27.jpg)
![Page 28: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/28.jpg)
Conclusion
![Page 29: 2007 Multimedia System Final Paper Presentation Music Recognition](https://reader035.vdocuments.net/reader035/viewer/2022062322/56814d12550346895dba4e04/html5/thumbnails/29.jpg)
References
1. Zheng Cao, Shengxiao Guan, Zengfu Wang. “A Real-time Algorithm for Music Recognition Based on Wavelet Transform.” IEEE, June 21-23, 2006, Dalian, China.
2. Kunio Kashino, Hiroshi Murase. “Music Recognition using Note Transition Context.” IEEE, 1998. NTT Basic Research Laboratories.
3. Karlheinz Brandenburg. “Digital Entertainment: Media technologies for the future.” IEEE, 2006. Fraunhofer IDMT & Technische Universität Ilmenau.
4. Chen Genfand, Xia Shunren. “The study and prototype system of printed music recognition.” IEEE, 2003.
5. D. Bainbridge, T. C. Bell. “Dealing with superimposed objects in optical music recognition.” IEEE, 15-17 July 1997. Universities of Waikato and Canterbury, New Zealand.
6. “Mallat's Fast Wavelet Algorithm: Recursive Computation of Continuous-Time Wavelet Coefficients.”