Published by Preston Griffith. Modified over 9 years ago.
1
[The Band SIG] MPEG-7 Audio 손우람 December 1, 2007
2
Why MPEG-7?
3
MPEG standards
Compression
–MPEG-1 (CD)
–MPEG-2 (DVD, DTV)
–MPEG-4 (Web, Mobile)
Content Description
–MPEG-7
Multimedia Framework
–MPEG-21
Others
–MPEG-A, B, C, D, E
4
MPEG-7 Multimedia Indexing and Searching
MPEG-7 Indexing & Searching:
–Semantics-based (people, places, events, objects, scenes)
–Content-based (color, texture, motion, melody, timbre)
–Metadata (title, author, dates)
MPEG-7 Access & Delivery:
–Media personalization
–Adaptation & summarization
–Usage environment (user preferences, devices, context)
5
MPEG-7 MDS: Free Text Annotation Example
The following example gives an MPEG-7 description of a car that is depicted in an image:
[XML example; surviving text values: "Car", "Four wheel motorized vehicle", "image.jpg"]
6
Starting with audio …
7
Audio Fingerprint
10
Genre Classification …
11
Audio Visualization
12
Music Information Retrieval
Content-based querying and retrieval
Automatic classification
Music recommendation and play-list generation
Music summarization
Musical Feature Extraction
–Harmony, chord and tonality
–Melody and motives
–Rhythm, beat, tempo and form
13
MPEG-7 Audio
–Low-Level Descriptors
–Description Schemes
–Description Definition Language (DDL)
–BiM (Binary Format for MPEG-7)
14
What is a Descriptor (D)?
Definition – the semantics of an audio feature vector or structure
Ex)
–Audio Power
–Audio Envelope
–Audio Spectrum Flatness
15
Description Schemes (DSs)
Definition – simply put, a set of Descriptors (Ds)
Ex)
–Instrument Timbre
LogAttackTime
HarmonicSpectralCentroid
…
16
Description Definition Language (DDL)
A language for defining Ds and DSs
Expressed in XML
…??...
17
Scalable Series
[Figure: an Original Series reduced to a Scaled Series]
Index i: 1 2 3 4 5 6 7 8
ratio: 2, 3, 1
numOfElements: 2, 1, 5
totalNumOfSamples: 12
Scalar vs. Vector
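The figure on this slide can be read as a small worked example. The sketch below assumes the digits belong to three runs, with ratio = 2, 3, 1 and numOfElements = 2, 1, 5, which is consistent with totalNumOfSamples = 12 (2·2 + 3·1 + 1·5) and with a scaled series of 8 elements. The function name `scale_series` and the choice of the mean as the scaling operation are illustrative assumptions, not taken from the slides.

```python
from statistics import mean


def scale_series(samples, runs, op=mean):
    """Reduce `samples` according to a list of (ratio, num_of_elements) runs.

    Each run emits `num_of_elements` scaled values; every scaled value
    summarises `ratio` consecutive original samples using `op`.
    Returns the scaled series and the number of original samples consumed.
    """
    scaled = []
    pos = 0
    for ratio, num_of_elements in runs:
        for _ in range(num_of_elements):
            chunk = samples[pos:pos + ratio]
            if len(chunk) < ratio:
                raise ValueError("runs cover more samples than the series holds")
            scaled.append(op(chunk))
            pos += ratio
    return scaled, pos


# Assumed reading of the slide: ratio = 2, 3, 1 and numOfElements = 2, 1, 5.
original = list(range(1, 13))          # 12 original samples: 1..12
runs = [(2, 2), (3, 1), (1, 5)]
scaled, total_num_of_samples = scale_series(original, runs)

print(len(scaled))                     # number of scaled elements
print(total_num_of_samples)            # original samples covered
```

Passing `op=min` or `op=max` instead of the mean gives other summaries of the same runs.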
18
Low-Level Descriptors
–Basic Descriptors
–Basic Spectral Descriptors
–Signal Parameter Descriptors
–Timbral Temporal Descriptors
–Timbral Spectral Descriptors
–Spectral Basis Descriptors
19
Basic structure of audio
Time Domain
[Figure: a signal split into time frames; labels: 0, Nw, Nhop, Hop size, Lw, L0, L1, L2]
n: index
s(n): signal
Fs: sampling rate
L: index of time frames
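A minimal sketch of the framing shown on this slide, assuming Nw is the frame length in samples and Nhop is the number of samples between the starts of successive frames. The slide does not define either symbol, so both readings are assumptions, as are the function names `frame_signal` and `frame_start_time`.

```python
def frame_signal(s, n_w, n_hop):
    """Split signal `s` into frames of `n_w` samples, advancing `n_hop` samples each time.

    Only complete frames are returned; trailing samples that do not fill
    a whole frame are dropped.
    """
    if n_w <= 0 or n_hop <= 0:
        raise ValueError("n_w and n_hop must be positive")
    return [s[start:start + n_w] for start in range(0, len(s) - n_w + 1, n_hop)]


def frame_start_time(frame_index, n_hop, fs):
    """Start time in seconds of frame L = frame_index at sampling rate `fs`."""
    return frame_index * n_hop / fs


s = list(range(10))                      # toy signal s(n), n = 0..9
frames = frame_signal(s, n_w=4, n_hop=2)
for index, frame in enumerate(frames):   # frames L0, L1, L2, ...
    print(index, frame, frame_start_time(index, n_hop=2, fs=8000))
```

With `n_hop` smaller than `n_w` the frames overlap; with `n_hop == n_w` they tile the signal without overlap.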
20
Basic Descriptors
–Audio Waveform
–Audio Power
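The two basic descriptors can be sketched as follows. This assumes the common reading that Audio Waveform keeps the minimum and maximum sample of each frame and Audio Power is the mean of the squared samples of each frame, both over non-overlapping frames of `n_hop` samples. The function names are illustrative, and the sketch makes no claim to match the normative extraction in the standard.

```python
def split_frames(s, n_hop):
    """Non-overlapping frames of `n_hop` samples; an incomplete tail is dropped."""
    return [s[i:i + n_hop] for i in range(0, len(s) - n_hop + 1, n_hop)]


def audio_waveform(s, n_hop):
    """Per-frame (min, max) pairs, a compact outline of the waveform."""
    return [(min(frame), max(frame)) for frame in split_frames(s, n_hop)]


def audio_power(s, n_hop):
    """Per-frame mean of squared samples."""
    return [sum(x * x for x in frame) / n_hop for frame in split_frames(s, n_hop)]


s = [0.0, 0.5, -0.5, 1.0, -1.0, 0.0, 0.25, -0.25]   # toy signal s(n)
waveform = audio_waveform(s, n_hop=4)
power = audio_power(s, n_hop=4)
print(waveform)
print(power)
```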
21
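Next time …
Take turns preparing one Descriptor each – about 20-40 minutes
–Think of an algorithm for extracting each Descriptor
–Implement it in code (template code to be provided)
Each member gives a seminar on a topic of their choice – about 10-20 minutes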