MIR Lab: R&D Foci and Demos (MIR實驗室：研發重點及展示)

J.-S. Roger Jang (張智星), Multimedia Information Retrieval (MIR) Lab, CSIE Dept., National Taiwan Univ. http://mirlab.org/jang 2018/5/13

Our R&D Foci
Mission: use machine learning to tackle real-world problems with immediate applications.
Approaches: new learning paradigms; GPU for big data.
Application domains:
Music: retrieval and analysis.
Speech: recognition, scoring, and synthesis.
Image: classification and analysis for semiconductor manufacturing automation.

Music-related Research
Mature technologies: query by singing/humming, audio fingerprinting, music genre classification, music mood classification, beat tracking, query by tapping, pitch/time modification.
Under development: MART, audio watermarking, audio melody extraction, singing voice separation, score following, drum ID for gaming, singing scoring, vibrato detection, enthusiasm detection.

Focus: Music Retrieval
Large-scale music search: query by singing/humming, audio fingerprinting.
Achievements: top-ranked for some MIREX tasks (genre classification, mood classification, beat tracking, audio melody extraction); technology transfer to several companies.
Flow chart: clients (PCs, smartphones, and other mobile devices) send a request containing acoustic features to cloud servers, which return a response containing the search result.
Applications on toys (video): "I’m Billy Bass. I know what you are singing! Pat me and sing to me!"
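The client/server split in the flow chart on this slide (the client sends acoustic features, the server returns a search result) can be illustrated with a small sketch. Everything below is an assumption for illustration: the slides do not say which features or which matching method the lab's system uses. Here the features are a mean-normalized pitch contour, the matcher is plain dynamic time warping (DTW), and the function names, JSON field names, and toy database are hypothetical.

```python
import json
import math


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two numeric sequences."""
    n, m = len(a), len(b)
    d = [[math.inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]


def normalize(pitch):
    """Subtract the mean so matching ignores the key the user sang in."""
    mean = sum(pitch) / len(pitch)
    return [p - mean for p in pitch]


def client_request(pitch_contour):
    """Client side (assumed): send compact features instead of raw audio."""
    return json.dumps({"features": normalize(pitch_contour)})


def server_response(request_json, database):
    """Server side (assumed): rank songs by DTW distance to the query."""
    query = json.loads(request_json)["features"]
    ranked = sorted(
        database,
        key=lambda title: dtw_distance(query, normalize(database[title])),
    )
    return json.dumps({"results": ranked})


# Hypothetical toy database: song title -> pitch contour in MIDI semitones.
DATABASE = {
    "song_a": [60, 60, 67, 67, 69, 69, 67],
    "song_b": [64, 62, 60, 62, 64, 64, 64],
}

# Query: song_a hummed two semitones higher, with the first note held longer.
query = [62, 62, 62, 69, 69, 71, 71, 69]
response = json.loads(server_response(client_request(query), DATABASE))
print(response["results"])  # best match first
```

Sending features rather than audio keeps the request small, which is one plausible reason for the split shown in the flow chart; a production system would also need pitch tracking on the client and an index on the server, neither of which is sketched here.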

Demos for Music-related Research
PC: query by singing and humming, audio fingerprinting, genre classification, beat tracking, singing voice separation, pitch scaling, real-time pitch tracking, drum position ID.
Embedded systems: QBSH over toys ("I’m Billy Bass. I know what you are singing! Pat me and sing to me!").
Apps: auto-rhythm game, beat-off drum game.

Speech-related Research
Mature technologies: voice commands; speech scoring (Mandarin, English, Japanese, Taiwanese); text-to-speech synthesis for Mandarin; speech emotion classification.
Under development: long utterance and text alignment; speaker recognition.

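Snapshots of ASRA Applications
Mandarin speech scoring software (licensed to the Institute for Information Industry, 資策會).
Japanese speech scoring software (licensed to 巨匠電腦).
English speech scoring software (licensed to Speak2me).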

Demos for Speech-related Research
PC: Idiom relay (成語接龍), Recitation machine (唸唸不忘), Bricks of idioms (一語中的), stress detection, text-to-speech synthesis, Chinese conversation classroom, speech scoring, voice commands, Lucy’s Café.
Embedded systems: toys, voice commands over iOS/Android.
Mobile apps: speech scoring game.

Image-related Research
Projects with TSMC: wafer map failure pattern recognition, depth from SEM images, defect circuit image detection, wafer image enhancement, etching width prediction.
Face-based analysis: face recognition, age estimation, expression ID, gender classification.
Others: human identification, particle tracking, leaf identification, people counting.

Thank you for your attention! Questions & comments?