Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/25010
|
Title: | Segregation of Speech, Music and Instrumentals with LSF-RG features |
Authors: | Mukherjee, Himadri Sk, Obaidullah Santosh, K.C. Gonçalves, Teresa Phadikar, Santanu Roy, Kaushik |
Keywords: | Speech Vocals Line Spectral Frequency Framing |
Issue Date: | 2018 |
Publisher: | IEEE |
Citation: | Himadri Mukherjee, Sk Md Obaidullah, K.C. Santosh, Teresa Gonçalves, Santanu Phadikar, and Kaushik Roy. Segregation of Speech, Music and Instrumentals with LSF-RG features. In SKIMA’2018 – 12th International Conference on Software, Knowledge, Information Management and Applications, page (to appear). IEEE, 2018. |
Abstract: | Music based applications have undergone an evolution in the past decade. Development and optimization of Audio based search engines has attracted the interest of researchers for
quite some time. Audio comes from multifarious sources in real world scenario which demand different processing techniques
based on type. A system which can segregate audio based on type prior to searching can help in elevating the performance of
the search engines. In this paper, a system is proposed towards segregation of speech, music and instrumental clips in order to
aid towards performance enhancement of the search engines. The system works with a newly proposed Line Spectral Pair based feature namely Line Spectral Frequency-Ratio Grade(LSF-RG).
The system has been tested on a database of as many as 105571 clips collected from the internet and different classifiers have been
applied and a highest accuracy of 98.95% has been obtained for multi layer perceptron. |
URI: | http://hdl.handle.net/10174/25010 |
Type: | article |
Appears in Collections: | INF - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|