Welcome! Here at the Lee Laboratory of Nagoya Institute of Technology, we conduct research on speech recognition, speech dialogue, natural language processing, speech interfaces, and speech interaction, targeting human-to-human and human-to-machine communication using speech and language.
Our aim is to develop technologies for intelligent information processing, including speech, language, and dialogue, and to realize natural, easy-to-use voice and language interfaces.
LEE Akinobu was born in Kyoto, Japan, on December 19, 1972. He received the B.E. and M.E. degrees in information science, and the Ph.D. degree in informatics from Kyoto University, Kyoto, Japan, in 1996, 1998 and 2000, respectively. He worked at Nara Institute of Science and Technology as an assistant professor from 2000 to 2005. He is currently a professor at Nagoya Institute of Technology, Japan. His research interests include speech recognition, spoken language understanding, and spoken dialogue systems. He is a member of IEEE, ISCA, JSAI, IPSJ and the Acoustical Society of Japan. He is also a developer of the open-source speech recognition software Julius and the CG agent-based speech interaction toolkit MMDAgent.
Julius version 4.6 has been released. You can get it from its GitHub site.
What’s new in Julius-4.6
Julius-4.6 is a minor release with new features and fixes, including GPU integration and grammar handling updates.
GPU-based DNN-HMM computation (Take a look at v4.6 performance comparison on YouTube!)
Julius can now compute DNN-HMM on a GPU. Total decoding is four times faster than the CPU-based computation in Julius-4.5.
Requires CUDA version 8, 9 or 10.
Julius has merged a pull request that adds a new feature, “grammar search on the 1st pass”. To use it, get the latest code on the master branch.
It applies the full grammar on the 1st pass, and thus outputs a more reliable (grammar-constrained) result at the 1st pass.
Background: Grammar-based recognition in Julius does not apply the full grammar on the 1st pass. For efficiency, it applies only the word-pair constraint extracted from the grammar.
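To illustrate why a word-pair constraint is weaker than the full grammar, here is a minimal Python sketch. It is a toy model and not Julius's internal representation: the grammar is assumed to be a small finite set of sentences, and the names `GRAMMAR`, `extract_word_pairs`, `passes_word_pair`, and `passes_full_grammar` are hypothetical. It shows a word sequence that satisfies every extracted word pair yet is not a sentence of the grammar.

```python
# Toy illustration only: this is NOT how Julius stores or searches grammars.

# Hypothetical grammar, written out as the finite set of sentences it accepts.
GRAMMAR = {
    ("turn", "on", "the", "light"),
    ("turn", "off", "the", "fan"),
}


def extract_word_pairs(sentences):
    """Collect every pair of adjacent words that occurs in some sentence."""
    pairs = set()
    for sentence in sentences:
        pairs.update(zip(sentence, sentence[1:]))
    return pairs


def passes_word_pair(words, pairs):
    """True if every adjacent word pair in `words` is an allowed pair."""
    return all(pair in pairs for pair in zip(words, words[1:]))


def passes_full_grammar(words, sentences):
    """True only if the whole word sequence is a sentence of the grammar."""
    return tuple(words) in sentences


WORD_PAIRS = extract_word_pairs(GRAMMAR)

# Each adjacent pair below appears somewhere in the grammar,
# but the sequence as a whole is not one of its sentences.
candidate = ["turn", "on", "the", "fan"]
print(passes_word_pair(candidate, WORD_PAIRS))   # True
print(passes_full_grammar(candidate, GRAMMAR))   # False
```

In this toy setting, a hypothesis such as "turn on the fan" survives a pass that checks only word pairs and is rejected only once the full grammar is applied. Applying the full grammar already on the 1st pass removes such hypotheses earlier.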
The graduation thesis presentation meeting was held. The following ten members gave presentations:
The 2019 master’s thesis review meeting was held. The following members gave presentations:
A paper by Ryota Tanaka (M2) has been published in the journal Computer Speech & Language.
Ryota Tanaka, Akihide Ozeki, Shugo Kato, Akinobu Lee, “Context and Knowledge Aware Dialogue System and System Combination for Grounded Response Generation,” Computer Speech & Language, vol. 62, July 2020. http://www.sciencedirect.com/science/article/pii/S0885230820300036
A paper entitled “Speaker Aware BERT for Multi-Party Dialogue Selection” by Tatsuya Nishiyama has been accepted for poster presentation at AAAI2020/DSTC8, which will be held in New York on February 8, 2020.