Showa-ku, Nagoya, Aichi 4668555 JAPAN
ri@nitech.ac.jp
Welcome! Here at Lee Laboratory of Nagoya Institute of Technology, we focus on human-to-human and human-to-machine communication through speech and language, and are conducting research on speech recognition, spoken dialogue systems, natural language processing, speech-based interaction, and avatar communication. Our goal is to advance technologies related to spoken language processing and to realize highly sophisticated, voice- and language-driven man–machine interfaces that are truly natural and user-friendly for everyone.
LEE Akinobu was born in Kyoto, Japan, on December 19, 1972. He received the B.E. and M.E. degrees in information science, and the Ph.D. degree in informatics from Kyoto University, Kyoto, Japan, in 1996, 1998 and 2000, respectively. He worked at Nara Institute of Science and Technology as an assistant professor from 2000 to 2005. He is currently a professor at Nagoya Institute of Technology, Japan. His research interests include speech recognition, spoken language understanding, and spoken dialogue systems. He is a member of IEEE, ISCA, JSAI, IPSJ and the Acoustical Society of Japan.
He is also a researcher who loves coding and has been involved in open-source activities for over 25 years. Below is a list of open-source software and CG avatars for which he serves as the lead developer:
At the 153rd (Spring 2025) Research Presentation of the Acoustical Society of Japan, held from March 17 to March 19, Keigo Ichikawa (M2), Momone Suzuki (B4), and Professor Sei Ueno gave presentations on the following topics.
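“Data generation for speaker diarization based on speaker transition probabilities” Keigo Ichikawa, Sei Ueno, Akinobu Lee
“Speech recognition error correction using large language models that take acoustic information into account” Momone Suzuki, Sei Ueno, Akinobu Lee
“Data augmentation for speech recognition by speech synthesis using diffusion models” Sei Ueno, Akinobu Lee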
At the 31st Annual Meeting of the Association for Natural Language Processing (NLP2025), held from March 10 to March 14, B4 student Yukito Minari gave a presentation on the following topic.
“Realizing a TRPG game master system with LLM-based multi-agents” Yukito Minari, Sei Ueno, Akinobu Lee
At the HAI Symposium 2025, held from February 28 to March 1, M2 students Akari Kawamata and Yuki Fujioka, as well as B4 students Soma Suzuki, Takumi Yoshida, and Koki Yamada, presented on the following topics.
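“Methodology and comparative evaluation of cartoon-style expression in embodied CG dialogue agents” Akari Kawamata, Sei Ueno, Akinobu Lee (Nagoya Institute of Technology)
“Speech-driven motion generation for remote dialogue via CG avatars and analysis of CG-specific characteristics” Yuki Fujioka, Sei Ueno, Akinobu Lee
“A study of real-time conversational motion generation from speech for avatar conversation support” Soma Suzuki, Sei Ueno, Akinobu Lee
“An immersive spoken dialogue system using virtual simulated patients for medical interview education” Takumi Yoshida, Sei Ueno, Akinobu Lee
“Effects of pseudo social touch in dialogue systems using CG agents” Koki Yamada, Sei Ueno, Akinobu Lee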
We’ve started uploading VRM versions of our CG avatars to VRoid Hub.
Lab’s page on VRoid Hub
The “Gene” and “Uka” models are set to public, so you can use them with VRoid-enabled tools.
The models are provided under the CC-BY 4.0 license, and “Gene” has an additional restriction on commercial use. Please consult the CG avatars page for details.
At the 2024 master’s degree interim presentation, M2 students Akari Kawamata and Ryotaro Kimata won the Best Presentation Award of the Tokai Chapter of the Acoustical Society of Japan (日本音響学会東海支部優秀発表賞).
Keigo Ichikawa (M2) gave a presentation at the Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2024), held from December 3 to December 6.
“Data generation for speaker diarization by speaker transition information” Keigo Ichikawa, Sei Ueno, and Akinobu Lee
A new CG-CA, “Nirva”, designed by illustrator Mai Yoneyama, has been released.
CG-CA Nirva
The CG-CA page has also been updated to include system information.
We have released MMDAgent-EX, our open-source platform for CG avatar-based spoken dialogue systems, multimodal dialogue, and avatar communication.
Links:
Press release (by NITech, in Japanese)
Official site
GitHub