Lee Lab., NITech

Welcome! Here at Lee Laboratory of Nagoya Institute of Technology, we focus on human-to-human and human-to-machine communication through speech and language, and are conducting research on speech recognition, spoken dialogue systems, natural language processing, speech-based interaction, and avatar communication. Our goal is to advance technologies related to spoken language processing and to realize highly sophisticated, voice- and language-driven man–machine interfaces that are truly natural and user-friendly for everyone.

Our research topic involves:

Speech recognition and synthesis
Spoken dialogue system
Natural language processing
CG based humanoid agent interaction
Avatar communication

Lee Lab is running in collaboration with Sako laboratory. We also have a cooperative relationship with Tokuda, Nankaku and Hashimoto Laboratory.

About the PI

LEE Akinobu was born in Kyoto, Japan, on December 19, 1972. He received the B.E. and M.E. degrees in information science, and the Ph.D. degree in informatics from Kyoto University, Kyoto, Japan, in 1996, 1998 and 2000, respectively. He worked on Nara Institute of Science and Technology as an assistance professor from 2000-2005. Currently he is a professor of Nagoya Institute of Technology, Japan. His research interests include speech recognition, spoken language understanding, and spoken dialogue system. He is a member of IEEE, ISCA, JSAI, IPSJ and the Acoustical Society of Japan.

He is also a researcher who loves coding and has been involved in open-source activities for over 25 years. Below is a list of open-source software and CG avatars for which he serves as the lead developer:

ASR engine Julius (from 1996)
CG Agent Interaction Toolkit MMDAgent (2011～)
Extended version for CG Avatar interaction: MMDAgent-EX MMDAgent-EX (2020～)
High-quality Open CG Avatars: Gene and Uka (2023～)

News and Posts

At the The Sixth Joint Meeting Acoustical Society of America and Acoustical Society of Japan, held from December 1 to November 5, Keigo Ichikawa(D1), Umi Okamoto(M2), Junichi Shimazaki(M2), Momone Suzuki(M1) gave presentations on the following topics. 「Multi-talker conversational speech generation for training speaker diarization model via text-to-speech」 Keigo Ichikawa，Sei Ueno，Akinobu Lee 「Fine-Tuning Strategies for Large-Scale Face-Conditioned Text-to-Speech」 Umi Okamoto，Sei Ueno，Akinobu Lee 「Speech Synthesis with Diverse Laughter Types using Artificial Data」 Junichi Shimazaki，Sei Ueno，Akinobu Lee

2025年11月15日(土)・16日(日)に開かれる名古屋工業大学祭「工大祭2025」の企画として、李研から遠隔会話 CG アバターシステムを展示します。操作体験コーナーもあり、アバター越しの会話を体験いただけます。詳しくは以下のページをご覧ください。 As part of the NITech Festival 2025 held on November 15 (Sat) and 16 (Sun), 2025, our laboratory (Lee Lab) will exhibit its remote-conversation CG Avatar System. We will also have an interactive hands-on area, where visitors can experience conversations through the avatar interface. For more details, please see the page below (in Japanese). https://leelabz.notion.site/2025-CG-CA-2a9d5612f79380189979f71af9438447

At the The 2025 Conference on Empirical Methods in Natural Language Processing, held from November 4 to November 9, Yukito Minari(M1) gave presentations on the following topics. 「TRPG Game Mastering Using LLM-Based Multi-Agent System」 Yukito Minari，Sei Ueno，Akinobu Lee

At the 17th Asia Pacific Signal and Information Processing Association Annual Summit and Conference, held from October 22 to October 24, Umi Okamoto(M1) gave presentations on the following topics. 「Face-conditioned Large-scale Text-to-Speech via Speaker Embedding Prediction from Facial Images」 Umi Okamoto，Sei Ueno，Akinobu Lee

At the 153rd (Spring 2025) Research Presentation of the Acoustical Society of Japan, held from March 17 to March 19, Keigo Ichikawa (M2), Momone Suzuki (B4), and Professor Sei Ueno gave presentations on the following topics. 「話者遷移確率に基づく話者ダイアライゼーションのためのデータ生成」 Keigo Ichikawa，Sei Ueno，Akinobu Lee 「音響情報を考慮した大規模言語モデルによる音声認識の誤り訂正」 Momone Suzuki，Sei Ueno，Akinobu Lee 「拡散モデルを用いた音声合成による音声認識のデータ拡張」 Sei Ueno，Akinobu Lee

At the 31st Annual Meeting of the Association for Natural Language Processing (NLP2025), held from March 10 to March 14, B4 student Yukito Minari gave a presentation on the following topic. 「LLM ベースのマルチエージェントによる TRPG ゲームマスターシステムの実現」 Yukito Minari，Sei Ueno，Akinobu Lee

At the HAI Symposium 2025, held from February 28 to March 1, M2 students Akari Kawamata and Yuki Fujioka, as well as B4 students Soma Suzuki, Takumi Yoshida, and Koki Yamada, presented on the following topics. 「身体性を持つCG対話エージェントにおけるカートゥーン調表現の方法論および比較評価」 Akari Kawamata, Sei Ueno，Akinobu Lee(Nagoya Institute of Technology) 「CGアバター遠隔対話のための音声からのモーション生成およびCG特有性の分析」 Yuki Fujioka, Sei Ueno，Akinobu Lee 「アバター会話支援のための音声からのリアルタイム会話モーション生成の検討」 Soma Suzuki, Sei Ueno，Akinobu Lee 「医療面接教育のための仮想模擬患者を用いた没入型音声対話システム」 Takumi Yoshida, Sei Ueno，Akinobu Lee 「CGエージェントを用いた対話システムにおける疑似ソーシャルタッチの効果」 Koki Yamada, Sei Ueno，Akinobu Lee

We’ve started uploading VRM version of our CG avatars on VRoid Hub. Lab’s page on VRoid Hub “Gene” and “Uka” models are set to public, so you can use them with vroid-enabled tools. Models are provided with CC-BY 4.0 license, and “Gene” has additional restriction for commercial use. Please consult CG avatars page for details.

We’ve updated part of the HP: Top Research CG avatars Also added a new CG avatar named “Magi” to tge CG avatars page. It is a new gender-free model.

At the 2024 master’s degree interim presentation, M2 students Akari Kawamata and Ryotaro Kimata won the Best Presentation Award, The Tokai Chapter of Acoustical Society of Japan（日本音響学会東海支部優秀発表賞）.

About the PI

LEE Akinobu

News and Posts

At the Sixth Joint Meeting Acoustical Society of America and Acoustical Society of Japan, Keigo Ichikawa, Umi Okamoto, Junichi Shimazaki, Momone Suzuki gave presentations

Exhibition of Our CG Avatar System at the NITech Festival on November 15–16

At the 2025 Conference on Empirical Methods in Natural Language Processing, Yukito Minari gave presentations

At the 17th Asia Pacific Signal and Information Processing Association Annual Summit and Conference, Umi Okamoto(M2) gave presentations

At the 153rd (Spring 2025) Research Presentation of the Acoustical Society of Japan, Keigo Ichikawa (M2), Momone Suzuki (B4), and Professor Sei Ueno gave presentations

At the 31st Annual Meeting of the Association for Natural Language Processing (NLP2025), B4 student Yukito Minari gave a presentation.

At the HAI Symposium 2025, M2 students Akari Kawamata and Yuki Fujioka, along with B4 students Soma Suzuki, Takumi Yoshida, and Koki Yamada, gave presentations.

VRM version of our CG Avatar is now on VRoid Hub

A New CG Avatar Magi, and some updates

M2 students Akari Kawamata and Ryotaro Kimata were awarded at the 2024 Master's Interim Presentation