NCMMSC 2026 (National Conference on Man-Machine Speech Communication) is a CCF Class C conference held in Zhuhai, China, from 2026-11-05 to 2026-11-08. The paper submission deadline is 2026-07-05. Acceptance notifications are sent on 2026-08-31.
The National Conference on Man-Machine Speech Communication (NCMMSC) is one of the most influential academic conferences in the fields of speech science and speech technology in China. Since its establishment in 1990, NCMMSC has remained at the forefront of academic research while addressing real-world industrial needs. Over the years, it has become a premier platform bringing together researchers and practitioners from academia and industry to advance innovation in speech and language intelligence technologies. The conference has long focused on key areas including speech science, speech technologies, audio signal processing, and human-computer interaction. It plays an important role in promoting scientific exchange, technological breakthroughs, and real-world deployment, and contributes significantly to the high-quality development of speech research and applications.
The 21st National Conference on Man-Machine Speech Communication (NCMMSC 2026) will be held on November 5-8, 2026, at the Tianmu Melody Convention and Exhibition Center, Hengqin, Zhuhai, Guangdong-Hong Kong-Macao Greater Bay Area, China.
NCMMSC 2026 marks the conference's return to the Greater Bay Area after nearly two decades. Leveraging Hengqin's strategic location adjacent to Macao and its strong international connectivity, the conference aims to further strengthen international academic collaboration and deepen industry-academia integration, creating a globally oriented forum for speech and language intelligence research.
The year 2026 also represents a significant milestone for NCMMSC. The conference has been officially included in the China Computer Federation (CCF) Recommended Conference List (Class C), reflecting broad recognition of its academic impact, organizational quality, and research excellence.
NCMMSC 2026 is jointly organized by the China Computer Federation (CCF) and the Chinese Information Processing Society of China (CIPS), and hosted by the CCF Technical Committee on Speech Dialogue and Auditory Processing, the CIPS Technical Committee on Speech Information, Tsinghua Shenzhen International Graduate School, Tsinghua University, The Chinese University of Hong Kong (Shenzhen), and Beijing Institute of Technology (Zhuhai). The conference also serves as the annual academic meeting of both technical committees.
The theme of NCMMSC 2026 is: Speech × Language × Multimodality: Towards a New Paradigm of Intelligent Interaction. Driven by recent advances in large-scale AI models, the fields of speech technology, natural language processing, and multimodal intelligence are increasingly converging. In line with this trend, NCMMSC 2026 will focus on emerging topics including speech foundation models, multimodal interaction, affective and cognitive computing, AI-powered speech technologies for healthcare, and accessible and inclusive AI, promoting the evolution of speech technologies from perception and generation toward cognitive understanding and intelligent interaction.
A major highlight of NCMMSC 2026 is that it will be co-located with NLPCC 2026 and AACL 2026, creating an integrated ecosystem for speech, language, and multimodal intelligence. This initiative aims to encourage interdisciplinary and cross-community collaboration while further enhancing international visibility and impact.
NCMMSC 2026 will feature a rich technical program including keynote speeches, special sessions, a young researchers forum, a student forum, an industry forum, technical competitions, and technology demonstrations. By bringing together leading researchers and industry experts from around the world, the conference aims to discuss future visions and key technological challenges in human–machine speech communication in the era of intelligent systems, while promoting scientific innovation in support of national priorities and societal needs.
Topics of Interest
1. Speech Science and Linguistics
• Speech production, perception, and cognitive mechanisms
• Phonetics, phonology, and prosody
• Speech science, discourse, and conversational analysis
• Multilingual and dialectal speech
• Auditory modeling and neural mechanisms of speech
2. Speech Analysis, Synthesis, and Conversion
• Automatic speech recognition and understanding
• Speech synthesis and voice conversion
• Speaker, language, and paralinguistic analysis
• Speech enhancement, separation, and robust processing
• Low-resource and cross-lingual speech processing
• Speech security, spoofing detection, and adversarial robustness
3. Audio, Music, and Acoustic Signal Processing
• Audio and acoustic signal processing and modeling
• Spatial audio and sound-field analysis
• Acoustic event detection and scene understanding
• Audio information processing and generation
• Music information processing and generation
• Audio–speech integration and auditory modeling
4. Speech and Language Understanding with Large Models
• Speech foundation models and large-scale models
• Unified speech-language modeling
• Multimodal large models
• Speech semantic understanding and generation
• Explainable and controllable speech intelligence
5. Spoken Dialogue Systems and Multimodal Interaction
• Spoken dialogue systems and speech agents
• Multi-turn dialogue and interaction modeling
• Multimodal human-computer interaction
• Affective and empathetic interaction
• Virtual humans and immersive interaction
6. AI for Healthcare, Accessibility, and Wellness
• Pathological speech analysis and applications
• Mental health and cognitive modeling
• Speech rehabilitation and assistive communication
• Accessible speech technologies and inclusive AI
• Medical dialogue systems and health intelligence
7. Speech and Language Data, Evaluation, and Systems
• Speech and multimodal dataset construction
• Data annotation and quality assessment
• Benchmarks and evaluation methodologies
• Industrial-scale systems and deployment
• Real-time and edge speech processing
8. Generative AI and Emerging Interdisciplinary Topics
• Generative speech and audio technologies
• Speech and audio for embodied AI
• Speech in virtual and immersive environments
• Ethics, fairness, explainability, and privacy in speech AI
• Industrial applications and interdisciplinary innovations