Join Nexdata MLC-SLM Workshop at Interspeech 2025

03-12-2025 07:22 PM CET | Business, Economy, Finances, Banking & Insurance

Press release from: ABNewswire

Nexdata [https://www.nexdata.ai/] is thrilled to announce that its MLC-SLM Workshop proposal has been officially approved, making the MLC-SLM Workshop an Interspeech 2025 Satellite Event. The workshop aims to bring together researchers, developers, and industry professionals to explore the latest advancements in multilingual conversational AI. As conversational speech models are essential to bridging communication gaps across languages and cultures, this event will provide a unique opportunity to delve into innovative solutions and the future of AI-driven dialogue systems.

Whether you are a researcher, developer, or enthusiast, Nexdata invites you to participate actively in this collaborative workshop and share your insights, contributing to the development of cutting-edge multilingual models that will shape the future of global conversations. Join Nexdata at this pivotal event to network, learn, and push the boundaries of speech technology!

Image: https://www.abnewswire.com/upload/2025/03/9cf565151f81ae5d81f0d2166c4ad36d.jpg

Workshop Motivation

Large Language Models (LLMs) have demonstrated remarkable capabilities in a wide range of downstream tasks, serving as powerful foundation models for language understanding and generation. Furthermore, there has been significant attention on utilizing LLMs in speech and audio processing tasks such as Automatic Speech Recognition (ASR), Audio Captioning, and emerging areas like Spoken Dialogue Models.

However, real-world conversational speech data is critical for the development of robust LLM-based Spoken Dialogue Models, as it encapsulates the complexity of human communication, including natural pauses, interruptions, speaker overlaps, and diverse conversational styles. The limited availability of such data, especially in multilingual settings, poses a significant challenge to advancing the field.

The importance of real-world conversational speech extends beyond technological advancement: it is essential for building AI systems that can understand and respond naturally in multilingual, dynamic, and context-rich environments. This is especially crucial for next-generation human-AI interaction systems, where spoken dialogue serves as a primary mode of communication.

Thus, this workshop aims to bridge the gap by hosting the challenge of building multilingual conversational speech language models together with the release of a real-world multilingual conversational speech dataset.

Task Setting

The event consists of two tasks, both of which require participants to explore the development of speech language models:

Task 1: Multilingual Conversational Speech Recognition

Participants will be provided with oracle segmentation for each conversation.

Objective: Develop a multilingual LLM-based ASR model.

This task focuses on optimizing transcription accuracy in a multilingual setting.

Task 2: Multilingual Conversational Speech Diarization and Recognition

No prior or oracle information will be provided during evaluation (e.g., no pre-segmented utterances or speaker labels).

Objective: Develop a system for both speaker diarization (identifying who is speaking when) and recognition (transcribing speech to text).

Both pipeline-based and end-to-end systems are encouraged, providing flexibility in system design and implementation.
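As an illustration only, a pipeline-based Task 2 system can be pictured as a diarization stage feeding a recognition stage. The sketch below is not the workshop baseline; the names `Segment`, `Utterance`, `run_pipeline`, `toy_diarize`, and `toy_recognize` are hypothetical, and the two toy functions merely stand in for real diarization and ASR models.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class Segment:
    """One diarized span: who is speaking, and when (in seconds)."""
    speaker: str
    start: float
    end: float


@dataclass
class Utterance:
    """A diarized span together with its transcript."""
    speaker: str
    start: float
    end: float
    text: str


# Hypothetical interfaces: any diarizer or recognizer with these shapes fits.
Diarizer = Callable[[Sequence[float], int], List[Segment]]
Recognizer = Callable[[Sequence[float], int], str]


def run_pipeline(
    audio: Sequence[float],
    sample_rate: int,
    diarize: Diarizer,
    recognize: Recognizer,
) -> List[Utterance]:
    """Diarize the whole recording, then transcribe each segment in time order."""
    utterances: List[Utterance] = []
    for seg in sorted(diarize(audio, sample_rate), key=lambda s: s.start):
        # Convert segment times to sample indices, clamped to the recording.
        lo = max(0, int(seg.start * sample_rate))
        hi = min(len(audio), int(seg.end * sample_rate))
        if hi <= lo:
            continue  # skip empty or out-of-range segments
        text = recognize(audio[lo:hi], sample_rate)
        utterances.append(Utterance(seg.speaker, seg.start, seg.end, text))
    return utterances


# Toy stand-ins (assumptions for illustration, not real models).
def toy_diarize(audio: Sequence[float], sample_rate: int) -> List[Segment]:
    return [Segment("spk2", 1.0, 2.0), Segment("spk1", 0.0, 1.0)]


def toy_recognize(chunk: Sequence[float], sample_rate: int) -> str:
    return f"<{len(chunk)} samples>"


if __name__ == "__main__":
    two_seconds_of_silence = [0.0] * 32000
    for utt in run_pipeline(two_seconds_of_silence, 16000, toy_diarize, toy_recognize):
        print(f"[{utt.start:.1f}-{utt.end:.1f}] {utt.speaker}: {utt.text}")
```

An end-to-end system would replace both stages with a single model that emits speaker-attributed text directly; the structure above applies only to the pipeline-based option.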

Other Topics

Participants are encouraged to submit research papers and system descriptions that showcase innovative findings, practical case studies, and forward-looking ideas. Topics of interest include, but are not limited to:

- Novel architectures and algorithms for training speech language models.

- Novel pipelines for processing raw audio data, which are useful for collecting diverse internet data for training speech language models.

- Algorithms designed to generate more natural and emotionally rich conversational speech for dialogue systems.

- Approaches to leverage multi-turn conversational history to improve recognition and diarization results.

- Innovative evaluation techniques or benchmarks for speech language models.

- New datasets (real and synthetic) for training speech and audio language models.

Image: https://www.abnewswire.com/upload/2025/03/991f618e8987bd5045d2681f00146736.jpg

Important Dates

February 20, 2025: Registration opens

March 10, 2025: Training data release

March 17, 2025: Development set and baseline system release

May 15, 2025: Evaluation set release and leaderboard open

June 1, 2025: Leaderboard freeze and submission portal opens (CMT system)

June 20, 2025: Submission deadline

July 10, 2025: Notification of acceptance

August 22, 2025: Workshop date

Organizers

Lei Xie, Professor, Northwestern Polytechnical University (China)

Shinji Watanabe, Associate Professor, Carnegie Mellon University (USA)

Eng Siong Chng, Associate Professor, Nanyang Technological University (Singapore)

Junlan Feng, IEEE Fellow & Chief Scientist, China Mobile (China)

Khalid Choukri, Secretary General, European Language Resources Association (France)

Qiangze Feng, Co-founder & Data Scientist, Nexdata (USA)

Daliang Wang, Data Scientist, Nexdata (USA)

Pengcheng Guo, PhD Student, Northwestern Polytechnical University (China)

Bingshen Mu, PhD Student, Northwestern Polytechnical University (China)

More about: 3d point cloud data service [https://www.nexdata.ai/service/point-cloud]

Media Contact
Company Name: Nexdata
Email: Send Email [https://www.abnewswire.com/email_contact_us.php?pr=join-nexdata-mlcslmworkshop-at-interspeech-2025]
Address: 28 Birchgove Cr
City: Eastwood
State: NSW 2122
Country: United States
Website: https://www.nexdata.ai/

Legal Disclaimer: Information contained on this page is provided by an independent third-party content provider. ABNewswire makes no warranties or responsibility or liability for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this article. If you are affiliated with this article or have any complaints or copyright issues related to this article and would like it to be removed, please contact retract@swscontact.com



This release was published on openPR.

News-ID: 3913409
