
Speech
The mission of the Seed Speech team is to enrich interactive and creative processes by applying multimodal speech technologies. The team works at the forefront of research and product development in speech and audio, music, natural language understanding, and multimodal deep learning.
Latest advancements
Selected papers
Jul 24, 2025
Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice
Speech&Audio
Feb 25, 2025
You Only Sample Once: Taming One-Step Text-to-Image Synthesis by Self-Cooperative Diffusion GANs
Computer Vision
Sep 13, 2024
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Speech&Audio
Featured roles
Research Scientist, Multimodality
San Jose / Seattle
Experienced Hiring
Research Scientist, Foundation Model, Music Intelligence
San Jose
Experienced Hiring
Research Scientist in Foundation Model, Speech & Audio Generation - 2025 Start (PhD)
San Jose / Seattle
Campus Hiring
Research Scientist in Foundation Model, Music - 2025 Start (PhD)
San Jose
Campus Hiring
Student Researcher (Seed - Foundation Model - Speech Understanding) - 2025 Start (PhD)
San Jose / Seattle
Internship
Student Researcher (Seed - Music Foundation Model) - 2025 Start (PhD)
San Jose
Internship