ByteDance | Research Scientist in Foundation Model Music Core Machine Learning Graduates 2024 Start PhD | San Jose, CA | United States | | 6/30/2024

Before u proceed below to check the jobs/CVs, please select your favorite job categories, whose top job alerts you want in your email & Subscribe to our Email Job Alert Service For FREE


Job Location: San Jose, CA

Job Detail:

Founded in 2023, ByteDance Doubao Team is dedicated to crafting the industry’s most advanced LLMs. We aim to lead global research and foster both technological and social progress.

With a long-term vision and a strong commitment to the AI field, the Team conducts research in a range of areas including natural language processing (NLP), computer vision (CV), and speech recognition and generation. It boasts a robust international presence with labs and research facilities in China, Singapore, and the US. Leveraging substantial data and computing resources and through continued investment in these domains, our team has built a proprietary general-purpose model with multimodal capabilities. This model supports over 50 downstream business services including Doubao, Coze, and Dreamina, which are available to enterprise customers via the Volcano Engine. Currently, the Doubao App is the most used AIGC application in the China market.

Why Join Us

Creation is the core of ByteDance’s purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible.

Together, we inspire creativity and enrich life – a mission we aim towards achieving every day.

To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At ByteDance, we create together and grow together. That’s how we drive impact – for ourselves, our company, and the users we serve.

Join us.

Team Intro

The Speech team’s mission is to empower content interaction and creation using speech & audio related technologies. The team focuses on cutting-edge R&D in areas like speech & audio, music processing, natural language understanding and multimodal deep learning. The team builds AI training and inference systems based on GPUs and advances the state-of-the-art of AI system technologies to accelerate large audio/music language models. The team is also responsible for the development of the complete engineering cycle of large models, including data preparing/processing, model training/evaluation/deployment, etc.

We are looking for talented individuals to join our team in 2024. As a graduate, you will get unparalleled opportunities for you to kickstart your career, pursue bold ideas and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.

Successful candidates must be able to commit to a start date before the end of 2024. Please state your availability and graduation date clearly in your resume.

Applications will be reviewed on a rolling basis. We encourage you to apply early.

Candidates can apply to a maximum of two positions and will be considered for jobs in the order you apply. The application limit is applicable to ByteDance and its affiliates’ jobs globally.


  • Conduct cutting-edge machine learning research and development in music understanding and generation.
  • Transfer advanced technologies to ByteDance products.
  • Explore new products with music intelligence technology at its core.

Minimum Qualifications

  • Ph.D. graduate with a background in computer science, mathematics, engineering, or a related field
  • Experience in one or more areas of deep learning and music intelligence, including but not limited to:
  • Deep generative models
  • Representation learning
  • Sequence classification
  • Music information retrieval and source separation
  • Speech recognition and synthesis
  • Natural language processing
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.

Preferred Qualifications

  • Publications in top-tier venues such as NeurIPS, ICLR, ICML, ICASSP, INTERSPEECH, ACL, EMNLP, ISMIR.
  • Familiar with deep learning frameworks such as Tensorflow and PyTorch.
  • Highly competent in algorithms and programming; Strong coding skills in Python and C/C++.
  • Familar with large-scale training and data processing.
  • Familiar with engineering principles and best practices.
  • Ability to work collaboratively in a fast-paced, multi-functional environment.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at

By submitting an application for this role, you accept and agree to our global applicant privacy policy, which may be accessed here:

Apply Here

Submit CV To All Data Science Job Consultants Across United States For Free


Please enter your comment!
Please enter your name here