Hedra Logo

Hedra

Machine Learning Engineer (CUDA)

Posted 21 Days Ago
2 Locations
Mid level
2 Locations
Mid level
The Machine Learning Engineer will optimize GPU performance for 3DVAE and video diffusion models using CUDA, collaborating with teams on model requirements and bottlenecks.
The summary above was generated by AI

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

Note: At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content and build a generational company together. You should have start-up experience and be a self-starter that is driven to build impactful products that change the status quo. The Hedra team is fully in-person in SF/NY with a shared love of solving hard problems using whiteboards.

Overview:

We are seeking a talented CUDA ML Engineer to optimize our machine learning models for high-performance computing on GPU hardware. The ideal candidate will have expertise in CUDA programming and a deep understanding of how to leverage GPU acceleration to maximize the efficiency of our 3DVAE and video diffusion models.

Responsibilities:
  • Optimize machine learning models, specifically 3DVAE and video diffusion models, for GPU performance using CUDA, ensuring efficient training and inference.

  • Develop and implement efficient algorithms and data structures for GPU computation, addressing performance bottlenecks in video generation tasks.

  • Work closely with the research and engineering teams to understand model requirements and performance bottlenecks, facilitating collaboration.

  • Stay current with the latest advancements in GPU technology and machine learning optimization techniques.

  • Ensure that our models run efficiently on various GPU architectures, supporting scalability for large-scale training.

Qualifications:
  • Bachelor’s degree in Computer Science, Electrical Engineering, or a related field, with a focus on high-performance computing.

  • Strong programming skills in C++ and CUDA, essential for GPU optimization.

  • Experience with deep learning frameworks that support GPU acceleration, such as PyTorch or TensorFlow, crucial for model implementation.

  • Understanding of parallel computing concepts and GPU architecture, given the need to optimize for hardware constraints.

  • Familiarity with machine learning models, particularly generative models, to align optimizations with model needs.

  • Excellent problem-solving and debugging skills, necessary for addressing performance issues.

Benefits:
  • Competitive compensation and equity

  • 401k (no match)

  • Healthcare (Silver PPO Medical, Vision, Dental)

  • Lunch and snacks at the office

We encourage you to apply even if you don't fully meet all the listed requirements; we value potential and diverse perspectives, and your unique skills could be a great asset to our team.

Top Skills

C++
Cuda
PyTorch
TensorFlow
HQ

Hedra San Francisco, California, USA Office

Hedra HQ Office

South Park

Similar Jobs at Hedra

15 Days Ago
2 Locations
Mid level
Mid level
Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
As a Full-Stack Engineer, you'll build and deploy frontend and backend services for transformative products, utilizing modern web standards and various frameworks.
Top Skills: DynamoDBEksFfmpegHlsJavaScriptKafkaKubernetesMkvMp-DashMp4NextjsPythonReactSqsTypescriptWebrtc
Yesterday
2 Locations
Senior level
Senior level
Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
As a Senior Backend Engineer, you will design and deploy backend services for video/audio creation tools, working with Python and cloud infrastructure.
Top Skills: AWSDockerDynamoDBFastapiFfmpegGoKafkaKubernetesPydanticPythonSqs
21 Days Ago
2 Locations
Mid level
Mid level
Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
The Applied Research Scientist will fine-tune generative models for specific applications, focusing on model adaptation using techniques like transfer learning and data augmentation.
Top Skills: PythonPyTorchTensorFlow

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account