Design, optimize, and deploy multimodal AI models for real-world applications focusing on vision and language understanding, ensuring accuracy and performance in production systems.
VOLT is building the next generation of AI perception systems for the physical world, focused on safety, security, and real-time risk detection.
We are seeking a Senior Applied AI & Machine Learning Engineer to design, optimize, and ship multimodal AI models that operate reliably in real-world environments. This is a deeply applied role, centered on taking models from data to production—across both edge devices and cloud infrastructure.
You will work on vision, video, and language-based models that understand real-world scenes and events, and you will be accountable for their accuracy, latency, robustness, and cost in production systems.
This role reports directly to the Head of Engineering and plays a critical role in advancing VOLT AI’s core perception platform.
Key Responsibilities
- Build, fine-tune, and deploy production-grade multimodal models for safety and security applications, with a focus on visual and video perception, language-assisted and multimodal reasoning, and temporal understanding of real-world environments
- Own the full applied ML lifecycle, including data collection, labeling strategies, and dataset curation, model fine-tuning, evaluation, and iteration, and deployment, monitoring, and continuous improvement in production
- Drive model performance in real-world conditions, optimizing for high precision and recall, low false positives and false negatives, and robustness to noise, lighting changes, occlusion, and domain shift
- Optimize models for edge and cloud deployment, including quantization, pruning, and model compression, latency, throughput, and memory optimization, and hardware-aware tuning for GPUs and edge accelerators
- Build and maintain training and inference pipelines that support scalable experimentation and evaluation, reproducibility and model versioning, and reliable production deployment
- Collaborate closely with infrastructure and systems engineers to integrate models into real-time perception pipelines, balance accuracy, performance, and cost constraints, and diagnose and resolve production inference issues
- Use real-world deployment feedback and metrics to drive data and model improvements
Required Qualifications
- 8+ years of experience in applied machine learning or AI systems
- Strong hands-on experience with vision, video, or multimodal models
- Proven experience taking models into production, not just research prototypes
- Deep understanding of model optimization (quantization, pruning, performance tuning)
- Proficiency in Python and modern ML frameworks (e.g., PyTorch)
- Experience evaluating models using real-world metrics and constraints
- Ability to operate independently and own complex technical systems end to end
Preferred Qualifications
- Experience with multimodal or vision-language models (CLIP-like, BLIP-like, or custom)
- Experience deploying models to edge or resource-constrained environments
- Familiarity with inference optimization stacks (ONNX, TensorRT, CUDA)
- Experience working on physical-world perception systems (video, sensors, environments)
- Background in safety, security, robotics, or autonomous systems
- Experience mentoring senior engineers or providing technical leadership
What Success Looks Like
- Models ship reliably and improve measurable safety outcomes
- Precision and recall improve while inference cost and latency decrease
- Edge and cloud inference pipelines operate at production scale
- Data and model iteration loops accelerate over time
- AI perception becomes a durable competitive advantage for VOLT AI
At VOLT AI, you will build applied AI systems that run in the real world—on live video, in real environments, under real constraints. This role is for an engineer who wants to ship models, optimize them aggressively, and see their impact in production, not publish papers.
Top Skills
Cuda
Onnx
Python
PyTorch
Tensorrt
Similar Jobs
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Lead lifecycle marketing initiatives, design multi-channel strategies, analyze performance, partner cross-functionally, and evolve growth infrastructure while maintaining high quality.
Top Skills:
Ai ToolsBrazeIterableSalesforce Marketing CloudSQLTableau
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The role involves providing regulatory counsel for banking products, offering legal advice, managing risk, and ensuring compliance with banking regulations.
Top Skills:
Banking RegulationsCompliance Frameworks
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Manage and oversee Block's Global Sanctions Program, ensuring effectiveness and regulatory alignment of sanctions screening systems, while coordinating audits and governance reviews.
Top Skills:
And Compliance ToolsChange Management PlatformsGovernanceRiskSanctions Screening Systems
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

