Get the job you really want.
Maximum of 25 job preferences reached.
Top Infrastructure Engineer Jobs in San Francisco, CA
Information Technology • Software
The AI Infrastructure Specialist leads GPU infrastructure deployments, optimizes bare metal configurations, validates Kubernetes setups, and documents processes for customers to ensure self-sufficiency and scale operations effectively.
Top Skills:
BashCephCudaGoKubernetesLonghornNvidia Gpu OperatorsPythonRookWeka
Machine Learning • Software
The Infrastructure Engineer will develop DevOps tools and deployment strategies for Metaflow while shaping future innovations in data science and ML.
Top Skills:
CloudFormationKubernetesPulumiTerraform
Software
We are seeking a Senior Backend Infrastructure Engineer to design, build, and scale systems. You will work with backend services, APIs, and cloud infrastructure, focusing on secure and scalable applications. Responsibilities include optimizing performance, collaborating with cross-functional teams, and mentoring junior engineers.
Top Skills:
AWSAws CdkCi/CdCloudFormationDockerLinuxNode.jsPostgresTypescript
Artificial Intelligence • Information Technology • Software
Build and optimize infrastructure for private, personal AI models, ensuring performance and privacy while deploying at scale.
Top Skills:
Amd Sev-SnpConfidential ComputingGpusIntel TdxLoraxMachine LearningNvidia Confidential ComputingPunicaS-LoraSecure EnclavesTeesTransformersVllm
Artificial Intelligence • Cloud • Hardware • Software
The Senior / Staff Infrastructure Engineer will design and manage infrastructure for AI platforms, ensuring security, reliability, and scalability in cloud deployments. Responsibilities include deploying hybrid solutions and implementing infrastructure-as-code.
Top Skills:
AWSAzureCi/CdDockerGCPHelmKubernetesPulumiTerraform
Artificial Intelligence • Cloud • Machine Learning • Software • Database • App development • Generative AI
As a Senior Infrastructure Engineer at Replit, you will improve system reliability, automate infrastructure, optimize performance, and collaborate across teams to enhance developer experience.
Top Skills:
DatadogDockerGCPGoGrafanaKubernetesPrometheusPulumiPythonTerraform
Artificial Intelligence • Cloud • Machine Learning • Software • Database • App development • Generative AI
As a Staff Infrastructure Engineer, you'll ensure the reliability and scalability of Replit's infrastructure, automate processes, improve system performance, and mentor engineering teams on best practices.
Top Skills:
DatadogDockerGCPGoGrafanaKubernetesPrometheusPythonTerraform
Artificial Intelligence • Big Data • Computer Vision • Machine Learning
The Data Infrastructure Engineer will build backend architecture, manage data pipelines, and ensure high-performance compute clusters to support machine learning workloads.
Top Skills:
AWSAzureC++ClickhouseETLFfmpegGCPGoGstreamerOpencvPythonRustTimescaledb
Reposted 9 Days AgoSaved
Fintech • Machine Learning • Payments • Software • Financial Services
Lead AI Engineer responsible for developing AI-powered products and deploying scalable AI solutions using technologies like LLM and machine learning algorithms. Collaborate with cross-functional teams to optimize performance and support AI systems.
Top Skills:
AWSAzureGoGCPHuggingfaceJavaNemo GuardrailsPythonPyTorchScalaVectordbs
Artificial Intelligence • Software • Database • Analytics
The Cloud Infrastructure Engineer will develop and maintain infrastructure using Terraform and Kubernetes, support multi-cloud deployments, and improve CI/CD processes.
Top Skills:
AWSAzureGCPGoKubernetesPythonTerraformTypescript
Reposted 10 Days AgoSaved
Easy Apply
Easy Apply
Fintech • HR Tech
In this role, you'll architect and manage distributed database systems, oversee complex migrations, optimize performance, and mentor engineers, driving operational excellence.
Top Skills:
AuroraAWSKafkaKubernetesMySQLPostgresRdsRedisS3Tidb
Artificial Intelligence • Information Technology • Automation
As a Machine Learning Infrastructure Engineer, you'll design scalable infrastructure for AI research, manage deployments, and ensure reliable customer solutions, working directly with the CEO.
Top Skills:
Ci/CdGpu ComputeKubernetesMl InfrastructureRaySlurm
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Artificial Intelligence • Software • Automation
Design and build a cloud execution environment for AI agents, developing distributed systems, managing infrastructure, and ensuring observability.
Top Skills:
GoGCPKubernetesNomadRustTerraformTypescript
Information Technology • Automation
The SRE/Infrastructure Engineer will architect and manage secure, scalable systems for automated penetration testing, optimizing reliability, and enhancing infrastructure based on customer demand. Responsibilities include maintaining production environments, leading technical discussions, and promoting high coding standards.
Top Skills:
AWSAzureCloudFormationElkGCPNew RelicOpentelemetryPostgresPrometheusTerraform
News + Entertainment
The role involves designing scalable distributed systems using Scala, enhancing feature stores for performance, and collaborating with ML engineers to resolve challenges.
Top Skills:
AWSCassandraDockerJavaKafkaKubernetesNoSQLPostgresScalaSQL
Artificial Intelligence • Software
As a Platform Deployment Architect, you'll manage customer deployments of Sierra's AI platform, ensuring compliance and operational efficiency while collaborating across teams and establishing scalable processes.
Top Skills:
AWSCloud NetworkingContainer OrchestrationDatadogGrafanaPrometheusTerraform
Artificial Intelligence • HR Tech • Other • Software • Business Intelligence
The role involves designing and maintaining cloud infrastructure and DevOps tools at Compa, focusing on reliability, automation, and scaling of systems. Responsibilities include leading initiatives on multi-cloud support, CI/CD pipeline improvements, and incident response.
Top Skills:
Ci/CdCloud InfrastructureDevOpsInfrastructure As CodeKubernetesMl OpsMulti-Cloud EnvironmentsPython
Reposted 6 Days AgoSaved
Artificial Intelligence • Information Technology
Design, build, and operate research infrastructure to enhance research speed and efficiency. Collaborate with researchers to identify needs and streamline processes.
Top Skills:
JaxPythonPyTorchRayRustSpark
Software
Design and build scalable infrastructure for an AI SaaS platform, focusing on multi-tenant architectures, CI/CD pipelines, and cloud optimization.
Top Skills:
AnsibleAWSAzureGCPGoKubernetesPythonTerraformTypescript
Software • Analytics • Business Intelligence
The Lead Infrastructure Engineer at HOAi is responsible for designing and maintaining cloud architecture, optimizing AI workloads, and ensuring system reliability and performance. The role involves building CI/CD pipelines, managing infrastructure for AI products, and collaborating with engineering teams to enable fast feature deployment, while ensuring security and compliance.
Top Skills:
Apm ToolsCloud PlatformsGpu ManagementMl PipelinesModel Serving FrameworksPostgresRedisVector Databases
Artificial Intelligence • Information Technology
Responsible for building and scaling the GPU Cloud Marketplace, transforming GPUs from suppliers into a programmable, orchestrated pool for AI developers and researchers.
Top Skills:
BmcCi/CdCloud-InitCudaGpuInfinibandIpmiPulumiPxe BootRedfishTerraform
Artificial Intelligence • Marketing Tech • Software • Automation
The role involves building and optimizing cloud infrastructure and platform features, leveraging Golang, Kafka, and PostgreSQL in a fast-paced startup environment.
Top Skills:
GCPGoKafkaPostgres
Professional Services
The IT Infrastructure Engineer designs, deploys, and manages infrastructure and security components while ensuring system reliability and compliance with security principles.
Top Skills:
Active DirectoryAzureDhcpDnsFirewallsMicrosoft Defender SuiteMicrosoft Entra IdMicrosoft SentinelPowershellTcp/IpVpnsWindows Server
Artificial Intelligence • Information Technology • Software • Automation
Design and maintain cloud infrastructure with security for a B2B SaaS platform. Collaborate on security measures, compliance, and mentor junior engineers.
Top Skills:
AWSAzureGCPKubernetesTerraform
Financial Services
As an AI QA & Infrastructure Engineer, you'll design and execute test plans, maintain testing environments, track quality metrics, and improve QA processes for Aidaly's products.
Top Skills:
CypressPlaywrightSelenium
Top San Francisco Companies Hiring Infrastructure Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results











.png)




.png)
















