Top Tech Jobs & Startup Jobs in San Francisco Bay Area, CA

5 Days AgoSaved
In-Office
San Francisco, CA, USA
100K-270K Annually
Mid level
100K-270K Annually
Mid level
Artificial Intelligence • Machine Learning • Security • Software
Design, run, and automate pre-deployment evaluations of frontier AI models. Analyze large model transcripts to surface behaviors, build scalable evals, collaborate with AI labs, and improve evaluation pipelines and tooling.
Top Skills: InspectLarge Language Models (Llms)Python
5 Days AgoSaved
In-Office
San Francisco, CA, USA
100K-270K Annually
Senior level
100K-270K Annually
Senior level
Artificial Intelligence • Machine Learning • Security • Software
Build and maintain backend tools for AGI safety research: eval libraries, orchestration for parallel agentic evaluations, LLM proxy and telemetry, CI optimizations, data warehousing, and researcher-facing tooling. Lead feature development, collaborate with researchers, and promote good software design and reliability.
Top Skills: AWSCi/CdInspectLlmsPython
5 Days AgoSaved
In-Office
San Francisco, CA, USA
100K-180K Annually
Mid level
100K-180K Annually
Mid level
Artificial Intelligence • Machine Learning • Security • Software
Build and maintain full-stack features for an AI-agent safety product: responsive React front-ends, Python backends, REST APIs, scalable data pipelines, monitoring/observability, and secure integrations. Collaborate with researchers, participate in design and code reviews, and deliver production-ready tooling and visualizations for AI safety monitoring.
Top Skills: AWSDjangoDockerFastapiFlaskGCPJavaScriptPythonReactRestful ApisSolidSQLTypescript
5 Days AgoSaved
In-Office
San Francisco, CA, USA
215K-265K Annually
Senior level
215K-265K Annually
Senior level
Artificial Intelligence • Machine Learning • Security • Software
Build and evolve Apollo's cloud platform: define vision, implement IaC and networking, ensure observability and cost control, design security controls for sensitive AI work, operate services, and create safe infrastructure for agent tooling and multi-cloud GPU orchestration.
Top Skills: AWSAzureDockerGCPGoPulumiPythonRustTerraform
5 Days AgoSaved
In-Office
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Machine Learning • Security • Software
Conduct independent, governance-focused research on AI scheming and loss-of-control risks; produce reports, papers, legal analyses; collaborate with technical teams, policymakers, and external researchers; mentor junior researchers and shape team strategy.
5 Days AgoSaved
In-Office
San Francisco, CA, USA
100K-245K Annually
Mid level
100K-245K Annually
Mid level
Artificial Intelligence • Machine Learning • Security • Software
Build scalable backend systems to ingest, process, and store large volumes of AI agent logs in real time. Design APIs, authentication, rate limiting, webhooks, and SIEM integrations. Implement data pipelines, storage for structured and unstructured data, caching, retention policies, monitoring, observability, and reliability strategies. Collaborate with researchers and frontend engineers to translate prototypes into production-ready, secure, low-latency monitoring products.
Top Skills: Asynchronous ProcessingAWSAzureCloudFormationDistributed SystemsDjangoDockerFastapiFlaskGoGCPKafkaKubernetesMessage QueuesNoSQLPythonRedis StreamsRestful ApisSIEMSQLTerraform
5 Days AgoSaved
In-Office
San Francisco, CA, USA
100K-270K Annually
Senior level
100K-270K Annually
Senior level
Artificial Intelligence • Machine Learning • Security • Software
Build full-stack tools for AGI safety research: evaluation IDE, LLM-powered search, comparison views, streaming results, collaborative log editing, pipelines, agents, and telemetry. Collaborate with researchers and product to deliver robust, extensible software.
Top Skills: Inspect FrameworkLlm AgentsLlm EvaluationsPythonReactTelemetry/Instrumentation
5 Days AgoSaved
In-Office
San Francisco, CA, USA
100K-270K Annually
Mid level
100K-270K Annually
Mid level
Artificial Intelligence • Machine Learning • Security • Software
Conduct empirical research into AI "scheming": design and run RL experiments on LLMs, build model organisms, develop scalable evaluation techniques, study scaling laws and AI cognition, and collaborate with partner labs to influence deployment and mitigation strategies.
Top Skills: GpusLarge Language Models (Llms)Lm Evaluation Tools (Inspect)PythonReinforcement Learning
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account