Dropzone AI

AI Research Engineer

Posted Yesterday

Remote

Hiring Remotely in United States

200K-250K Annually

Senior level

Remote

Hiring Remotely in United States

200K-250K Annually

Senior level

The AI Research Engineer will design next-generation AI systems, focusing on agent architecture, memory engineering, and performance benchmarking, while translating research to scalable systems.

The summary above was generated by AI

About Dropzone AI

Dropzone’s mission is to scale cybersecurity beyond human limits, and augment every single human security engineer/analyst with an army of AI security specialists. Humans alone cannot sufficiently protect our digital future, and AI augmentation is the only way for defenders to reclaim the high ground. We are an award winning company disrupting the $200B+ cybersecurity market.
Powered by Gen AI advancements, our technology offloads repetitive day-to-day work and frees human analysts to focus on real threats and higher-value projects. We are venture-backed, and our team has a rare blend of deep experience across cybersecurity, AI/ML, and SaaS product development. Join us if you want to be on the ground floor of using Gen AI to transform cyber defense. Learn more at www.dropzone.ai.

About the role

We are seeking a Senior to Principal-level AI Research Engineer to lead the design and development of next-generation agentic AI systems. This role sits at the intersection of research and production, with a strong emphasis on:

Agent architecture design
Harness and memory engineering
Robust evaluation and benchmarking of model and agent performance

You will work closely with product and engineering teams to translate cutting-edge research into scalable, real-world systems.

In this role, you will directly shape the core intelligence layer of Dropzone AI. Your work will define how our agents reason, remember, and improve over time, influencing both our product capabilities and the broader direction of applied AI systems.

What we're looking for

Someone who thinks in context/harness engineering, not just models
A learner who can follow latest research and test them in real-world deployment
Deep curiosity about how to convert non deterministic outputs from LLMs to consistent reliable outcomes and replicate expert human intuitions
Strong ownership mindset and ability to drive ambiguous problems to clarity

What you'll do

Agentic Architecture

Design and implement advanced multi-step reasoning agents (tool use, planning, reflection, self-improvement loops)
Develop frameworks for multi-agent coordination and task decomposition
Improve reliability, latency, and cost efficiency of agent execution

Memory Systems

Architect short-term and long-term memory subsystems (episodic, semantic, retrieval-based, hybrid)
Build mechanisms for context compression, retrieval, and grounding
Explore novel approaches to continual learning and state persistence

Evaluation & Reliability

Define and implement evaluation frameworks for agent performance (task success, reasoning quality, robustness)
Build automated eval pipelines (synthetic data, adversarial testing, regression testing)
Establish metrics and benchmarks for agent reliability in production

Research → Production

Translate latest community research ideas into production-grade systems
Run experiments, analyze results, and iterate quickly
Contribute to internal knowledge sharing and technical direction

Requirements

5+ years in software engineering, with at least 1+ year applying GenAI in production
Proven experience building or researching:
- Agent frameworks / tool-using LLMs
- Memory / retrieval systems (RAG, vector DBs, hybrid retrieval)
Expert Python developer
Familiar with openclaw and Claude Code harness architecture
Early-stage startup mindset. You thrive on ambiguity and move with lightspeed execution

Preferred

Experience with agent orchestration frameworks (LangGraph, AutoGen, custom systems)
Familiarity with AI safety guardrails, hallucination mitigation, and structured output enforcement
Experience designing LLM evals (offline + online, human-in-the-loop, synthetic data)
Publications or open-source contributions in relevant areas
Experience applying latest context/harness engineering techniques to customer facing products
Founder or early-stage (first 10 engineers) or experience in standing up a new technology bet within a more established company

Work Environment/Travel

We are a 100% remote company where you will work from your home with company-provided equipment to set you up for success. Semi-frequent travel to professional office settings and other events locally and nationally; some overnight travel expected.

Compensation

In the spirit of pay transparency, we are excited to share the base salary range below, exclusive of fringe benefits or potential bonuses. If you are hired at Dropzone your final base salary compensation will be determined based on factors such as geographic location, skills, education, and/or experience. In addition to those factors, we believe in the importance of pay equity and consider internal equity of our current team members as a part of any final offer. Please keep in mind that hiring at the maximum of the range would not be typical to allow for future and continued salary growth. We also offer a generous benefits package, including company paid health insurance, 401K Plan with employer match, Self-Managed PTO, parental leave, and more.

Similar Jobs

Cribl

Machine Learning Engineer

3 Days Ago

Remote

United States

230K-275K Annually

Senior level

230K-275K Annually

Senior level

Software

This role involves designing, training, and evaluating ML models, collaborating on AI initiatives, and optimizing model performance for practical applications.

Top Skills: KubeflowMlflowPythonPyTorchTensorFlowWeights & Biases

Pathos AI

AI Research Engineer

7 Days Ago

Remote or Hybrid

Mid level

Artificial Intelligence • Software • Biotech • Pharmaceutical

The AI Research Engineer will design, implement, and improve AI systems for therapy, enhance evaluation methods, and collaborate with various teams to innovate and ensure safety in AI therapy.

Top Skills: Ai SystemsDashboardsData PipelinesLarge Language ModelsMachine LearningRegression Testing

Tether.io

AI Research Engineer - Pre training

16 Days Ago

In-Office or Remote

100K-500K Annually

Expert/Leader

100K-500K Annually

Expert/Leader

Blockchain • Software • Analytics • Financial Services • Cryptocurrency

The AI Research Engineer will develop innovative architectures for AI models, enhance model intelligence, and conduct large-scale pre-training using distributed servers and NVIDIA GPUs, while advancing AI performance through novel techniques.

Top Skills: Hugging FaceLlm ArchitecturesNvidia GpusPre-Training OptimizationPyTorch

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Google, Apple, Salesforce, Meta
Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine