Dropzone AI

AI Research Engineer

Posted Yesterday
Remote
Hiring Remotely in United States
200K-250K Annually
Senior level
The AI Research Engineer will design next-generation AI systems, focusing on agent architecture, memory engineering, and performance benchmarking, while translating research to scalable systems.
The summary above was generated by AI

About Dropzone AI


Dropzone’s mission is to scale cybersecurity beyond human limits and augment every single human security engineer/analyst with an army of AI security specialists. Humans alone cannot sufficiently protect our digital future, and AI augmentation is the only way for defenders to reclaim the high ground. We are an award-winning company disrupting the $200B+ cybersecurity market.
Powered by Gen AI advancements, our technology offloads repetitive day-to-day work and frees human analysts to focus on real threats and higher-value projects. We are venture-backed, and our team has a rare blend of deep experience across cybersecurity, AI/ML, and SaaS product development. Join us if you want to be on the ground floor of using Gen AI to transform cyber defense. Learn more at www.dropzone.ai.

About the role

We are seeking a Senior to Principal-level AI Research Engineer to lead the design and development of next-generation agentic AI systems. This role sits at the intersection of research and production, with a strong emphasis on:

  • Agent architecture design
  • Harness and memory engineering
  • Robust evaluation and benchmarking of model and agent performance

You will work closely with product and engineering teams to translate cutting-edge research into scalable, real-world systems. 

In this role, you will directly shape the core intelligence layer of Dropzone AI. Your work will define how our agents reason, remember, and improve over time, influencing both our product capabilities and the broader direction of applied AI systems.


What we're looking for

  • Someone who thinks in context/harness engineering, not just models
  • A learner who can follow the latest research and test it in real-world deployments
  • Deep curiosity about how to convert nondeterministic outputs from LLMs into consistent, reliable outcomes and replicate expert human intuition
  • Strong ownership mindset and ability to drive ambiguous problems to clarity

What you'll do

Agentic Architecture
  • Design and implement advanced multi-step reasoning agents (tool use, planning, reflection, self-improvement loops)
  • Develop frameworks for multi-agent coordination and task decomposition
  • Improve reliability, latency, and cost efficiency of agent execution
Memory Systems
  • Architect short-term and long-term memory subsystems (episodic, semantic, retrieval-based, hybrid)
  • Build mechanisms for context compression, retrieval, and grounding
  • Explore novel approaches to continual learning and state persistence
Evaluation & Reliability
  • Define and implement evaluation frameworks for agent performance (task success, reasoning quality, robustness)
  • Build automated eval pipelines (synthetic data, adversarial testing, regression testing)
  • Establish metrics and benchmarks for agent reliability in production
Research → Production
  • Translate the latest research ideas from the community into production-grade systems
  • Run experiments, analyze results, and iterate quickly
  • Contribute to internal knowledge sharing and technical direction

Requirements

  • 5+ years in software engineering, with at least 1 year applying GenAI in production
  • Proven experience building or researching:
    • Agent frameworks / tool-using LLMs
    • Memory / retrieval systems (RAG, vector DBs, hybrid retrieval)
  • Expert Python developer
  • Familiar with openclaw and Claude Code harness architecture
  • Early-stage startup mindset. You thrive on ambiguity and execute at lightspeed
Preferred
  • Experience with agent orchestration frameworks (LangGraph, AutoGen, custom systems)
  • Familiarity with AI safety guardrails, hallucination mitigation, and structured output enforcement
  • Experience designing LLM evals (offline + online, human-in-the-loop, synthetic data)
  • Publications or open-source contributions in relevant areas
  • Experience applying the latest context/harness engineering techniques to customer-facing products
  • Founder or early-stage (first 10 engineers) experience, or experience standing up a new technology bet within a more established company

Work Environment/Travel

We are a 100% remote company; you will work from home with company-provided equipment to set you up for success. Expect semi-frequent travel to professional office settings and other events locally and nationally, with some overnight travel.

Compensation

In the spirit of pay transparency, we are excited to share the base salary range below, exclusive of fringe benefits or potential bonuses. If you are hired at Dropzone, your final base salary will be determined based on factors such as geographic location, skills, education, and/or experience. In addition to those factors, we believe in the importance of pay equity and consider the internal equity of our current team members as part of any final offer. Please keep in mind that hiring at the maximum of the range would not be typical, in order to allow for future and continued salary growth. We also offer a generous benefits package, including company-paid health insurance, a 401(k) plan with employer match, self-managed PTO, parental leave, and more.



