Fiddler AI Logo

Fiddler AI

Staff Data Platform Engineer (Hybrid)

Posted Yesterday
Be an Early Applicant
Hybrid
Palo Alto, CA, USA
190K-300K Annually
Senior level
Hybrid
Palo Alto, CA, USA
190K-300K Annually
Senior level
The Staff Data Platform Engineer will design and build cloud services for AI applications, lead distributed systems implementations, and improve operational metrics, all while mentoring a growing engineering team.
The summary above was generated by AI

Our Purpose

At Fiddler, we understand the implications of AI and the impact that it has on human lives. Our company was born with the mission of building trust into AI. The rise of Generative AI and Agents has unlocked generalized intelligence but also widened the risk aperture and made it harder to ensure that AI applications are working well. Fiddler enables organizations to get ahead of these issues by helping deploy trustworthy, and transparent AI solutions. 

Fiddler partners with AI-first organizations to help build a long-term framework for responsible AI practices, which, in turn, builds trust with their user base. AI Engineers, Data Science, and business teams use Fiddler AI to monitor, evaluate, secure, analyze, and improve their AI solutions to drive better outcomes. Our platform enables engineering teams and business stakeholders alike to understand the "what", “why”, and "how" behind AI outcomes.  

Our Founders

Fiddler AI is founded by Krishna Gade (engineering leader at Facebook, Pinterest, Twitter, and Microsoft) and Amit Paka (product leader at Microsoft, Samsung, Paypal and two-time founder). We are backed by Insight Partners, Lightspeed Venture Partners, and Lux Capital. 

Why Join Us

Our team is motivated to help build trust into AI to enable society harness the power of AI. Joining us means you get to make an impact by ensuring that AI applications at production scale across industries have operational transparency and security.  We are an early-stage startup and have a rapidly growing team of intelligent and empathetic doers, thinkers, creators, builders, and everyone in between. The AI and ML industry has a rapid pace of innovation and the learning opportunities here are monumental. This is your chance to be a trailblazer.  

Fiddler is recognized as a pioneer in the field of AI Observability and has received numerous accolades, including:  2022 a16z Data50 list, 2021 CB Insights AI 100 most promising startups, 2020 WEF Technology Pioneer, 2020 Forbes AI 50 most promising startups of 2020, and a 2019 Gartner Cool Vendor in Enterprise AI Governance and Ethical Response. By joining our brilliant (at least we think so) team, you will help pave the way in the AI Observability space.

👩🏽‍🚀 The Mission:

Our Staff Data Platform Engineers make a real impact on the safety and ROI of large language models and agentic applications across different verticals and domains. You will work on the cutting edge of envisioning and building new types of tools and algorithms to monitor, explain, and improve such applications and in turn empower our customers.

🪐 About The Team:

Our engineering team is a dynamic group of builders and thinkers dedicated to solving some of the most cutting-edge challenges in AI safety and reliability. Working on exciting and an expansive range of topics, from the responsible deployment of machine learning models, large language models (LLMs), to complex agentic applications. Our projects are inherently cross-disciplinary, requiring expertise in systems engineering, product engineering, and data science to build robust, scalable solutions. We thrive in a collaborative environment where continuous learning is at the forefront, ensuring every team member stays on their toes with the latest advancements in AI. Joining our team means you'll have the opportunity to make a tangible impact on how AI evolves for the benefit of humanity.

🚀 What You’ll Do:
  • Design and build core services and components of a world-class cloud platform to help enterprises develop, monitor and improve their full suite of AI based applications (covering predictive models, LLMs, GenAI models and agentic applications)

  • Lead the design and implementation of distributed systems and microservices that compute, persist, and expose new ML + agentic observability metrics (e.g., response relevancy, hallucination scores) from raw trace data

  • Design enterprise-grade, scalable data infrastructure, services and APIs to support enterprise scale workloads and meet compliance needs and SLAs

  • Spearhead the development of new types of metrics and evaluation capabilities to satisfy evolving customer needs. Take part in conversations with customers around discovery and support

  • Define and evolve the operational maturity (reliability, latency, SLOs, observability) of core services, establish best practices and champion improvements to internal CI/CD processes, testing frameworks, error handling, efficiency and resiliency

  • Team & Culture Building: you will take an active role in building a world-class engineering team and actively participate in the talent acquisition process through interviewing, candidate evaluation and coaching

🎯 What We’re Looking For:
  • Masters or Bachelors degree in Computer Science or related field, combined with 7+ years of industry experience, with demonstrated solid foundation in software development.

  • Deep proficiency with Python and a strong command of essential backend technologies like Postgres, Redis, Kafka, RabbitMQ, Ray. This includes the ability to design, build, and debug complex, large-scale systems.

  • Experience with deploying and working with ML/LLM models in production. The candidate should be comfortable with modern LLM frameworks (e.g., Langchain, HuggingFace, vLLM) and evaluation frameworks (e.g., Ragas, MLFlow) to ensure model performance and reliability.

  • Adaptability & Ownership: proven ability to thrive in ambiguity and a fast-paced environment. We need a self-motivated initiator who can take ownership of projects with a high degree of autonomy, confidently filling in the gaps when the full picture isn't available.

  • System Design & Optimization: A strong grasp of distributed systems and the capacity to troubleshoot production issues. A nice to have would be experience with cloud infrastructure (AWS/GCP, Kubernetes) and specialized databases (Clickhouse/Druid), indicating a deeper understanding of system architecture and performance optimization.

  • Technical Leadership & Collaboration: Demonstrated ability to plan, execute, and deliver projects by effectively breaking down complex problems into manageable tasks, and guiding a small team of engineers. Must be adept at cross-functional collaboration across a geographically distributed team, working closely with product managers, designers, frontend developers, and data scientists to ensure alignment and successful project outcomes

  • Coaching & Mentorship: you should be an excellent collaborator and a mentor to other team members, raising the technical bar for the entire team and regularly engage in code and design reviews.

  • Ability to work in our Palo Alto office 3 days a week

🫱🏼‍🫲🏾 Compensation:

$190,000 - $300,000 + equity + benefits

🩺 Benefits & Perks
  • Unlimited PTO

  • Competitive pay + equity

  • Premium health, dental & vision (100% premiums covered for employee)

  • 401(k) plan

  • Monthly fitness reimbursement

  • Paid parental leave

Palo Alto HQ Vibes

  • Free annual Caltrain pass

  • Monthly in-office massages

  • Fastrak reimbursement

  • Free lunch Mon–Thurs

The posted range represents the expected salary range for this job requisition and does not include any other potential components of the compensation package and perks previously outlined. Ultimately, in determining pay, we'll consider your experience, leveling, location, and other job-related factors.

Fiddler is proud to be an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. If you require special accommodations in order to complete the interviews or perform job duties, please inform the recruiter at the beginning of the process.

Beware of job scam fraud. Our recruiters use @fiddler.ai email addresses exclusively. In the US, we do not conduct interviews via text or instant message, or ask for sensitive personal information such as bank account or social security numbers.

Top Skills

AWS
Clickhouse
Druid
GCP
Huggingface
Kafka
Kubernetes
Langchain
Mlflow
Postgres
Python
RabbitMQ
Ragas
Ray
Redis
Vllm
HQ

Fiddler AI Palo Alto, California, USA Office

291 Lambert Street, Palo Alto, CA, United States, 94306

Similar Jobs

3 Hours Ago
In-Office
212K-311K Annually
Senior level
212K-311K Annually
Senior level
Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
The CostPoint System Administrator supports and maintains the Deltek Costpoint ERP system, collaborates with finance and IT, and leads ERP projects while ensuring compliance and security.
Top Skills: CognosDeltek CostpointSQL
3 Hours Ago
In-Office
119K-175K Annually
Senior level
119K-175K Annually
Senior level
Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
Responsible for creating a secure environment for classified programs, enforcing security policies, conducting training, and ensuring compliance with federal regulations.
Top Skills: Classified SystemsDefense Information System For Security (Diss)Dodm 5205.07 Sap Security ManualIcd-503Icd-704Icd-705Risk Management Framework (Rmf)
3 Hours Ago
In-Office
92K-199K Annually
Senior level
92K-199K Annually
Senior level
Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
As a Sr. Test Engineer, you will develop manufacturing processes for testing and launching satellites, implement test procedures, and ensure product reliability and safety.
Top Skills: AccelerometersDaqsPxiThermal ChambersThermocouplesTvac ChambersVibration Shakers

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account