Grafana Labs

Senior AI Engineer - Grafana Ops, AI/ML | Canada | Remote

Reposted 21 Days Ago
Remote
Hiring Remotely in Canada
164K-197K Annually
Senior level
The role involves developing high-performance AI features for observability, rapid prototyping, and cross-functional collaboration, aimed at improving incident response and system behavior understanding.
The summary above was generated by AI

Grafana Labs, the company behind the open observability cloud, is founded on the principles of open source, open standards, open ecosystems, and open culture. Grafana Cloud, our fully managed observability platform, is flexible and built for scale. With Grafana Cloud's actually useful AI, organizations can see, understand, and act on all their disparate data to move at the speed of their ambitions. Today, more than 35 million users and 7,000+ customers – including Anthropic, Bloomberg, NVIDIA, Microsoft, and Salesforce – trust Grafana Labs to ensure reliability of their applications and systems, resolve incidents quickly, and optimize their telemetry to reduce noise and cost. We are a 100% remote company with 1,600+ team members across 40+ countries, and we’re backed by leading investors including Lightspeed Venture Partners, Sequoia Capital, GIC, Coatue, J.P. Morgan, CapitalG, and Lead Edge Capital. Learn more at grafana.com and follow us on LinkedIn and X.

We’re scaling fast and staying true to what makes us different: an open-source legacy, a global collaborative culture, and a passion for meaningful work. Our team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything we do.

You may not meet every requirement, and that’s okay. If this role excites you, we’d love you to raise your hand for what could be a truly career-defining opportunity.

This is a remote opportunity. At this time, we are only considering applicants based in Canadian time zones.

Senior AI Engineer 

The Opportunity: 

At Grafana, we build observability tools that help users understand, respond to, and improve their systems – regardless of scale, complexity, or tech stack. The Grafana AI teams play a key role in this mission by helping users make sense of complex observability data through AI-driven features. These capabilities reduce toil, lower the barrier of domain expertise, and surface meaningful signals from noisy environments. 

What makes our team different is how we work: we operate with a high degree of autonomy and ownership, both as individuals and as a team. Engineers are empowered to make decisions, move quickly, and validate ideas early – while being supported by a deeply collaborative culture that values curiosity, feedback, and cross-functional partnership.

We’re looking for an AI Software Engineer with a strong software engineering background, a quick iteration mindset, and a passion for experimentation – balanced by a focus on shipping and scaling impactful features that deliver value to users. You’ll work closely with cross-functional teams to develop, test, and ship AI-powered features that contribute to improving infrastructure and observability quality through automation, while also expanding the capabilities of AI agents across the observability stack to assist users with incident response. As the team matures, there’s a broad opportunity to expand or redefine this role based on impact and initiative.

What You’ll Be Doing:

  • Build and deliver AI solutions: Take ownership of developing high-performance AI features to help users detect, triage, and resolve incidents using observability data and tools. 
  • Rapid experimentation and iteration: Implement a highly iterative process where you quickly prototype, test, and validate with real users, including shipping and evolving LLM- or agent-powered workflows for incident lifecycle management and automated analysis tasks.
  • Collaborate cross-functionally: Work with data analysts, product managers, and designers to shape AI-driven product features, including integration of agentic components with internal tools, alerting systems, runbooks, and developer workflows. 
  • Utilize AI tools effectively: Use AI and automation tools to enhance both product functionality and your own development workflows. 
  • Effective communication: You’ll be working in a highly dynamic and collaborative environment, so we need someone who can communicate effectively and contribute across teams.
  • Ownership and impact: Take full ownership of the AI solutions you develop, ensuring they are not only innovative but also scalable, maintainable, and aligned with real user workflows. 

We invest heavily in developer productivity. You can use modern AI coding assistants as part of your daily workflow (your choice of tools, within security guidelines), backed by a company-funded usage budget so you can iterate quickly without unnecessary friction. We encourage pragmatic AI-assisted development: faster prototyping, test generation, refactors, documentation, and incident follow-ups—always paired with strong code review and quality standards. You’ll also have access to frontier models (e.g., GPT-Codex 5/3, Claude Opus 4.6, Gemini 3 Pro).

What Makes You a Great Fit:

  • Strong engineering skills: Solid experience building production software systems (backend and / or full stack). You’re a self-starter, capable of tackling complex engineering problems with minimal supervision.
  • AI experience with a practical mindset: You’re familiar with AI technologies and frameworks, and you focus on delivering high-quality solutions that work in the real world, not just in theory. 
  • Quick iteration and experimentation: You’re comfortable releasing prototypes, collecting feedback, and iterating with a pragmatic mindset.
  • Proven initiative: You take ownership and drive projects forward, pushing boundaries to find the most impactful solutions. You can deal with ambiguity and are able to define scope where things are loosely defined. 
  • Collaborative attitude: You communicate effectively with peers, product managers, and designers. You’re open to feedback, and you bring a solutions-oriented mindset to the table.

Requirements: 

  • Experience with LLMs, prompt engineering, and building applications powered by GenAI.
  • Proven track record of delivering software that made it into production and is actively used by users. 
  • Exposure to working in cloud-native environments (e.g., AWS, GCP, Azure).
  • Experience using observability tools to understand and troubleshoot system behavior.

Bonus Points For:

  • Experience building or working with agent frameworks or multi‑agent workflows.
  • Experience with infrastructure/DevOps-related tooling: Kubernetes, Docker, Terraform, or similar for deployments.
  • Familiarity with model fine-tuning techniques.
  • Experience building observability tooling.

Compensation & Rewards:

In Canada, the base compensation range for this role is CAD 129,392 - CAD 217,128. Actual compensation may vary based on level, experience, and skillset as assessed throughout the interview process. All of our roles include Restricted Stock Units (RSUs), giving every team member ownership in Grafana Labs' success. We believe in shared outcomes: RSUs help us stay aligned and invested as we scale globally.

*Compensation ranges are country specific. If you are applying for this role from a different location than listed above, your recruiter will discuss your specific market’s defined pay range & benefits at the beginning of the process.

Why You’ll Thrive at Grafana Labs:

  • 100% Remote, Global Culture – As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
  • Scaling Organization – Tackle meaningful work in a high-growth, ever-evolving environment.
  • Transparent Communication – Expect open decision-making and regular company-wide updates.
  • Innovation-Driven – Autonomy and support to ship great work and try new things.
  • Open Source Roots – Built on community-driven values that shape how we work.
  • Empowered Teams – High trust, low ego culture that values outcomes over optics.
  • Career Growth Pathways – Defined opportunities to grow and develop your career.
  • Approachable Leadership – Transparent execs who are involved, visible, and human.
  • Passionate People – Join a team of smart, supportive folks who care deeply about what they do.
  • In-Person Onboarding – We want you to thrive from day 1, so you’ll meet your fellow new ‘Grafanistas’ in person and learn all about what we do and how we do it.
  • Balance is Key – We operate a global annual leave policy of 30 days per year. Three of those days are reserved for Grafana Shutdown Days, so the whole team can really disconnect. *We will comply with local legislation where applicable.

Equal Opportunity Employer: Grafana Labs is an equal opportunities employer. We welcome applications from everyone regardless of race, colour, nationality, origin, caste, sex, gender reassignment identity or expression, sexual orientation, age, religion or belief, disability, veteran status, genetic information, pregnancy, maternity, marital, family or carer status, or any other characteristic which is protected by local law. We believe that equality and diversity build a strong organisation, and we work hard to ensure that is the foundation of our organisation as we grow.

Grafana Labs may utilize AI tools in its recruitment process to assist in matching information provided in CVs to job postings. The recruitment team will continue to review inbound CVs manually to identify alignment with current openings.

For information about how your personal data is used once you’ve applied to a job, check out our privacy policy. 
 
