PlayStation

Software Development Engineer in Test, ML/AI

Posted 2 Days Ago
In-Office
San Mateo, CA, USA
141K-211K Annually
Mid level
Lead QA strategy and automation for ML/AI-powered services. Develop LLM-assisted test generation, build scalable automation frameworks, validate ML/LLM outputs, debug model and service issues, and integrate testing into CI/CD for distributed systems.
The summary above was generated by AI

Why Sony Interactive Entertainment?

Sony Interactive Entertainment isn’t just the Best Place to Play; it’s also the Best Place to Work. Sony Interactive Entertainment (SIE) is the company behind the PlayStation brand. As a subsidiary of Sony Group Corporation, we’re part of a proud legacy of innovation and excellence. SIE is both a dynamic technology company, delivering cutting-edge hardware and network services to more than 100 million people, and an entertainment leader, home to some of the most beloved and recognizable intellectual properties (IP) in the world. Our role at SIE is to create and nurture the experiences under the PlayStation brand, a name synonymous with entertainment excellence and creativity.

Software Development Engineer In Test, ML/AI

PlayStation offers more than just the Best Place to Play; it is also a top workplace. Today, we are a global entertainment leader. Our portfolio includes PlayStation®5, PlayStation®4, PlayStation®VR, PlayStation®Plus, and well-known software from PlayStation Studios.

PlayStation works to foster an inclusive environment where employees feel supported and diversity is valued. We encourage individuals with passion and curiosity for innovation, technology, and play to apply for our open roles and become part of our growing distributed team.

The PlayStation brand falls under Sony Interactive Entertainment, a wholly-owned subsidiary of Sony Group Corporation.

At SIE, we believe in making play safer for everyone. Within our Information Security team, we protect our people, platforms, and products with smart, scalable, and thoughtful solutions. Our mission is to provide clear, actionable risk and security insights that guide the business and enable amazing player experiences.

We’re building a team that thrives on collaboration, curiosity, and delivering real value. Come help us make a difference!

Overview

We are hiring a Software Development Engineer in Test specializing in ML/AI quality, including automation for model evaluation, LLM-assisted test generation, and validation of AI-powered workflows. The role includes guiding quality strategy and developing automation frameworks. You will also manage implementation for complex, cross-functional, machine learning-powered products and services. This role involves more than test development. You will lead quality initiatives from start to finish, ensuring teams stay synchronized, dependencies are managed, and releases are delivered confidently. You will work closely with ML, engineering, product, and infrastructure teams. You will shape how quality is built, monitored, and expanded while leading embedded QE efforts within projects.

What you’ll do
  • Define and complete quality strategies, test plans, and automation coverage for ML-powered services and platform components.
  • Use LLMs and other AI-assisted techniques to generate, expand, and maintain high-value test cases for ML-powered workflows.
  • Design scenario-based test suites for AI features, including adversarial prompts, edge cases, ambiguous inputs, and underrepresented user scenarios.
  • Lead QE efforts for cross-functional projects, driving risk assessment, dependency management, and release readiness.
  • Design, develop, and maintain scalable automation frameworks for backend services, APIs, and ML inference systems using Python and/or Java.
  • Build automated validation for ML and LLM outputs, including ranking behavior, score distributions, prompt/response quality, hallucination indicators, and probabilistic model evaluation.
  • Debug test failures, service anomalies, model inconsistencies, and AI behavior regressions to identify root causes and drive resolution.
  • Perform functional, integration, regression, API, end-to-end, performance, and reliability testing for distributed systems.
  • Improve automation reliability, reduce flakiness, and optimize execution efficiency.
  • Partner with engineering and ML teams to integrate automated testing into CI/CD pipelines and release workflows.
  • Collaborate across teams to establish scalable quality standards, tooling, and guidelines.
Required qualifications
  • Bachelor’s degree in Computer Science or equivalent practical experience.
  • 3+ years of experience as an SDET or QE engineer focused on backend and distributed systems.
  • Experience using LLMs to generate, transform, and prioritize test cases for AI-powered experiences.
  • Experience with AI evaluation tooling, prompt evaluation frameworks, model monitoring, or human-in-the-loop review workflows.
  • Strong experience testing RESTful APIs, microservices, and distributed architectures.
  • Proficiency in Python, Java, JavaScript, or similar languages for automation development.
  • Hands-on experience with automation frameworks such as pytest, JUnit, Selenium, Playwright, Cypress, or Appium.
  • Experience with CI/CD systems and test pipelines (Jenkins, GitHub Actions, etc.).
  • Experience with cloud and container technologies (AWS, GCP, Kubernetes, Docker).
  • Familiarity with databases, monitoring, and observability tools.
  • Strong understanding of SDLC, Agile methodologies, and release processes.
  • Excellent problem-solving, debugging, and communication skills.
Preferred qualifications
  • Experience validating ML outputs using statistical analysis or scenario-based testing approaches.
  • Familiarity with ML infrastructure, data pipelines, or model-serving platforms (Seldon, KServe, Ray Serve, etc.).
  • Prior work in content moderation ML, security, fraud detection, or adversarial ML.
  • Experience testing high-scale, low-latency online services.
  • Experience with Databricks or similar ML platform tooling.
  • Familiarity with Node.js, React, or modern frontend technologies.
  • Experience testing mobile, console, or other non-PC platforms.
What sets you apart
  • Strong combination of automation engineering and delivery ownership.
  • Ability to drive quality across complex cross-functional initiatives.
  • Practical understanding of how to test non-deterministic AI systems and separate model variance from quality regressions.
  • Proven risk management and dependency coordination skills.
  • Ability to influence engineering teams and promote quality guidelines.
  • Passion for scalable, reliable, and maintainable automation systems.

At SIE, we consider several factors when setting each role’s base pay range, including the competitive benchmarking data for the market and geographic location.

Please note that the base pay range may vary in line with our hybrid working policy and individual base pay will be determined based on job-related factors which may include knowledge, skills, experience, and location. 

In addition, this role is eligible for SIE’s top-tier benefits package that includes medical, dental, vision, matching 401(k), paid time off, wellness program and coveted employee discounts for Sony products. This role also may be eligible for a bonus package.


The estimated base pay range for this role is listed below.
$140,500 - $210,700 USD

Please note, Sony Interactive Entertainment conducts background checks at the offer stage for all new employees (which may include criminal background checks for some roles) and will need to process personal information to support these checks.

Please refer to our Candidate Privacy Notice for more information about what personal information we collect, how we use it, who we share it with, and your data protection rights.

Equal Opportunity Statement:

Sony is an Equal Opportunity Employer. All persons will receive consideration for employment without regard to gender (including gender identity, gender expression and gender reassignment), race (including colour, nationality, ethnic or national origin), religion or belief, marital or civil partnership status, disability, age, sexual orientation, pregnancy, maternity or parental status, trade union membership or membership in any other legally protected category.

We strive to create an inclusive environment, empower employees and embrace diversity. We encourage everyone to respond. 

Sony Interactive Entertainment is a Fair Chance employer and qualified applicants with arrest and conviction records will be considered for employment.


PlayStation San Francisco, California, USA Office

400 2nd Street, San Francisco, CA, United States

PlayStation San Mateo, California, USA Office

2207 Bridgepointe Parkway, San Mateo, CA, United States
