Evaluate generative AI safety by creating and testing prompts, documenting failures, reviewing multimodal outputs, and applying safety taxonomies and guidelines during high-volume production sprints. Participate in calibration, quality reviews, and flag unclear guidelines or recurring model behaviors.
About the Role
We are hiring a Red Teaming | Generative AI Analyst to support generative AI safety evaluation. In this role, you will interact with AI models, create and evaluate prompts, and identify where model responses fail to meet defined safety expectations.
Project Details
- Job Title: Red Teaming | Generative AI Analyst
- Location: Remote, with the option to work onsite in the Santa Clara, CA area
- Hours: 40 hours per week
- Employment Type: W2 Full-Time Employee
- Pay Rate: $47.44/hour
What You’ll Do
- Interact with generative AI models using project-provided guidelines, safety taxonomies, and attack-vector guidance.
- Create and evaluate prompts designed to test model behavior across safety-related categories.
- Identify where model responses become unsafe, noncompliant, inconsistent, or otherwise problematic.
- Document model breakability, effort level, point of failure, and relevant category alignment.
- Review text, image, audio, video, or other multimodal content as required by the workflow.
- Apply detailed guidelines consistently across short, high-volume production sprints.
- Use sound judgment to evaluate ambiguous, edge-case, or policy-sensitive outputs.
- Conduct self-review to ensure work is accurate, complete, and aligned with project expectations.
- Flag unclear guidelines, tooling issues, or recurring model behavior patterns.
- Participate in calibration, feedback, and quality review sessions to improve consistency.
- Maintain readiness to pivot quickly between different red teaming runs when active work is launched.
Requirements
- Native-level or near-native English proficiency with excellent written communication skills.
- Work authorization is required for this role.
- Strong creative writing ability and comfort constructing varied prompts.
- Experience with red teaming, safety data annotation, content evaluation, safety review, content moderation, QA, or AI model evaluation preferred.
- Strong attention to detail and ability to follow complex project guidelines.
- Ability to think critically and evaluate open-ended model responses.
- Comfort working with sensitive, adult, NSFW, or policy-relevant content where required.
- Interest in generative AI, AI safety, large language models, or emerging AI technologies.
- Ability to work quickly and accurately during short production windows.
- Bachelor’s degree or equivalent practical experience preferred.
Ways to Stand Out from the Crowd
- Background in creative writing, English, linguistics, journalism, communications, policy, trust and safety, or content moderation.
- Experience evaluating generative AI prompts and responses.
- Familiarity with AI safety, red teaming, jailbreak testing, RLHF, or model evaluation workflows.
- Experience working with safety taxonomies, policy guidelines, evaluation rubrics, or defect categories.
- Prior experience reviewing sensitive, adult, NSFW, or policy-relevant content in a professional setting.
- Experience with multimodal AI workflows involving text, image, audio, or video.
- QA/testing experience within AI, data operations, content review, or annotation environments.
- Ability to explain a repeatable approach for staying consistent during high-volume, judgment-based work.

