FriendliAI Logo

FriendliAI

Software Engineer – AI Agents

Posted 2 Days Ago
Hybrid
San Francisco, CA, USA
Mid level
Hybrid
San Francisco, CA, USA
Mid level
Design, build, and maintain agent APIs and production agent applications for document understanding, advanced RAG, and customer support automation. Integrate open-source models, collaborate with backend and infra for deployment and monitoring, and ensure APIs are robust, scalable, and developer-friendly.
The summary above was generated by AI
About the Job

We’re seeking an Agent Engineer to design and build agentic features in our platform, including document understanding, advanced RAG, and customer support automation. In this role, you will develop not only the agent components themselves, but also the Friendli Agent API, which serves as the core developer interface for building and extending agent applications. You will also build agent applications as production-ready examples of how agents can solve real-world problems.

These applications will be primarily written in Python and will serve as reference implementations for our customers and community. We are looking for a hands-on engineer who is passionate about building agent systems and making AI easy for developers to adopt. The ideal candidate is comfortable creating agent applications that showcase what is possible, is curious about and experienced with open-source models, and enjoys turning them into reliable, high-impact features.

Key Responsibilities
  • Design, build, and maintain agent APIs and applications that deliver document understanding and other high-value features

  • Evaluate and integrate open-source models to power production-ready agent features where possible

  • Develop reference agent applications to showcase workflows and accelerate customer adoption

  • Collaborate with backend and infrastructure teams to integrate agents with deployment, orchestration, and monitoring systems

  • Ensure APIs are robust, developer-friendly, and enterprise-ready through strong design principles and documentation

  • Continuously improve the reliability, scalability, and performance of agent features in production

Qualifications
  • 3+ years of experience in software engineering, preferably in backend, ML systems, or API development

  • Bachelor’s or Master's degree in Computer Science, Computer Engineering, or equivalent

  • Strong programming skills in Python; experience with various Python frameworks

  • Solid understanding of LLM workflows, agent patterns, or tool invocation systems

  • Experience designing and delivering production APIs

  • Familiarity with open-source LLMs and multimodal models (HuggingFace, LangChain, LlamaIndex, etc.)

  • Strong foundations in cloud-native development

Preferred Experience
  • Experience with document understanding pipelines (e.g., OCR, RAG, summarization, structured extraction)

  • Familiarity with Kubernetes or container orchestration in production

  • Built or contributed to agent frameworks, SDKs, or CLIs

  • Have worked in a startup or fast-paced environments with ownership and ambiguity

  • Passion for developer experience and enabling AI adoption

Benefits
  • Flexible working hours

  • Daily lunch and dinner provided; unlimited snacks and beverages

  • Supportive and highly collaborative work environment

  • Health check-up support and top-tier equipment/hardware support

  • A front-row seat to the generative AI infrastructure revolution

  • Competitive compensation, startup equity, health insurance, and other benefits.

About FriendliAI

FriendliAI is building the world’s best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.

We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.

Similar Jobs

7 Days Ago
In-Office
San Francisco, CA, USA
240K-300K Annually
Senior level
240K-300K Annually
Senior level
Information Technology • Logistics • Software • 3PL: Third Party Logistics • Industrial • Manufacturing
Lead design and delivery of Traba's agent platform: runtime orchestration, eval/observability, integrations to customer systems (WMS/TMS/ERP), model and retrieval strategy, deployment patterns, and field-driven productization. Build evaluation infrastructure, ship reliable production agents, and hire/mentor engineers while partnering with leadership on a multi-year roadmap.
Top Skills: ErpKafkaLlmsNode.jsPostgresPythonRabbitMQTmsTypescriptWms
12 Days Ago
In-Office
San Francisco, CA, USA
180K-300K Annually
Junior
180K-300K Annually
Junior
Artificial Intelligence • Software
As a Software Engineer at Pylon, you'll build AI features, enhance their quality and performance, and have autonomy in your work.
Top Skills: AWSGoGraphQLReact
14 Days Ago
Hybrid
San Francisco, CA, USA
Mid level
Mid level
Fintech • Software • Financial Services
Develop AI agents for finance workflows, design APIs, enhance engineering practices, ensure data security, and facilitate user collaboration.
Top Skills: AWSElixirNext.JsPhoenixTerraformTypescript

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account