Speechify

AI Engineer & Researcher, Inference - Raleigh-Durham, USA

Posted Yesterday
In-Office or Remote
Hiring Remotely in Raleigh, NC
140K-200K Annually
Mid level
The role involves deploying ML inference workloads, improving model performance and efficiency, and operating Python-based services in cloud environments.
The summary above was generated by AI

PLEASE APPLY THROUGH THIS LINK: https://job-boards.greenhouse.io/speechify/jobs/5287658004 

DO NOT APPLY BELOW

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android app, Mac app, Chrome extension, and web app. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its App of the Day.

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. They include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, from leading PhD programs like Stanford, and from high-growth startups like Stripe, Vercel, and Bolt, as well as many founders of their own companies.

This is a key role, ideal for someone who thinks strategically, enjoys fast-paced environments, is passionate about making product decisions, and has experience building great experiences that delight users.

We are a flat organization that allows anyone to become a leader by showing excellent technical skills and delivering results consistently and fast. Work ethic, solid communication skills, and obsession with winning are paramount. 

Our interview process involves several technical interviews and we aim to complete them within 1 week. 

What You’ll Do
  • Work alongside machine learning researchers, engineers, and product managers to bring our AI Voices to their customers for a diverse range of use cases
  • Deploy and operate the core ML inference workloads for our AI Voices serving pipeline
  • Introduce new techniques, tools, and architecture that improve the performance, latency, throughput, and efficiency of our deployed models
  • Build tools to give us visibility into our bottlenecks and sources of instability and then design and implement solutions to address the highest priority issues
An Ideal Candidate Should Have
  • Experience shipping Python-based services
  • Experience being responsible for the successful operation of a critical production service
  • Experience with public cloud environments, GCP preferred
  • Experience with Infrastructure as Code, Docker, and containerized deployments
  • Preferred: Experience deploying high-availability applications on Kubernetes
  • Preferred: Experience deploying ML models to production

What We Offer

  • A dynamic environment where your contributions shape the company and its products
  • A team that values innovation, intuition, and drive
  • Autonomy, fostering focus and creativity
  • The opportunity to have a significant impact in a revolutionary industry
  • Competitive compensation, a welcoming atmosphere, and a commitment to an exceptional asynchronous work culture
  • The privilege of working on a product that changes lives, particularly for those with learning differences like dyslexia, ADD, and more
  • An active role at the intersection of artificial intelligence and audio – a rapidly evolving tech domain

Salary

  • The United States base salary range for this full-time position is $140,000-$200,000 + bonus + equity depending on experience

Think you’re a good fit for this job? 

Tell us more about yourself and why you're interested in the role when you apply.
And don’t forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit? 

Refer them! 

Speechify is committed to a diverse and inclusive workplace. 

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

Top Skills

Docker
GCP
Kubernetes
Python
