Work with engineers to design, implement, and deploy scalable inference solutions for top AI models. Implement and optimize models (Python, C++, CUDA, NCCL), monitor live services, develop features, fix bugs, participate in code reviews, and collaborate in agile teams.
DeepInfra is seeking a talented and motivated Software Engineering Intern to join our team. As an intern, you will be working closely with our experienced engineering team to design, develop, and deploy the top open AI models at scale. This is an excellent opportunity to gain hands-on experience in building scalable and efficient software systems, while working on cutting-edge AI models and algorithms.
- Collaborate with the engineering team to design, develop, and test inference solutions for the top AI models.
- Implement and optimize AI models using Python, C++, CUDA, NCCL
- Monitor and maintain the live service.
- Work on feature development, bug fixing, and code reviews to ensure high-quality software delivery
- Participate in daily stand-ups, code reviews, and design discussions to ensure seamless collaboration
- Stay up-to-date with industry trends and advancements in AI and machine learning
- Try new things
- Ship stuff
- Currently pursuing a Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related field
- Strong fundamental knowledge in computer science, including data structures, algorithms, and software design patterns
- Proficiency in Python, including experience with AI/ML libraries and frameworks (e.g., NumPy, pandas, SciPy, TensorFlow, PyTorch)
- Familiarity with AI models, Transformers and Diffusers
- Experience with version control systems (e.g., Git) and agile development methodologies
- Excellent problem-solving skills, with the ability to debug and optimize code
- Strong communication and teamwork skills, with the ability to effectively collaborate with cross-functional teams
- Work on cutting-edge AI model serving - the systems that power the next generation of LLMs and multimodal models.
- Small team, huge impact: your work ships directly to customers.
- Opportunity to learn from engineers building high-performance inference at scale.
- Fast-paced environment with ownership, autonomy, and end-to-end responsibility.
Compensation range: 7000-8000/month USD
Deep Infra Inc. Palo Alto, California, USA Office
Palo Alto, California, United States, 94306
Similar Jobs
Aerospace • Artificial Intelligence • Hardware • Machine Learning • Software • Defense • Manufacturing
As a Senior Flight Software Engineer, you will develop software for spacecraft systems, integrating algorithms, and maintaining databases essential for space missions. You'll work through design to flight implementation, collaborating on software and hardware simulations.
Top Skills:
Azure RtosC/C++CanEthernetI2CPythonRs422Rs485RtlinuxRtosSpiTcp/IpUdp
Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Build and maintain payment reconciliation frameworks and safeguarding controls across global payment flows. Produce reporting and analytics on cash positions, funding needs, and variances. Partner with Product, Engineering, and Ops to design data models and automated workflows, drive process standardization, and provide actionable insights to support Treasury, Finance, and senior management.
Top Skills:
Ai ApplicationsData StudioLookerPythonRSQLTableau
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Lead editorial strategy and the company blog to drive top-of-funnel growth. Pitch, write, and edit product-focused content; build and manage an editorial roadmap and calendar; collaborate with Product Marketing, Growth, and Product teams; optimize content for search and AI/LLM answers (GEO); coordinate contributors and approvals; use performance data to inform and optimize content.
Top Skills:
Generative Engine Optimization (Geo)Large Language Models (Llms)SeoZeta Marketing Platform (Zmp)
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine



