As a Junior Data Engineer, you will assist in designing ETL/ELT workflows, maintain Python and SQL codebases, debug data pipelines, document processes, and collaborate with product teams on technical solutions.
About the Organization
Now is a great time to join Redhorse Corporation. We are a solution-driven company delivering data insights and technology solutions to customers with missions critical to U.S. national interests. We’re looking for thoughtful, skilled professionals who thrive as trusted partners building technology-agnostic solutions and want to apply their talents supporting customers with difficult and important mission sets.
About the Role
Redhorse is seeking a highly motivated Junior Data Engineer to join our team supporting the U.S. National Defense Strategy. In this role, you will play a vital part in transforming data into actionable insights that directly impact national security and the modernization of critical defense systems. If you are a problem-solver passionate about leveraging data to support mission-critical objectives, we encourage you to apply and grow your career with us.
Key Responsibilities
- Assist in the design and development of ETL/ELT workflows using Databricks to move and transform data efficiently.
- Maintain and enhance existing Python and SQL codebases, ensuring data integrity and pipeline reliability.
- Debug and resolve defects in data processing pipelines under the guidance of senior team members.
- Clearly document code changes, data models, and technical processes to ensure team transparency.
- Work closely with other data engineers and product teams to translate basic business needs into technical solutions.
Required Experience/Clearance
- Must have an Active Secret Security Clearance. (Applicants without this clearance will not be considered.)
- 2+ years of experience in a data engineering or highly technical data analysis role.
- Proficiency in Python, including familiarity with modular code structures.
- Understanding of Object-Oriented Programming (OOP) patterns.
- Strong foundational knowledge of SQL for querying and basic data manipulation.
- Exposure to or hands-on experience with Databricks or Jupyter Notebooks, and with Pandas, PySpark, or similar tools in cloud-based data environments.
- Demonstrated ability to troubleshoot technical issues and process complex data sets.
Desired Experience
- Familiarity with PySpark or other distributed computing frameworks.
- Familiarity with version control systems like GitLab and project management tools like Jira.
Redhorse Corporation is an equal opportunity employer. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability, or any other protected class.
Accommodations:
If you are a qualified individual with a disability or a disabled veteran, you may request a reasonable accommodation if you are unable or limited in your ability to access job openings or apply for a job on this site as a result of your disability. You can request reasonable accommodations by contacting Talent Acquisition at [email protected]
Redhorse Corporation shall, in its discretion, modify or adjust the position to meet Redhorse’s changing needs.
This job description is not a contract and may be adjusted as deemed appropriate in Redhorse’s sole discretion.


