Tiger Analytics Logo

Tiger Analytics

Gen AI Data Engineer

Reposted 5 Days Ago
Remote
Hiring Remotely in United States
Expert/Leader
Remote
Hiring Remotely in United States
Expert/Leader
The Gen AI Data Engineer will design and build distributed data systems, develop data pipelines, manage data infrastructure, and integrate technologies for real-time and batch processing, contributing to scalable analytics solutions.
The summary above was generated by AI

Tiger Analytics is looking for experienced Machine Learning Engineers with Gen AI experience to join our fast-growing advanced analytics consulting firm. Our employees bring deep expertise in Machine Learning, Data Science, and AI. We are the trusted analytics partner for multiple Fortune 500 companies, enabling them to generate business value from data. Our business value and leadership has been recognized by various market research firms, including Forrester and Gartner.

We are looking for top-notch talent as we continue to build the best global analytics consulting team in the world. You will be responsible for:

Technical Skills Required:

Programming Languages: Proficiency in Python, SQL, and PySpark.

Data Warehousing: Experience with Snowflake, NOSQL and Neo4j.

Data Pipelines: Proficiency with Apache Airflow.

Cloud Platforms: Familiarity with AWS (S3, RDS, Lambda, AWS batch, SageMaker processing Job, CloudFormation, etc.) or GCP (Vertex AI RAG, Data pipeline, Bigquery, GKE)

Operating Systems: Experience with Linux.

Batch/Realtime Pipelines: Experience in building and deploying various pipelines.

Version Control: Experience with GitHub.

Development Tools: Proficiency with VS Code.

Engineering Practices: Skills in testing, deployment automation, DevOps/SysOps.

Communication: Strong presentation and communication skills.

Collaboration: Experience working with onshore/offshore teams.


Requirements

Desired Skills:

·        Big Data Technologies: Experience with Hadoop and Spark.

Data Visualization: Proficiency with Streamlit and dashboards.

·        APIs: Experience in building and maintaining internal APIs.

·        Machine Learning: Basic understanding of ML concepts.

·        Generative AI: Familiarity with generative AI tools and techniques.

Additional Expertise:

·        Knowledge Graphs: Experience with creation and retrieval.

·        Vector Databases: Proficiency in managing vector databases.

·        Data Persistence: Ability to develop and maintain multiple forms of data persistence and retrieval methods (RDMBS, Vector Databases, buckets, graph databases, knowledge graphs, etc.).

·        Cloud Technologies: Experience with AWS, especially SageMaker, Lambda, OpenSearch.

·        Automation Tools: Experience with Airflow DAGs, AutoSys, and CronJobs.

·        Unstructured Data Management: Experience in managing data in unstructured forms (audio, video, image, text, etc.).

·        CI/CD: Expertise in continuous integration and deployment using Jenkins and GitHub Actions.

·        Infrastructure as Code: Advanced skills in Terraform and CloudFormation.

·        Containerization: Knowledge of Docker and Kubernetes.

·        Monitoring and Optimization: Proven ability to monitor system performance, reliability, and security, and optimize them as needed.

·        Security Best Practices: In-depth understanding of security best practices in cloud environments.

·        Scalability: Experience in designing and managing scalable infrastructure.

·        Disaster Recovery: Knowledge of disaster recovery and business continuity planning.

·        Problem-Solving: Excellent analytical and problem-solving abilities.

·        Adaptability: Ability to stay up-to-date with the latest industry trends and adapt to new technologies and methodologies.

·        Team Collaboration: Proven ability to work well in a team environment and contribute to a positive, collaborative culture.

GenAI Engineer Specific Skills:

·        Industry Experience: 8+ years of experience in data engineering, platform engineering, or related fields, with deep expertise in designing and building distributed data systems and large-scale data warehouses.

·        Data Platforms: Proven track record of architecting data platforms capable of processing petabytes of data and supporting real-time and batch ingestion processes.

·        Data Pipelines: Strong experience in building robust data pipelines for document ingestion, indexing, and retrieval to support scalable RAG solutions. Proficiency in information retrieval systems and vector search technologies (e.g., FAISS, Pinecone, Elasticsearch, Milvus).

·        Graph Algorithms: Experience with graphs/graph algorithms, LLMs, optimization algorithms, relational databases, and diverse data formats.

·        Data Infrastructure: Proficient in infrastructure and architecture for optimal extraction, transformation, and loading of data from various data sources.

·        Data Curation: Hands-on experience in curating and collecting data from a variety of traditional and non-traditional sources.

·        Ontologies: Experience in building ontologies in the knowledge retrieval space, schema-level constructs (including higher-level classes, punning, property inheritance), and Open Cypher.

·        Integration: Experience in integrating external databases, APIs, and knowledge graphs into RAG systems to improve contextualization and response generation.

·        Experimentation: Conduct experiments to evaluate the effectiveness of RAG workflows, analyze results, and iterate to achieve optimal performance.


Benefits

This position offers an excellent opportunity for significant career development in a fast-growing and challenging entrepreneurial environment with a high degree of individual responsibility.

Top Skills

Apache Airflow
AWS
CloudFormation
Docker
GCP
Git
Github Actions
Hadoop
Jenkins
Kubernetes
Linux
Neo4J
NoSQL
Pyspark
Python
Snowflake
Spark
SQL
Streamlit
Terraform
Vs Code

Similar Jobs

9 Days Ago
Remote
United States
50K-130K Annually
Senior level
50K-130K Annually
Senior level
Software
Lead the development of Big Data solutions and machine learning systems, mentor engineers, and maintain high standards of engineering excellence.
Top Skills: AWSAzureCi/CdDeltaDockerGCPHudiIcebergKubernetesPysparkSpark
52 Minutes Ago
In-Office or Remote
2 Locations
Senior level
Senior level
Artificial Intelligence • Enterprise Web • Machine Learning • Natural Language Processing • Software • Conversational AI • Automation
As a Senior Product Manager, lead product strategy for Voice and Knowledge Base, collaborate cross-functionally, manage execution from concept to launch, and serve as key stakeholder liaison.
Top Skills: AIContact CenterCRMVoice
56 Minutes Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
80K-122K Annually
Senior level
80K-122K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
The Senior Knowledge Management Specialist will drive KM strategies, manage multiple projects, enhance KM processes with AI, and support cross-team collaboration.
Top Skills: Ai PlatformsCmsConfluenceJIRAKmsShowpadTableauZendesk

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account