Senior Applied AI/ML Engineer (Foundation Models & Data)

crudcook • India

Remote

Apply

AI Summary

We're seeking a Senior Applied AI/ML Engineer to own the foundation model layer of our client's stack, fine-tune open-weight large language models, and design model pipelines. The role involves applying classical machine learning and building data infrastructure. The ideal candidate has 4-6 years of experience in production machine learning systems and a strong foundation in classical machine learning and forecasting.

Key Highlights

Own the foundation model layer of the client's stack

Fine-tune open-weight large language models

Design model pipelines and build data infrastructure

Key Responsibilities

Own the foundation model layer of the client's stack

Fine-tune open-weight large language models

Design model pipelines and build data infrastructure

Apply classical machine learning and forecasting

Work closely with the founding team and cross-functional stakeholders

Technical Skills Required

Python PyTorch Hugging Face ecosystem SQL Google Cloud Platform AWS Azure

Benefits & Perks

Remote work

Competitive salary

Opportunity to build the machine learning function from the ground up

Nice to Have

Experience with agent frameworks

Model Context Protocol (MCP)

Tool-use evaluation

Multi-agent orchestration

Job Description

Company Description

Crudcook is a specialist talent solutions partner focused on senior technical hiring for high-growth technology and fintech companies. We work with venture-backed startups, scaling businesses, and category-defining teams to identify, engage, and place senior engineering and AI/ML talent across India and globally. Our approach combines deep technical understanding with a curated, relationship-led search process — we partner with a small, focused set of clients to ensure depth, quality, and meaningful candidate experiences. Our team has supported hiring across some of the most innovative startups and global technology organizations, with a track record of placing senior engineers, machine learning practitioners, and technical leaders into roles where they can do their best work. We believe great hiring is about fit, not filtering — and we're committed to a candidate-first process that respects both the time and the trajectory of the people we represent.

Role Description

This is a remote role for an Applied AI/ML Engineer (Foundation Models & Data), hired on behalf of our client — a well-funded fintech building payments infrastructure at scale. The Machine Learning Engineer will own the foundation model layer of the company's stack end-to-end, including fine-tuning open-weight large language models on proprietary transaction, partner, and operational data; designing model pipelines that move from raw event data to production inference; and building the data infrastructure that supports the full machine learning lifecycle. The role also involves applying classical machine learning where it is the right tool — including liquidity and volume forecasting, anomaly detection across transaction flows, and partner behavior modeling. The Machine Learning Engineer will work closely with the founding team, senior engineers, and cross-functional stakeholders to ship reliable, production-grade systems in a regulated, latency-sensitive domain. This is a first-ML-hire role, which means the scope is unusually broad, the ownership is real, and the engineer will help build the machine learning function from the ground up. Collaboration, systems thinking, technical leadership, and continuous learning are core aspects of this role.

Qualifications

Interested in remote work opportunities in Machine Learning & AI? Discover Machine Learning & AI Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.

4–6 years of experience building production machine learning systems, with significant hands-on work on transformer-based models
Demonstrable experience fine-tuning open-weight LLMs (Llama, Qwen, Mistral, Gemma) using techniques such as LoRA, QLoRA, full fine-tuning, DPO, ORPO, or continued pre-training
Deep understanding of transformer architecture, including attention mechanisms, positional encodings, tokenization tradeoffs, and context length considerations
Proven track record of shipping at least one fine-tuned LLM to production
Strong foundation in classical machine learning and forecasting — gradient boosting, time-series methods (Prophet, statsforecast, SARIMA), and statistical reasoning
Experience designing and optimizing machine learning models across both classical and deep learning paradigms
Proficiency in Python, with fluency in PyTorch and the Hugging Face ecosystem (transformers, peft, trl, datasets)
Hands-on experience with at least one inference server such as vLLM, TGI, or SGLang
Real data engineering capability — SQL fluency, pipeline orchestration, schema design, and familiarity with feature store concepts including point-in-time correctness and online/offline parity

Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.

Comfort with Google Cloud Platform (Vertex AI, GKE, BigQuery, GCS) or equivalent experience on AWS or Azure
Strong foundation in computer science, algorithms, statistics, and applied mathematics
Strong analytical, problem-solving, and design-documentation skills
Bachelor's or Master's degree in Computer Science, Machine Learning, Statistics, or a related field
Experience with agent frameworks, Model Context Protocol (MCP), tool-use evaluation, or multi-agent orchestration is a plus
Background in fintech, payments, fraud, or other regulated domains is a plus
Open-source contributions to machine learning or LLM tooling is a plus
Distributed training experience (FSDP, DeepSpeed, multi-node) is a plus
Experience with liquidity, treasury, or financial forecasting in a payments or trading context is a plus

Looking forward for your application!

Job Overview

Posted Date May 13, 2026

Employment Type Full-time

Experience Level Mid-Senior level

Location India

Category Machine Learning

Company crudcook

Mentioned Skills

Similar Jobs

Explore other opportunities that match your interests

MLOps Engineer (JAX, PyTorch, Pallas/Triton) for Large Language Model Training

Machine Learning

•

2d ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Mid-Senior level

para ai labs

India

MLOps Engineer (JAX, PyTorch, Pallas/Triton)

Machine Learning

•

4d ago

Visa Sponsorship Relocation Remote

Job Type Part-time

Experience Level Not Applicable

Mercor

India

Machine Learning Engineer

Machine Learning

•

4d ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Entry level

Stealth Startup

India

Senior Applied AI/ML Engineer (Foundation Models & Data)

Key Highlights

Key Responsibilities

Technical Skills Required

Benefits & Perks

Nice to Have

Job Description

Job Overview

Mentioned Skills

Industries

Similar Jobs

MLOps Engineer (JAX, PyTorch, Pallas/Triton) for Large Language Model Training

para ai labs

MLOps Engineer (JAX, PyTorch, Pallas/Triton)

Mercor

Machine Learning Engineer

Stealth Startup

Subscribe our newsletter