Senior LLM Engineer

KPG99 INC • India

Remote

This Job is No Longer Active This position is no longer accepting applications

AI Summary

Seeking a skilled LLM Engineer for 9+ months to develop, deploy, and optimize large language models Using Python, Lang Chain, FastAPI/Flask, and AWS Contribute to cutting-edge AI solutions and generative applications

Key Highlights

Develop and deploy large language models using Python, Lang Chain, and FastAPI/Flask

Optimize and integrate LLMs with vector stores and retrievers

Work on scalable solutions for AI applications on AWS

Key Responsibilities

Design, develop, and maintain scalable web services using FastAPI or Flask frameworks

Implement Lang Chain to build custom pipelines for document indexing, retrieval, and summarization

Architect and deploy Retrieval-Augmented Generation (RAG) systems for chatbots, knowledge systems, and other generative AI applications

Technical Skills Required

Python Lang Chain FastAPI Flask AWS Pinecone FAISS SQL NoSQL

Nice to Have

Solid understanding of SQL and NoSQL databases

Familiarity with dashboarding tools such as Grafana and Tableau

Job Description

Hi,

Hope you are doing well.

Please find the job description below and let me know your interest.

Position: LLM Engineer

Location: 100% Remote

Duration: 9+ months

Mode of Interview: Video Interview

Job Description:

Role Overview

We are seeking a skilled LLM Engineer proficient in Python programming and experienced in developing, deploying, and optimizing large language models (LLMs). The ideal candidate will have hands-on experience with FastAPI or Flask frameworks, Lang Chain implementation, and building Retrieval-Augmented Generation (RAG) pipelines. You will play a key role in integrating cutting-edge AI technologies to solve complex business problems, focusing on vector stores and retrievers while deploying scalable solutions on AWS.

Key Responsibilities

1. Python Development:

a. Design, develop, and maintain scalable web services using FastAPI or Flask frameworks.

b. Write efficient, reusable, and modular Python code to support API-driven LLM applications.

2. Lang Chain & Supporting Frameworks:

a. Implement Lang Chain to build custom pipelines for document indexing, retrieval, and summarization.

Interested in remote work opportunities in Development & Programming? Discover Development & Programming Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.

b. Integrate Lang Chain’s RAG capabilities with other components like vector stores and retrievers to support real-time querying and document processing.

3. RAG Pipelines:

a. Architect and deploy Retrieval-Augmented Generation (RAG) systems for chatbots, knowledge systems, and other generative AI applications.

b. Optimize RAG systems for speed, accuracy, and scalability across multiple use cases.

4. Vector Stores & Retrievers:

a. Work with vector databases like Pinecone, Chroma, FAISS, or Milvus to store and manage embeddings.

b. Implement retrievers and re-rankers to improve query efficiency, ensuring high-quality and relevant outputs for users.

5. AWS Cloud Deployment:

a. Deploy and manage LLM-based applications on AWS, leveraging services such as Lambda, EC2, S3, EKS, and RDS.

b. Ensure the scalability, availability, and reliability of deployed applications.

6. Dashboards and Monitoring (Optional):

a. Create monitoring dashboards using tools like Grafana or Tableau for real-time system monitoring, analytics, and performance insights.

7. Experimentation with Generative AI:

a. Research and integrate the latest advancements in generative AI technologies.

b. Experiment with fine-tuning and adapting large language models (like GPT, BERT) for new, innovative use cases.

Required Technical Skills

• Python proficiency, especially with web frameworks like FastAPI or Flask.

• Strong experience with Lang Chain and associated libraries.

Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.

• Proven expertise in building and optimizing RAG pipelines.

• Proficiency in using vector databases (e.g., Pinecone, FAISS).

• Experience with retrievers and re-rankers.

• Solid understanding of AWS services (Lambda, EC2, RDS, etc.).

• Knowledge of SQL and NoSQL databases.

• Familiarity with dashboarding tools such as Grafana and Tableau.

Soft Skills

• Problem-solving: Ability to handle complex and dynamic challenges with AI solutions.

• Collaboration: Experience working in multidisciplinary teams (data scientists, DevOps, etc.).

• Adaptability: Eagerness and passion to keep up with the latest AI advancements and incorporate them into solutions.

• Communication: Excellent verbal and written communication skills to convey technical information to both technical and non-technical stakeholders.

This role is ideal for engineers who are passionate about pushing the boundaries of generative AI and have the technical skills to create cutting-edge, deployable solutions.

Thanks & Regards

Mohit Kumar

mk@kpgtech.com

Contact: +91 7840854069

KPG99,INC

3240 E STATE ST EXT

Hamilton, NJ 08619

Minority Certified

www.kpg99.com

Job Overview

Posted Date Mar 27, 2026

Employment Type Contract

Experience Level Mid-Senior level

Location India

Category Programming

Company KPG99 INC

Mentioned Skills

Similar Jobs

Explore other opportunities that match your interests

Remote WordPress Developer

Programming

•

1h ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Associate

fetchjobs.co

India

Machine Learning Engineer

Programming

•

5h ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Associate

netrolynx ai

India

Full Stack AI Developer

Programming

•

9h ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Entry level

heyamara

India

Senior LLM Engineer

Key Highlights

Key Responsibilities

Technical Skills Required

Nice to Have

Job Description

Job Overview

Mentioned Skills

Industries

Similar Jobs

Remote WordPress Developer

fetchjobs.co

Machine Learning Engineer

netrolynx ai

Full Stack AI Developer

heyamara

Subscribe our newsletter