Hello, I'm

Nainesh Rathod Agentic AI & ML Engineer

Specializing in Multi-Agent Systems, RAG Pipelines, and Production LLM Applications. Building scalable AI solutions that drive real-world business impact.

🤖 Agentic Systems
🔍 RAG Optimization
⚙️ Production MLOps
Nainesh Rathod - ML Engineer
Python
PyTorch
Azure
LLMs
Scroll to explore

About Me

Passionate ML Engineer building the future with AI

Transforming Ideas into Production-Ready AI Solutions

As an ML Engineer with expertise in Large Language Models and production AI systems, I specialize in bridging the gap between cutting-edge research and real-world applications. My journey spans from healthcare startups to enterprise-level solutions, always focusing on delivering measurable business impact.

Currently, I'm developing sophisticated RAG pipelines on Azure and fine-tuning LLMs for specialized domains. My approach combines deep technical knowledge with practical implementation skills to create AI solutions that actually work in production.

Technical Expertise

Agentic AI & LLMs

LangChain LangGraph RAG Pipelines Fine-Tuning (QLoRA) Vector Databases (Pinecone, Chroma) Prompt Engineering CrewAI Gemini/OpenAI APIs

Cloud & MLOps

AWS (S3, EC2, Lambda) Azure Machine Learning Docker Kubernetes MLflow CI/CD (GitHub Actions) vLLM / TensorRT-LLM

Full Stack & Data

Python & SQL FastAPI React.js Node.js MongoDB PyTorch / TensorFlow Data Visualization (Tableau)

Featured Projects

Production-ready AI solutions that deliver real business impact

Agentic AI

Agentic AI Travel Planner

Multi-agent workflow using CrewAI and Gemini 2.5 Flash. Orchestrates concurrent search vectors and parallel execution, reducing planning latency by 50%.

50% Faster
Multi Agents
CrewAI FastAPI Gemini
Agentic AI

Supply Chain Agent System

Autonomous agent system using LangGraph and Claude 3.5 Sonnet to query complex contracts and track live shipments using real-time external data.

Hybrid Search
Real-time News
LangGraph MongoDB Claude 3.5
NLP & Data

Reddit Thread Analyzer

Latency-optimized pipeline using asyncio and custom scoring algorithms to filter signal from noise in large Reddit threads.

40% Faster
Async Pipeline
Python AWS S3 Asyncio
Frisbee Rules Assistant
Full Stack RAG

Frisbee Rules Assistant

End-to-end RAG system with FastAPI and React 19. Uses LlamaIndex and pgvector to answer complex rule queries with citations.

RAG System
React 19 Frontend
LlamaIndex PostgreSQL Docker
Computer Vision

Real-time Traffic Sign Classification

Deep learning model for real-time traffic sign recognition using MobileNetV2. Achieved 98% F1-score across 43 classes.

98% F1 Score
30fps Real-time
PyTorch MobileNetV2 Streamlit

Professional Experience

Building production-ready AI solutions across diverse industries

Machine Learning Engineer

Kelley School of Business
Oct 2024 - Present

Architecting Multi-Agent RAG systems and Agentic AI workflows.

  • Developed an automated AI system that helps lawyers find relevant case laws faster, improving search accuracy to 92% and saving hours of manual research time per case.
  • Built a quality control tool that automatically checks AI responses for errors, reducing the need for manual human review by 60%.
  • Orchestrated a multi-agent RAG framework using LangGraph and Azure AI Search with hybrid semantic ranking, implementing dynamic query routing between vector stores and live case law databases.
  • Fine-tuned Llama 3.2 Vision (11B) using 4-bit QLoRA and DeepSpeed to optimize inference on Azure Managed Endpoints, achieving 3x lower latency via vLLM serving.
LangGraph Azure AI vLLM DeepSpeed

Machine Learning Engineer Intern

Hyphenova
Jul 2024 - Aug 2024

Scalable ML pipelines and NLP model optimization.

  • Created an automated system to analyze customer sentiment from large volumes of text, helping the marketing team identify key trends and improve campaign strategies with 85% accuracy.
  • Streamlined the software release process by automating manual tasks, drastically cutting down the time to deploy new features from several hours to just under 10 minutes.
  • Architected an end-to-end MLOps pipeline on AWS using CodePipeline and SageMaker, integrating MLflow for experiment tracking and Docker for reproducible model containerization.
  • engineered high-volume feature extraction pipelines using Databricks PySpark to process unstructured text data, fine-tuning a BERT transformer model for multi-class sentiment classification.
AWS BERT MLflow Docker

R&D Software Engineer

Sanskritech Smart Solutions
Feb 2022 - Apr 2023

End-to-end ML systems and full-stack development.

  • Built an AI system for medical imaging that automatically detects health anomalies, speeding up the diagnosis process by 75% and freeing up doctors to focus on patient care.
  • Led a major cost-saving initiative by migrating software infrastructure to open-source systems, saving the company over $500,000 annually in licensing fees.
  • Designed and trained custom YOLOv5 object detection models on Amazon SageMaker for medical anomaly detection, optimizing inference pipelines for real-time edge deployment.
  • Developed an event-driven file ingestion service using AWS Lambda and S3 to handle 5000+ daily multipart uploads, integrating with a React/Redux frontend for seamless medical record management.
YOLOv5 React Node.js AWS S3

Core Competencies

🤖

LLM Development

Fine-tuning, RAG systems, prompt engineering

👁️

Computer Vision

Image classification, object detection, real-time processing

☁️

Cloud ML

Azure ML, AWS, scalable deployment

⚙️

MLOps

CI/CD, monitoring, production optimization

Education

Academic background and research focus

Master of Science in Data Science

Indiana University Bloomington
Aug 2023 - May 2025 GPA: 3.7/4.0

Specializing in Deep Learning, NLP, and Computer Vision.

  • Core Coursework: Applied Machine Learning, Deep Learning Systems, Natural Language Processing.
  • Research Focus: Agentic Workflows and Large Language Model Optimization.

Bachelor of Engineering in Computer Engineering

University of Mumbai
Aug 2019 - May 2023 GPA: 3.5/4.0

Foundation in Computer Science, Algorithms, and Software Engineering.

  • Key Subjects: Data Structures & Algorithms, Database Management Systems, Operating Systems.
  • Capstone: Real-time Medical Imaging Analysis using CNNs.

Let's Work Together

Ready to bring your AI ideas to life? Let's discuss your next project

Get In Touch

I'm always interested in discussing new opportunities, innovative projects, and collaborations in the AI/ML space. Whether you're looking to build production-ready AI solutions or need consultation on ML strategy, I'd love to hear from you.

Location

Available for Remote Work
Available for new projects

Currently accepting freelance projects and full-time opportunities