S.G. Sakthivel
AI & ML Engineer

S.G. Sakthivel Profile
settings_suggest
Core Stack: LLM Optimization

Advanced Fine-Tuning

terminal

Hands-on experience in fine-tuning and optimizing Large Language Models, including LLaMA-based variants, GPT-2, and GPT-Neo, using parameter-efficient methods such as PEFT and LoRA.

analytics

Strong understanding of tokenization behavior, context window constraints, and prompt construction.

account_tree

Skilled in designing efficient input pipelines that minimize token usage while preserving semantic fidelity, enabling cost-effective inference and improved latency.

architecture

Experienced in context engineering, including system prompt structuring, dynamic context injection, and managing context window utilization to balance relevance, coherence, and computational efficiency in real-time applications.

dns

Proficient in working with locally deployed models via Ollama, including model selection, environment configuration, and inference tuning for stable performance.

compress

Familiar with handling model loading strategies, memory constraints, and experience with model quantization techniques to reduce memory footprint and accelerate inference, enabling deployment on limited-resource hardware without significant degradation in output quality.

database
CORE STACK: RAG

RAG Architectures

Implementing complex Retrieval-Augmented Generation workflows across diverse vector databases for production-grade intelligence.

memory
CORE STACK: PROTOCOLS

Custom MCP Servers

Building custom Model Context Protocol servers to bridge language models with proprietary data and specialized tools.

Technical Focus

Bridging the gap between cutting-edge research and functional deployments. My work focuses on scalable, local-first AI solutions that maintain data sovereignty and high performance.

Multimodal Systems Specialist
Enterprise AI Engineering
Technical Capability Matrix

The Technical Core

A precise engineering stack for modern AI systems. Focusing on scalable RAG architectures, model fine-tuning, and robust context management.

neurology

Core AI Expertise

  • Transformer Architecture from-scratch
  • CNN & Vision Transformer (ViT)
  • Sequence-to-Sequence Modeling
  • Reinforcement Learning basics
psychology

Generative AI & LLM

  • LLM Fine-tuning (PEFT, LoRA)
  • RAG & CAG Architectures
  • Agentic AI Systems
handyman

Frameworks & Tools

PyTorch TensorFlow Transformers LangChain Gradio Docker
architecture

Applied Systems & Architecture

Multimodal AI Systems

Text + Image + Speech Integration

End-to-end Pipeline Design

Production Grade RAG Pipelines

API Integration

Seamless Enterprise Connectivity

Real-time Inference

Low Latency System Optimization

settings_input_component

Context Engineering

Methodology Prompt Engineering & Context Optimization
Architectural Design Advanced RAG Architecture
Automation Custom Pipeline Orchestration
terminal

Programming

  • Python

    Lead Architect

  • R Language

    Statistical Computation

  • MySQL

    Optimization

database

Data Science

Processing

High-quality Synthetic Data Generation

blur_on

Core Methodology

Advanced Data Preprocessing

model_training
Archive.v02 / Deployments

Selected Projects

Healthcare AI ViT + LLM

DPT Clinical Decision Support Engine

A state-of-the-art diagnostic reasoning platform utilizing deep probabilistic tensors to assist clinicians in real-time differential diagnosis and treatment mapping.

RAG Pipeline

Multimodal RAG Pipeline

Multimodal RAG pipeline with DeepSeek-R1-Qwen2.5-1B, FAISS, and BLIP image captioning for cross-modal understanding.

PyTorch NMT

Transformer from Scratch

Implemented an Esperanto to Hungarian neural machine translation model based on the Attention Is All You Need paper.

Emotion AI Real-time

Emovibes

AI-powered communication system using Spotify API, detecting emotional tone and suggesting context-aware replies, bundled into a seamless app experience.

Professional
Neural Journey

A recursive history of growth across industrial automation, generative artificial intelligence, and software architectural design.

precision_manufacturing Active Node
Feb 2026 — Present

Automation Engineer

CATERPILLAR

Architecting automated systems leveraging advanced Machine Learning protocols to optimize industrial throughput and predictive maintenance pipelines.

AI Ops Neural Networks Automation
neurology Legacy Node
Sep — Oct 2024

Generative AI Intern

NeubAItics Tech Pvt. Ltd

Researched and implemented transformer-based architectures for synthetic data generation and natural language processing tasks.

LLMs Prompt Engineering Python

school Academic Core

System.edu_manifest
01

INSTITUTION: Sathyabama Institute of Science and Technology

02

DEGREE: B.E. in Computer Science (AI & ML)

03

TENURE: 2022 — 2025

04

METRIC: CGPA 8.30 / 10.0

verified_user Certifications

DeepLearning.AI Certificate

DeepLearning.AI

Machine Learning Specialization

Google Cloud Certificate

Google

Cloud Engineering Protocols

IBM Data Science Certificate

IBM

Data Science Professional

TensorFlow Developer Certificate

TensorFlow

Developer Certification