bash — 80x24

narain@portfolio:~$ whoami

Narain Pattabhiraman

AI/ML Infrastructure Engineer · Agentic Systems, LLM Serving, Computer Use, GPU Inference

narain@portfolio:~$ cat now.txt

info About System

AI/ML infrastructure engineer specializing in agentic systems, LLM serving, and GPU inference. I build production-grade agent runtimes, computer-use pipelines, and the infrastructure that makes them fast and reliable at scale.

GET_RESUME.SH VIEW_BLOG

sensors Live Status ONLINE

Location:San Francisco, CA
Open_To:New Roles
Current_Task:agentic-coding/generative-ui

$ ls -la ~/expertise/

Stack & Skills

Python C/C++ CUDA Triton vLLM ReAct Agents RAG LoRA / GRPO Kubernetes AWS Docker Terraform PyTorch MLflow

$ cat ./history/professional_log.md

History & Deployment

history WORK_EXPERIENCE.LOG

PRESENT_ACTIVE

MLOps Engineer

@ Arizona State University

Jan 2024 - Present

Built CreateAI — an agentic LLM platform serving 30K MAU and 30B+ tokens/month across 100+ models — plus agentic coding/computer-use pipelines, ReAct agent runtimes, multimodal RAG, and GPU inference infrastructure on ASU's SOL cluster.

STABLE_ARCHIVE

Software Engineer — ML

@ Fidelity Investments

Apr 2022 - Jul 2023

Deployed BLOOM and Flan-T5 on 4×A100 GPUs at 40 tok/s; built a low-touch AWS deployment pipeline that cut adoption from 3 months to 2 weeks and shipped 15 models in the first quarter.

STABLE_ARCHIVE

AI Researcher

@ QPiAI Technologies

May 2021 - Apr 2022

Built a scalable CV/NLP inference platform with dynamic batching and queue-based scheduling; led hardware/software co-design for multi-GPU throughput optimization.

STABLE_ARCHIVE

Data Analyst

@ Thoughtware Analytics

Jan 2020 - May 2021

Applied machine learning for demand forecasting, computer vision-based quality assurance, and operations research for logistics and route planning.

$ ls /home/narain/projects

Projects

agentic_coding_&_compute.py

Agentic Coding & Computer-Use Widget Engine

2025

Built an agent that interprets natural-language specs to autonomously write, execute, and debug code, then render interactive UI widgets (charts, forms, dashboards) as live browser components.

Integrated computer-use capabilities — screenshot perception and DOM interaction — enabling the agent to visually verify and self-correct rendered output in a sandboxed browser environment.

Orchestrated multi-step tool chains (code generation, execution, visual feedback, iterative refinement) using a ReAct loop with MCP-compatible tool adapters.

ReAct MCP Computer Use Python JavaScript

reasoning-tuned_llm_via_.py

Reasoning-Tuned LLM via Synthetic Data and GRPO

2024

Built a multi-stage synthetic data generation pipeline producing JSON-formatted reasoning QA pairs via structured relationship sampling, entity assignment, and bidirectional validation.

Fine-tuned Qwen3 14B with LoRA + SFT, then post-trained with GRPO using weighted multi-component rewards (correctness, format, completeness) to improve reasoning-chain quality.

GRPO LoRA SFT PyTorch AMD GPUs

READ_WRITEUP article

robotic_arm_maneuvering_.py

Robotic Arm Maneuvering Using Deep Reinforcement Learning

2020

Designed a Unity3D simulation environment in PyTorch and built a physical prototype with 3D-printed components.

Trained deep RL agents (DDPG, A2C, PPO) for continuous control tasks with >30 DoF using neural policies over sensor and image inputs, then transferred simulation-trained policies to real-world hardware.

PyTorch Deep RL Unity3D Robotics

llm_model_inference_in_b.py

LLM Model Inference in Browser

2024

Built zero-install browser inference for Gemma, LLaMA, and Mistral using WebAssembly SIMD, enabling local LLM execution without backend infrastructure.

JavaScript WebAssembly C GPU

LAUNCH_DEMO rocket_launch

indic_llama.py

Indic LLaMA

2024

Adapted LLaMA for multilingual and domain-specific use cases by extending the tokenizer and training on translated instruction datasets and domain-specific dialogue corpora.

WASM LLM JavaScript Deep Learning

LAUNCH_DEMO rocket_launch VIEW_SOURCE terminal

compiler_system_for_deep.py

Compiler System for Deep Learning

2022

Engineered a graph-optimization compiler for mobile inference backends using TVM and TASO; wrote ARM NEON SIMD and custom CUDA kernels for Jetson Orin with measurable speedups over OpenCV C++ and TorchScript.

TVM LLVM MLIR NEON CUDA

steganalysis_detection.py

Steganalysis Detection

2022

Built a CNN-based steganalysis model for detecting hidden content in digital images by modifying ResNet with Efficient Channel Attention and reduced convolution stride for improved sensitivity.

Python Deep Learning Computer Vision

VIEW_SOURCE terminal LAUNCH_DEMO rocket_launch

$ cat ACHIEVEMENTS.EXE

Achievements

workspace_premium

WINNER

Winner, AMD Synthetic Data Hackathon

Built a multi-stage synthetic data pipeline, fine-tuned Qwen3 14B with LoRA+SFT, then post-trained with GRPO using weighted multi-component rewards for reasoning-chain quality.

Winner GRPO / RLHF LoRA LLM AMD GPUs

$ whoami --contact

Connect

bash — contact — 80x24

NAME:Narain Pattabhiraman

EMAIL:narain13579@gmail.com

LOCATION:Remote / San Francisco, CA

External Links

terminalGitHub linkLinkedIn rss_feedBlog mailEmail

narain@workstation:~$