$ cat about.md

I'm an AI engineer interested in how machines can understand the world and act within it. My work focuses on building multimodal models that connect perception and action — particularly vision-language and vision-action systems for robotic applications.

I enjoy working on problems that go beyond model training. Much of my work involves building the full pipeline around machine learning systems: training large multimodal models, designing efficient inference infrastructure using tools like Python and Rust, and integrating learned policies into real-time environments and simulations.

Recently, I've been working on vision-language models and exploring how they can be extended into vision-action models that allow robots to interpret scenes and generate meaningful actions. I'm particularly interested in scalable training workflows, real-time inference architectures, multimodal reasoning, and building robust ML systems that operate in real-world environments.

My goal is to build AI systems that bridge perception, reasoning, and action in real-world robotic settings. My background spans machine learning, distributed systems, and high-performance programming, and I enjoy bridging research ideas with practical engineering.

~/focus_areas
Vision-Language Models
Vision-Action Models for Robotics
Multimodal Learning
ML Systems & Inference Infrastructure

$ history

My journey started in mechanical engineering, pivoted through industrial automation at Amara Raja, and found its home in deep learning and robotics at OVGU Magdeburg. Over four years at Agile Robots, I've grown from a software developer to leading the Foundation Models team, driving the technical strategy for next-generation robotic intelligence.

$ cat education.log

M.Sc. Digital Engineering (Computer Science)

@ Otto-von-Guericke University Magdeburg

Oct 2018 — Jun 2021 // Magdeburg, Germany

Focus: Deep Learning, Computer Vision, Neural Networks. GPA: 1.8 (German scale).

B.Tech. Mechanical Engineering

@ JNTUA College of Engineering, Ananthapuram

2011 — 2015 // Ananthapuram, India

Score: 75%.

$ man skills

# Machine Learning & AI

$Vision-Language Models (VLMs)$Multimodal Learning$Foundation Models$Transformers$Model Training & Fine-Tuning$Model Evaluation & Benchmarking$GANs$Transfer Learning$Knowledge Distillation$Semi-Supervised Learning$RAG$Dataset Preparation & Preprocessing

# Robotics AI

$Vision-Action Models$Imitation Learning$Robot Action Prediction$Vision-Based Manipulation$Simulation-Based Evaluation$Policy Inference Pipelines$Object Detection & Tracking$Semantic Segmentation

# Systems & Infrastructure

$Async ML Inference$WebSocket Architectures$GPU Training Pipelines$Multi-GPU / Distributed Training$AWS SageMaker / HyperPod$Docker$Kubernetes$TensorRT$ONNX$CUDA$GitLab CI/CD$MLflow$Linux

# Programming

$Python$Rust$C++$PyTorch$Hugging Face$LangChain$OpenCV$SQL$Bash

$ ls certs/

Build Basic Generative Adversarial Networks (GANs) — Coursera / DeepLearning.AI

Data Science Math Skills — Coursera / Duke University

Introduction to NumPy — Coursera / Google

Python Programming — GUVI / IIT Madras

$ locale -a

Parseltongue

> Mastery

Telugu

> Native

Tamil

> Native

English

> Professional

German

> Elementary (A2)

Hindi

> Barely

Kannada

> Barely