Capgemini
Dec 2024 – Present
India
Agentic AI Engineer
On-Prem LLM Deployments
Running inference on bare metal with Ollama and open-source models for air-gapped environments
Multi-Agent Orchestration
Building collaborative agent pipelines where specialized agents handle routing, retrieval, and generation using LangGraph and AG2
vLLM Inference at Scale
High-throughput serving of Qwen and other models with continuous batching
Agentic RAG Architectures
Combining autonomous agents with retrieval pipelines for context-aware, self-correcting orchestration
FastAPI, async patterns, REST APIs, microservices architecture, and scalable data pipelines.
Retrieval-augmented generation, vector search, tool calling, function calling, AG2 agent orchestration, and data pre-processing pipelines.
Azure AI services, Docker containerization, CI/CD pipelines, and production deployments.
B.Tech in Information Technology
GL Bajaj Institute of Technology & Management
2020 – 2024 · Greater Noida, Uttar Pradesh