LD.
Laba Deka

Laba Deka

AI & Distributed Systems Engineer · New York, NY

Agentic AI · MCPLLM SystemsDistributed @ ScaleSr. SWE @ Arya Health
github.com/Kuxhalinkedin.com/in/laba-dekal.deka@nyu.edu

About

Senior software engineer building at the intersection of large-scale distributed infrastructure and autonomous AI agents. 4+ years at BookMyShow — India's largest entertainment platform — engineering recommendation systems that serve 500M+ users at 500K RPM with 99.99% availability. Currently at Arya Health as a Senior Engineer, building multi-tenant health data pipelines while completing my M.S. CS at NYU (2026). On the side I design agentic systems using MCP, LangChain, and tool-calling LLMs — from AI SRE copilots to autonomous workforce orchestration. Looking to bring both sides — distributed systems at scale and modern agentic AI — to an ambitious team.

Experience

Arya HealthFeb 2026 – Present

Senior Software Engineer · Data & Platform

  • Reduced API latency 40% with an Aerospike distributed cache over PostgreSQL
  • Built serverless event pipelines using AWS Lambda, SQS, and InfluxDB for multi-tenant health data
  • Wired Dagster pipeline telemetry into SigNoz and Grafana for real-time cross-tenant observability
GoAWS LambdaSQSAerospikePostgreSQLGrafana
BookMyShowApr 2023 – Aug 2024

Software Development Engineer 2 · Personalization & Recommendations

  • Scaled recommendation systems to 500K RPM at 99.99% availability during blockbuster events
  • Cut p99 latency 50% (300ms → 150ms) via a Go Scatter-Gather pattern for parallel data aggregation
  • Led migration from Java monoliths to microservices, reducing AWS infrastructure costs by 40%
GoJavaKafkaAerospikeElasticsearchAWS
BookMyShowJul 2020 – Apr 2023

Software Development Engineer 1

  • Built Spark/Scala + Kafka pipelines for zero-loss terabyte-scale cloud migration of user data
  • Powered real-time recs by integrating Redshift clickstream data into Elasticsearch and Aerospike
  • Maintained 100% uptime through JVM tuning and Elastic APM alerts during post-pandemic traffic surges
JavaScalaApache SparkKafkaElasticsearch

AI / Agentic Projects

Autonomous agents, MCP servers, RAG pipelines, and LLM-powered systems.

Agentic · SRE
pocketops-ai

AI SRE copilot that connects to GitHub, AWS, Sentry, and runbooks to autonomously investigate incidents, summarize alerts, query logs/metrics, and propose fixes — with human-in-the-loop approval for destructive actions.

PythonFastAPIPydantic AIMCPAWS
MCP · Orchestration
health-ops-mcp

MCP server for autonomous home-care workforce orchestration — schedules caregivers, detects scheduling conflicts, and enforces compliance constraints over synthetic patient data.

PythonMCPStreamlitPydantic
Multi-Agent
crimeboard

Multi-agent investigation platform that orchestrates specialized LLM agents to correlate evidence, detect inconsistencies across sources, and generate structured deduction boards.

TypeScriptLangChainMulti-AgentLLM
RAG · Robotics
rag-robotics-nav

RAG pipeline for ROS2 documentation using Qdrant vector search and locally-hosted LLMs to serve context-aware navigation code to robotics developers.

PythonQdrantDockerOllamaGradio
Reference · LLMOps
ai-engineering-handbook

Progressive AI engineering reference spanning basic RAG to production autonomous agents — covers LangChain, LangGraph, AWS Bedrock, prompt caching, and MCP tool use.

PythonLangChainLangGraphAWS BedrockChromaDB

Side Projects

Geo-spatial · Crisis
disasterscout

Crisis response platform built for low-bandwidth environments — geo-spatial incident mapping, MongoDB Atlas Search for resource lookup, and real-time coordination between responders.

PythonMongoDBGeo-spatialFastAPI

Currently Building

Niche AI infrastructure projects — in active development.

Geo-spatial Agent
llm-geo

Agent combining PostGIS + OpenStreetMap spatial data with LLM reasoning to answer location-aware queries. Explores geo-spatial retrieval as a first-class RAG layer — 'best hospitals near X accepting Y insurance with Z rating'.

PythonPostGISOpenStreetMapLangGraphMCP
MCP Infrastructure
mcp-gateway

API gateway purpose-built for the MCP ecosystem — auth, rate limiting, routing across multiple MCP servers, and built-in observability. Think Nginx/Kong but for agentic tool infrastructure.

GoMCPRedisPrometheusDocker
LLMOps · Evaluation
agent-eval-kit

Evaluation harness for multi-agent systems. Runs deterministic scenario suites, scores task completion, tracks token cost and latency distribution, and detects behavioral drift across model versions.

PythonLangChainPydanticpytestGrafana

Competitive Programming

LeetCode@Kuxha
0Solved
0Easy
0Medium
0Hard

Contest Rating 1,580 · Top 26% · 27 contests · 773 submissions / year

Codeforces@kuxha
0Peak Rating
SpecialistPeak Rank
0Contests

Peak 1420 (Specialist) · 51 contests · active since 2017

AI Focus

The problems I think about most deeply.

Agentic Pipelines

Multi-step LLM workflows — tool use, planning loops, state management, and human-in-the-loop approval gates for destructive actions.

RAG Infrastructure

Vector retrieval, context assembly, hybrid search, chunking strategy, and eval-driven quality loops for grounded generation at scale.

MCP & Tool Ecosystems

Designing, composing, and operating Model Context Protocol servers — auth, routing, observability — for production agentic systems.

Writing

Draft

Scatter-Gather at 500K RPM: parallelizing BookMyShow recommendations in Go

How we cut p99 latency in half using a scatter-gather pattern across 12 microservices — and what the Go runtime taught us.

Draft

MCP in production: what nobody tells you about building tool servers

Auth, versioning, error taxonomy, and why your MCP server will time out when you least expect it.

Draft

Why RAG evals are broken — and a simple harness that actually works

Most RAG eval frameworks measure the wrong thing. Here's what to measure instead, and how to wire it into CI.

Education

New York University

M.S. Computer Science · GPA 3.77 / 4.0

2024 – 2026
National Institute of Technology Silchar

B.Tech Computer Science · CGPA 8.06 / 10.0

2016 – 2020

Stack

Languages
JavaGoPythonScalaSQL
Data & Storage
KafkaApache SparkAerospikeElasticsearchPostgreSQLCassandra
AI / ML
LangChainLangGraphRAGMCPPydantic AI
Infra
AWSDockerGrafanaSigNoz

Let's build something.

Open to Senior SWE and AI Engineering roles.

l.deka@nyu.edu
GitHub ↗LinkedIn ↗

© 2026 Laba Deka