IIT MADRAS 2028 / IDDD DATA SCIENCE

NAGRAJ GAONKAR.

I build ML systems, HPC renderers, quant backtests, and low-level C++ tools that show their work. Finance, pixels, compilers, search engines.

TINY MODULES. BIG WORK.

01

AMERICAN EXPRESS

Incoming Software Engineering Intern, May '26. Working around GenAI for finance: LLM caching, smart agentic routing, regulatory compliance, and secure model orchestration.

GenAIFinanceLLM CacheSecure Routing
02

PROBABLYONFIRE

California wildfire-risk pipeline over a 580 km x 560 km region. Built fire/no-fire labels, cut imbalance from 1:641 to 1:140, stacked PatchTST + DLA with boosted models, and pushed PR-AUC from 0.2713 to 0.8765.

PatchTSTDLASHAPMonte Carlo
OPEN REPO >
03

TORIRENDER

CMake-based C++ CPU ray tracer for donut/toroid scenes. Uses MPI row distribution plus OpenMP threading across 40 CPUs, dropping a 1600 x 900 render from 19.64h to 0.81h.

C++MPIOpenMPPBS
OPEN REPO >
04

CRANFIELD SEARCH

CS6370 NLP search engine over 1400+ Cranfield docs. Added LSA, K-Means grouping, custom preprocessing, cached SVD, and beat the VSM baseline by +13.3% recall, +9.3% mAP, and 11% response time.

PythonLSAVSMK-Means
OPEN REPO >
05

SILICON

Minimal production-oriented macOS background agent in C++17. Phase 1 nails lifecycle, POSIX shutdown, thread-safe logs, launchd integration, and timed execution safety.

C++17macOSlaunchdLogging
OPEN REPO >
06

PITWALL-SPY

Multithreaded Formula-1 simulation engine. A producer thread emits 20 ms telemetry snapshots into a bounded ring buffer; a consumer renders a live ANSI leaderboard with tyres, pits, laps, and speed.

C++17ThreadsRing BufferJSON
OPEN REPO >
07

LLVM CONSTEXPR

Added constexpr support for X86 shuffle intrinsics in Clang, including bytecode interpreter and constant-expression evaluator paths across SSE, AVX2, and AVX512 variants.

LLVMClangX86SIMD
OPEN PR >
08

REGIME TRADER

InterIIT quant stack for masked exchanges: regime-aware intraday trading, trend following, mean reversion, no-lookahead constraints, parallelized backtests, and cross-dataset generalization.

HFTTick DataBacktestsRisk
09

VERITAS

Adobe InterIIT vision work that became a real/fake image pipeline: CNN + ViT ensembles, adversarial checks with FGSM/PGD, 15,000+ synthetic images, GradCAM, CLIP, and VLM explanations.

ViTCNNGradCAMVLM
READ PAPER >

CURRENT LOADOUT

ONE ENGINEER. MANY MODES.

IDDD in Data Science at IIT Madras, B.Tech in Civil Engineering, incoming American Express SWE intern, and a course stack that keeps pulling me toward systems, data, finance, and mathematical foundations.

ACADEMICS IIT Madras 2028 CGPA 9.32/10 IDDD Data Science
AMEX '26 Software Engineering Intern GenAI for finance Secure model orchestration
COURSES
NLP Machine Learning Techniques Parallel Computing Multi-Threading Data Science Mathematics Probability + Statistics Stochastic Processes Quant Finance Algorithmic Trading Modern C++

OPEN SOCKET

BUILD SOMETHING FAST AND MEANINGFUL.

Send a problem with hard constraints, noisy data, or sharp latency edges. I probably want to poke it.