maleshep

Aman maleshep

HPC ML Engineer — serving 5+ production LLMs on bare-metal Slurm (8× B200). Fine-tuning 31B agents with GRPO. Building AI infrastructure.

Pinned Loading

hpc-llm-serving hpc-llm-serving Public

Production-grade multi-model LLM inference + LoRA fine-tuning on bare-metal Slurm clusters (8× B200). SGLang, vLLM, EAGLE, MTP speculative decoding.

Python
llm-gateway llm-gateway Public

OpenAI-compatible multi-provider LLM gateway with failover, profile-based routing, and free-tier model catalog

Python
second-brain second-brain Public

AI-powered second brain — project registry, Claude agents, and automation tools for analytics & AI strategy work

Python
voicetype voicetype Public

Lightweight Electron dictation overlay for Windows — Azure STT, push-to-talk via Win+Ctrl

JavaScript