Siddharth Yadav

Software Engineer · M.S. Computer Science @ Virginia Tech

Software Engineer focused on creating robust, scalable applications and exploring the intersection of AI and software development.

About Me

I'm a Software Engineer pursuing my M.S. in Computer Science at Virginia Tech. With 5+ years of industry experience at companies like TATA AIG and Infosys, I've built everything from serverless policy management portals on AWS to retry microservices that reduced failure rates by 70%.

Currently, I'm a Researcher at the Commonwealth Cyber Initiative, where I build LLM agent systems on LangGraph for security alert triage and detection-rule generation, along with reusable RAG pipelines on LlamaIndex and ChromaDB. I also interned at the American Chemical Society, where I shipped an agentic GitHub PR analytics system that cut manual review effort by 85%.

My sweet spot is the intersection of AI and software engineering: building multi-agent systems, fine-tuning LLMs for code analysis, and designing cloud-native architectures on AWS.

5+ Years of Experience
MS · CS @ Virginia Tech
AI · LLMs & Multi-Agent Systems

Experience

Commonwealth Cyber Initiative

Researcher

Jan 2025 - Present

Arlington, VA

Current

Implemented an LLM agent system on LangGraph to triage security alerts and auto-generate detection rules, reducing mean time to detect by 50%. Built reusable RAG ingestion and retrieval pipelines on LlamaIndex and ChromaDB supporting vector and knowledge-graph modes, citation-aware querying, and incremental hash-based re-indexing. Designed a tool-first agent platform enabling composition of specialist AI assistants from a shared tool registry (RAG, SIEM, formal verification, web search, fine-tuning), with a provider-abstracted LLM layer and MCP server exposure for AI-IDE integration.

LangGraph · LlamaIndex · ChromaDB · RAG · MCP · SIEM

American Chemical Society

Software Engineering Intern

May 2025 - Aug 2025

Washington, DC

Architected an agentic workflow on the ACS Publications integration platform using LangGraph's ReAct loop, orchestrating multi-step API integrations across AWS services and third-party SaaS vendors, reducing edge-case integration failure resolution time by 70%. Engineered an agentic GitHub PR analytics system to eliminate manual identification of high-risk PRs by automating data collection, cutting review effort by 85%. Set up a GitHub Actions CI/CD pipeline for ACS Pubs repos, increasing deployment frequency from weekly to over 10 deployments per day.

LangGraph · ReAct · AWS · GitHub Actions · CI/CD · Python

TATA AIG General Insurance

Software Engineer

Sep 2022 - Jul 2024

Gurugram, India

Developed backend services for a highly scalable policy creation and modification portal using Lambda, RDS, Node.js, and S3, achieving a 90% improvement in turnaround time (TAT). Built a cross-organization report and access management platform using React, EC2, and Express.js, reducing report generation time by 80%. Integrated the legacy Genesis Configurator API into the backend system and implemented product APIs with authorization and rate limiting using AWS API Gateway, VPC, and Redis, enabling new vendor integrations and partnerships.

AWS Lambda · React · Node.js · Redis · API Gateway

Infosys Limited

System Engineer

Nov 2020 - Aug 2022

Mysore, India

Built a data sync pipeline for a banking client to sync customer and loan records across RDS, S3, and downstream systems, implementing incremental batch processing with offset-based pagination and automated status tracking. Designed and implemented a retrying microservice for syncing cases to downstream systems using Step Functions and Redis, reducing the failure rate by 70%. Mentored new employees on clean code and best practices.

AWS RDS · S3 · Step Functions · Redis · Microservices

Projects

N-PORT Filing Tool (Corgi)

An automated N-PORT filing generation and compliance tool for ETF issuers. Ingests fund holdings via CSV/JSON, runs 30+ SEC compliance validations, generates EDGAR-ready XML, and enforces multi-role approval workflows with immutable audit trails. Built as a demo for Corgi, compressing weeks of fund operations into hours.

Python · FastAPI · React · TypeScript · SQLAlchemy · Tailwind CSS

AI Code Review Agent

A multi-agent code review system using LangGraph with 4 parallel specialist agents (security, performance, style, test coverage) and a multi-provider LLM abstraction supporting Groq, Gemini, OpenAI, and Anthropic. Smart diff chunking with priority-based file ranking achieved 100% context precision, an 86% F1 score, and 61% token savings, validated against real freeCodeCamp PRs.

Python · LangGraph · FastAPI · Next.js · Docker · Groq

Bug Localization & Repair Using LLMs

A dual-model system using a fine-tuned Llama 3B Instruct model for automated bug localization and repair, benchmarked against GPT-4 and Qwen Coder 32B. Augmented the QuixBugs dataset with syntactic and logical single-line bugs, demonstrating that targeted fine-tuning can outperform larger general-purpose LLMs in accuracy.

PyTorch · Llama 3B · GPT-4 · Fine-tuning · Python

Portfolio Website

A modern, responsive developer portfolio built with Next.js and Tailwind CSS. Deployed on Vercel with optimized performance and SEO.

Next.js · TypeScript · Tailwind CSS · Vercel

Skills & Technologies

Languages

Python · TypeScript · JavaScript · Java · SQL

Frontend

React · Next.js · Tailwind CSS · HTML/CSS

Backend

Node.js · FastAPI · REST APIs · GraphQL

AI / ML

LLMs · NLP · LangChain · OpenAI API

DevOps & Tools

Git · Docker · AWS · Vercel · CI/CD

Databases

PostgreSQL · MongoDB · Redis · Firebase

Get In Touch

I'm currently open to new opportunities and always happy to connect. Whether you have a question, a project idea, or just want to say hi, feel free to reach out.

Say Hello