SRC ETL DWH API EC2 S3 RDS KAFKA STREAMS AI AGENT </> {...}

Vishal Srivastava

Building scalable systems and intelligent AI

About Me

My journey in software engineering has been driven by a deep curiosity for how technology transforms industries. From building enterprise-grade distributed data platforms in banking at Barclays, to developing geospatial analysis algorithms for subsurface exploration at Boundary Remote Subsurface Solutions, to pushing the boundaries of computer vision research at UC Irvine — each experience has deepened my conviction that great software engineering is about understanding the domain as deeply as the code.

I discovered my passion for engineering during my undergraduate years at SRM, where building my first distributed system showed me the elegance of solving complex problems through thoughtful architecture. This led me to Cognizant and MAQ Software, where I learned the discipline of enterprise-scale development. At Ford, I saw how automation and intelligent systems could transform manufacturing workflows. At Barclays, I experienced the rigor demanded by financial systems processing millions of transactions daily.

Now pursuing my MS in Data Science at UC Irvine, I'm at the intersection of my two greatest interests — scalable systems and artificial intelligence. Whether it's designing event-driven architectures, building ML pipelines, or crafting multi-agent AI systems, I'm energized by problems that require both engineering depth and creative thinking.

Download Resume

Software Architecture

Designing event-driven, scalable systems with microservices, distributed data platforms, and cloud-native architectures.

AI & Machine Learning

Building ML pipelines, training computer vision models, and deploying multi-agent AI systems for complex problem-solving.

Data Systems

Engineering data pipelines with medallion architecture, streaming systems with Kafka, and analytics platforms.

Full Stack Development

End-to-end project ownership from architecture design and implementation to deployment and monitoring.

Education

University of California, Irvine

Master of Science in Data Science
Sep 2024 – Dec 2025
Irvine, California, USA

SRM Institute of Science & Technology

Bachelor of Technology in Computer Science
Jul 2017 – Jun 2021
Chennai, Tamil Nadu, India

Experience

Barclays

Software Developer (Data)
Feb 2022 – Jun 2024 | India
Architected migration of Barclays' data platform from legacy on-prem to AWS. Designed fault-tolerant orchestration with Step Functions and scalable ETL with Glue/Lambda, reducing cloud costs by 45% while handling 100TB+ daily ingestion. Built event-driven notification system serving 250,000+ customers.

Ford Motor Company

Software Developer
Oct 2021 – Feb 2022 | India
Created reusable post-deployment health-check framework reducing production incidents. Automated Confluence documentation generation saving 7-8 person-days/month. Built Knowledge Graph automation eliminating 90% of manual data workflow effort.

UC Irvine

Graduate Researcher
Jun 2025 – Sep 2025 | Irvine
Improved object detection accuracy by 23% using custom CenterNet-ResNet50 with CNN Super-Resolution. Achieved 94% precision, 89% recall for satellite imagery analysis. Reduced inference time by 35% through Residual Deblurring in PyTorch.

Boundary Remote Subsurface Solutions

Algorithm Developer
Jun 2025 – Sep 2025 | Irvine
Built Python-based geospatial pipeline processing 50GB+ Magnetotelluric sensor data, reducing analysis time by 70%. Implemented RBF interpolation for subsurface hydrocarbon detection with 95% accuracy in resistivity profiling.

MAQ Software

SWE Intern
India
Developed data-driven BI solutions and automated reporting pipelines for enterprise clients. Contributed to dashboard development and data integration projects improving client decision-making processes.

Cognizant

Full Stack Developer Intern
India
Built full-stack web applications with modern frameworks. Worked on enterprise applications with microservices architecture, RESTful APIs, and CI/CD practices in large-scale environment.

Projects

E-Commerce Analytics

Production-grade data platform implementing medallion architecture (bronze→silver→gold) for real-time e-commerce analytics with sub-minute latency.
Read More

AgentForge AI

Multi-agent AI platform on AWS with modular infrastructure-as-code, featuring coordinated agents for complex software engineering tasks.
Read More

FlashDB

High-performance distributed key-value store from scratch using LSM-tree architecture with Bloom filters and consistent hashing.
Read More

Moderation AI

Multi-agent NLP system for real-time Reddit content moderation using open-source LLMs achieving 91% toxicity detection accuracy.
Read More

β-VAE Research

Systematic research on β-Variational Autoencoders for learning disentangled representations achieving 92% reconstruction accuracy.
Read More

Satellite Detection

End-to-end computer vision pipeline for detecting vehicles in low-resolution satellite imagery with 94% precision and 89% recall.
Read More

Skills

Programming

Python Java C++ SQL R Scala JavaScript Bash

ML & AI

PyTorch TensorFlow Transformers LLMs LangChain LangGraph RAG LoRA / PEFT NLP Computer Vision Scikit-Learn HuggingFace MLflow AI Agents Optuna

Data Engineering

Apache Spark Kafka Airflow dbt ETL Pipelines Hadoop Streaming Great Expectations Data Modeling

Databases

PostgreSQL MySQL MongoDB DynamoDB Snowflake Redis Pinecone Elasticsearch

Cloud & DevOps

AWS Terraform Docker Kubernetes CI/CD CloudFormation GitHub Actions Lambda Step Functions

Frameworks & Tools

FastAPI Flask Django Streamlit Git Jenkins Tableau Linux
"Data is the new oil. It's valuable, but if unrefined, it cannot really be used."
— Clive Humby

Get In Touch

Location

Irvine, California, USA