Hi, I'm Kapil Mirchandani

A Machine Learning Engineer, passionate about building robust models and scalable serving pipelines.

About

I am a Machine Learning Engineer with two years of experience, currently studying at The University of Ottawa, Specializing in Applied Artificial Intelligence. I enjoy applying Machine Learning to end-to-end real-world tasks, including model research, as well as building inference pipelines for model deployment. I am proficient in TensorFlow and PyTorch, as well as the web frameworks Django and Flask for model serving. I have worked with a variety of Machine Learning applications, including Generative Networks, OCR,Semantic Similarity, Object Detection, Optical Character Recognition, Text Clustering and most recently, Prompt Engineering. My skills also include MLOps – this includes hosting Tensorflow models on AWS, optimizing their performance and writing the backend to call these models, and parse and store their outputs.

Education

MEng, Electrical and Computer Engineering (Concentration in Applied Artificial Intelligence)

University of Ottawa

September 2023 - April 2025 (Expected)

B.E., Electronics and Telecommunication Engineering

Pune Institute of Computer Technology

CGPA: 9.45/10
August 2017 - May 2021

Higher Secondary Education

Fergusson College

Grade: 86.66%
August 2015 - May 2017

Secondary Education

The Bishop's School, Pune

Grade: 94.63%
June 2003 - May 2015

Experience

Data Scientist Intern –
Quantum AI Research
  • Building classical neural networks with parameter constraints to classify image data for comparative analysis with quantum neural networks.
  • Conducting detailed comparisons to explore the interplay and advantages of classical and quantum neural networks.
January 2025 - Present | Ottawa, Ontario, Canada
Software Developer – SYNC HMI Intern
  • Implemented an automated bug ticket triaging system using word embeddings and a machine learning classifier, achieving 80% accuracy in routing tickets based on titles and descriptions, saving 15 hours of total developer time per week.
  • Analyzed over 2 billion log lines, extracting insights such as high-memory processes and low-impact crashes, leading to fixes that reduced crashes by 20% and memory usage by 10%.
  • Automated configuration validations and detection of common issues, integrating these solutions with Jira, which reduced the number of tickets requiring manual triaging by 50%.
  • Parsed over 1 billion log lines to build a classification dataset of over 2000 data points for machine learning, extracting key features based on developers' manual analysis processes.
  • Applied machine learning classifiers to the dataset, effectively handling imbalanced data using ADASYN, and achieved an F1 score of 82%.
May 2024 – December 2024 | Ottawa, Ontario, Canada
Machine Learning Engineer
  • Fine-tuned transformer-based embeddings, using bi-encoder and cross-encoder architectures to build a semantic similarity model which achieved an accuracy of 88% on meeting transcript data.
  • Leveraged LLMs, along with RAG, particularly GPT-3.5 and GPT-4, for various summarization, rephrasing, question-answering, and sentence classification tasks, to deploy 6 new features, including transcript summarization and automated notetaking.
  • Implemented image processing and OCR techniques to identify the current speaker from virtual meeting interfaces (Zoom, Google Meet, MS Teams), improving the accuracy of the current speaker recognition system from 67% to 88%.
  • Developed a feature using clustering on over 100,000 meetings to rank top FAQs and reveal key themes; its deployment increased average CSAT score from 7 to 8.
  • Involved in system and data platform design for deployment of these models, using Django, PostgreSQL and AWS EC2 and ECS to build end-to-end Machine Learning inference and training pipelines for real-time sentence classification and transcription.
  • Optimized inference speed of Tensorflow models, using the gRPC protocol and AVX acceleration achieving a 30% decrease in latency.
  • Set-up and maintained automated CI/CD pipelines, using Tensorflow serving, Docker and Github actions, automating and reducing deployment times from approximately 10 minutes to 2 minutes.
March 2022 - July 2023 | Palo Alto, CA, USA (Remote)
Software Engineer I (Machine Learning)
  • Developed advanced machine learning solutions to automate end-user issue resolution in customer service.
  • Conducted research, scoping, benchmarking, and deployment of cutting-edge NLP algorithms, including user message classification and language detection.
  • Engineered scalable microservice inference pipelines for machine learning models using Django, Flask, and FastAPI, which supported over 1 million requests per day.
  • Diagnosed and resolved critical bugs, optimizing underperforming models for clients, by using oversampling or adjusting class weights on imbalanced datasets, boosting classification accuracy from 70% to 90%.
July 2021 - February 2022 | Pune, Maharashtra, India
Deep Learning Intern
  • Contributed to the development of CRAL, a library used for abstraction of well known deep learning architectures for Computer Vision.
  • Worked on the addition of well known deep learning architectures for computer vision into the library.
  • Intensively involved in implementation, integration and testing of object detection models.
  • Achieved mAP scores of more than 0.6 on standard benchmark datasets for all the integrated object detection models.
July 2020 - October 2020 | Remote (Bangalore, India)

Publications

DPSRGAN: Dilation Patch Super-Resolution Generative Adversarial Networks

Full paper at the 6th International Conference for Convergence in Technology (I2CT), 2021
  • Developed a novel Generative Adversarial Network architecture to increase the resolution of images.
  • The input to the network is a low resolution image, which is upscaled natively by the network.
  • The model is capable of upscaling input image by 4x the original resolution.
  • The metrics obtained from our DPSRGAN are better than the previously proposed SRGAN, with a MOS of 3.91 out of 5 and a PSNR of 32.24.

Big Data Analytics for Sustainable Cities: Pune Tree Census Data Exploratory Analysis

Full paper at the 11th International Conference for Computing, Communication and Networking Technologies (ICCCNT), 2020
  • Developed a pipeline for analysis of tree census data using data of Pune, India.
  • Introduced a novel metric, the Flora Biodiversity Index (FBI), to quantify the diversity of trees in a region.
  • Drew insights from the data to determine uniformity of tree cover, areas deficient in trees and areas having a lower biodiversity.
  • Our pipeline will be useful for cities to analyse their current green cover and work on making it better.

Projects

quiz app
ChargeRoute

A route optimizer for Electric Vehicles.

Accomplishments
  • Tools: Django, HTML, CSS, Javascript
  • An application that optimizes routes for Electric Vehicles, providing charging stations while minimizing the time and distance required for the journey.
  • Won first place for the Ford Hack at uOttaHack 6.
quiz app
Cell Free Layer Detector

A Deep Learning approach to detect the cell free layer in blood vessels.

Accomplishments
  • Tools: Tensorflow, MATLAB, Streamlit
  • An Image Segmentation model to detect the Cell Free Layer in blood vessels.
  • The algorithm achieves an DICE coefficient of 0.91 on the test set.
music streaming app
License Plate Reader

An application that can read license plates of vehicles and register offences.

Accomplishments
  • Tools: Django, PyTorch, HTML, CSS, SQLite, PyTesseract
  • Uses YOLOv3 for detecting license plates and PyTesseract for reading the number.
  • Can also search the database for corresponding vehicle owner and automatically send them an email.
quiz app
Network Anomaly Detector

A Machine Learning approach towards network threat detection.

Accomplishments
  • Tools: Scikit-Learn, Matplotlib, Seaborn, Django, HTML, CSS
  • Uses a Machine Learning algorithm to detect network anomalies from an input .pcap file.
  • The algorithm achieves an accuracy of 99.6% on benchmark datasets.
Screenshot of web app
Survey and Rescue Drone

An autonomous drone for survey and rescue.

Accomplishments
  • Tools: ROS, OpenCV
  • A drone capable of detecting beacons and autonomously maneuvering itself in response.
  • Part of the e-Yantra Robotics Competition (eYRC) held by IIT Bombay.
Screenshot of  web app
XOdia 2019

An AI Combat game, part of Credenz 2019, PICT's tech fest.

Accomplishments
  • Tools: Django, Docker
  • A game where participants create bots to play an original board game.
  • Led a team of 20 juniors for development of this game.
Screenshot of  web app
XOdia 2018

An AI Combat game, part of Credenz 2018, PICT's tech fest.

Accomplishments
  • Tools: Django, Docker
  • A game where participants create bots to play an original board game.
  • Volunteered as a programmer to work on the backend of this game.
Screenshot of  web app
FaceGAN

A Generative Adversarial Network (GAN) that generates faces of people.

Accomplishments
  • Tool: Tensorflow
  • A neural network that can generate faces of people (that may not even exist!).
  • Based on the DCGAN architecture.
Screenshot of  web app
Web Portal for Guest Speakers

A web portal to track and invite speakers for seminars in college.

Accomplishments
  • Tool:Django
  • A web portal, where guest speakers can sign up for invitation to take seminars in college.
  • Also developed admin functionality for speaker approval, deletion etc.
Screenshot of  web app
Weather Monitoring System

A hardware system that monitors various aspects of the weather.

Accomplishments
  • Tools:Arduino, various sensors
  • A system capable of measuring and monitoring temperature, AQI, humidity and current light intensity.

Skills

Languages

Python
C/C++
MATLAB
Java
Clojure
SQL
Javascript

Libraries

NumPy
Pandas
OpenCV
Matplotlib
Seaborn

Frameworks

TensorFlow
PyTorch
Scikit-Learn
Django
Flask
FastAPI
Selenium

Developer Tools

Git
Docker

Databases

MySQL
PostgreSQL
MongoDB

Cloud Technologies

AWS (S3, EC2, Lambda, ECS/ECR)
Tensorflow Serving

Contact

Clicky