Kapil Mirchandani

Hi, I'm Kapil Mirchandani

A Machine Learning Engineer, passionate about building robust models and scalable serving pipelines.

About

I am a Machine Learning Engineer with two years of experience, currently studying at The University of Ottawa, Specializing in Applied Artificial Intelligence. I enjoy applying Machine Learning to end-to-end real-world tasks, including model research, as well as building inference pipelines for model deployment. I am proficient in TensorFlow and PyTorch, as well as the web frameworks Django and Flask for model serving. I have worked with a variety of Machine Learning applications, including Generative Networks, OCR,Semantic Similarity, Object Detection, Optical Character Recognition, Text Clustering and most recently, Prompt Engineering. My skills also include MLOps – this includes hosting Tensorflow models on AWS, optimizing their performance and writing the backend to call these models, and parse and store their outputs.

Education

MEng, Electrical and Computer Engineering (Concentration in Applied Artificial Intelligence)

University of Ottawa

September 2023 - April 2025 (Expected)

B.E., Electronics and Telecommunication Engineering

Pune Institute of Computer Technology

CGPA: 9.45/10

August 2017 - May 2021

Higher Secondary Education

Fergusson College

Grade: 86.66%

August 2015 - May 2017

Secondary Education

The Bishop's School, Pune

Grade: 94.63%

June 2003 - May 2015

Experience

Natural Resources
Canada

Data Scientist Intern –
Quantum AI Research

Building classical neural networks with parameter constraints to classify image data for comparative analysis with quantum neural networks.
Conducting detailed comparisons to explore the interplay and advantages of classical and quantum neural networks.

January 2025 - Present | Ottawa, Ontario, Canada

Ford Motor Company

Software Developer – SYNC HMI Intern

Implemented an automated bug ticket triaging system using word embeddings and a machine learning classifier, achieving 80% accuracy in routing tickets based on titles and descriptions, saving 15 hours of total developer time per week.
Analyzed over 2 billion log lines, extracting insights such as high-memory processes and low-impact crashes, leading to fixes that reduced crashes by 20% and memory usage by 10%.
Automated configuration validations and detection of common issues, integrating these solutions with Jira, which reduced the number of tickets requiring manual triaging by 50%.
Parsed over 1 billion log lines to build a classification dataset of over 2000 data points for machine learning, extracting key features based on developers' manual analysis processes.
Applied machine learning classifiers to the dataset, effectively handling imbalanced data using ADASYN, and achieved an F1 score of 82%.

May 2024 – December 2024 | Ottawa, Ontario, Canada

Avoma

Machine Learning Engineer

Fine-tuned transformer-based embeddings, using bi-encoder and cross-encoder architectures to build a semantic similarity model which achieved an accuracy of 88% on meeting transcript data.
Leveraged LLMs, along with RAG, particularly GPT-3.5 and GPT-4, for various summarization, rephrasing, question-answering, and sentence classification tasks, to deploy 6 new features, including transcript summarization and automated notetaking.
Implemented image processing and OCR techniques to identify the current speaker from virtual meeting interfaces (Zoom, Google Meet, MS Teams), improving the accuracy of the current speaker recognition system from 67% to 88%.
Developed a feature using clustering on over 100,000 meetings to rank top FAQs and reveal key themes; its deployment increased average CSAT score from 7 to 8.
Involved in system and data platform design for deployment of these models, using Django, PostgreSQL and AWS EC2 and ECS to build end-to-end Machine Learning inference and training pipelines for real-time sentence classification and transcription.
Optimized inference speed of Tensorflow models, using the gRPC protocol and AVX acceleration achieving a 30% decrease in latency.
Set-up and maintained automated CI/CD pipelines, using Tensorflow serving, Docker and Github actions, automating and reducing deployment times from approximately 10 minutes to 2 minutes.

March 2022 - July 2023 | Palo Alto, CA, USA (Remote)

Helpshift

Software Engineer I (Machine Learning)

Developed advanced machine learning solutions to automate end-user issue resolution in customer service.
Conducted research, scoping, benchmarking, and deployment of cutting-edge NLP algorithms, including user message classification and language detection.
Engineered scalable microservice inference pipelines for machine learning models using Django, Flask, and FastAPI, which supported over 1 million requests per day.
Diagnosed and resolved critical bugs, optimizing underperforming models for clients, by using oversampling or adjusting class weights on imbalanced datasets, boosting classification accuracy from 70% to 90%.

July 2021 - February 2022 | Pune, Maharashtra, India

Segmind

Deep Learning Intern

Contributed to the development of CRAL, a library used for abstraction of well known deep learning architectures for Computer Vision.
Worked on the addition of well known deep learning architectures for computer vision into the library.
Intensively involved in implementation, integration and testing of object detection models.
Achieved mAP scores of more than 0.6 on standard benchmark datasets for all the integrated object detection models.

July 2020 - October 2020 | Remote (Bangalore, India)

Publications

DPSRGAN: Dilation Patch Super-Resolution Generative Adversarial Networks

Full paper at the 6th International Conference for Convergence in Technology (I2CT), 2021

Developed a novel Generative Adversarial Network architecture to increase the resolution of images.
The input to the network is a low resolution image, which is upscaled natively by the network.
The model is capable of upscaling input image by 4x the original resolution.
The metrics obtained from our DPSRGAN are better than the previously proposed SRGAN, with a MOS of 3.91 out of 5 and a PSNR of 32.24.

Publication Code

Big Data Analytics for Sustainable Cities: Pune Tree Census Data Exploratory Analysis

Full paper at the 11th International Conference for Computing, Communication and Networking Technologies (ICCCNT), 2020

Developed a pipeline for analysis of tree census data using data of Pune, India.
Introduced a novel metric, the Flora Biodiversity Index (FBI), to quantify the diversity of trees in a region.
Drew insights from the data to determine uniformity of tree cover, areas deficient in trees and areas having a lower biodiversity.
Our pipeline will be useful for cities to analyse their current green cover and work on making it better.

Publication Code

Projects

                
ChargeRoute
                  A route optimizer for Electric Vehicles.
                
AccomplishmentsTools: Django, HTML, CSS, Javascript
An application that optimizes routes for Electric Vehicles, providing charging stations
                    while minimizing the time and distance required for the journey. 
Won first place for the Ford Hack at uOttaHack 6.

Cell Free Layer Detector
                  A Deep Learning approach to detect the cell free layer in blood vessels.
                
AccomplishmentsTools: Tensorflow, MATLAB, Streamlit
An Image Segmentation model to detect the Cell Free Layer in blood vessels.
The algorithm achieves an DICE coefficient of 0.91 on the test set.

License Plate Reader
                  An application that can read license plates of vehicles and register offences.
                
AccomplishmentsTools: Django, PyTorch, HTML, CSS, SQLite, PyTesseract
Uses YOLOv3 for detecting license plates and PyTesseract for reading the number.
Can also search the database for corresponding vehicle owner and automatically send them an email.
                  
Network Anomaly Detector
                  A Machine Learning approach towards network threat detection.
                
AccomplishmentsTools: Scikit-Learn, Matplotlib, Seaborn, Django, HTML, CSS
Uses a Machine Learning algorithm to detect network anomalies from an input .pcap file.
The algorithm achieves an accuracy of 99.6% on benchmark datasets.

Survey and Rescue Drone
                  An autonomous drone for survey and rescue.
                
                Accomplishments
                Tools: ROS, OpenCV
A drone capable of detecting beacons and autonomously maneuvering itself in response.
Part of the e-Yantra Robotics Competition (eYRC) held by IIT
                    Bombay.

XOdia 2019
                  An AI Combat game, part of Credenz 2019, PICT's tech fest.
                
AccomplishmentsTools: Django, Docker
A game where participants create bots to play an original board game.
Led a team of 20 juniors for development of this game.

XOdia 2018
                  An AI Combat game, part of Credenz 2018, PICT's tech fest.
                
AccomplishmentsTools: Django, Docker
A game where participants create bots to play an original board game.
Volunteered as a programmer to work on the backend of this game.

FaceGAN
                  A Generative Adversarial Network (GAN) that generates faces of people.
                
AccomplishmentsTool: Tensorflow
A neural network that can generate faces of people (that may not even exist!).
Based on the DCGAN architecture.

Web Portal for Guest Speakers
                  A web portal to track and invite speakers for seminars in college.
                
AccomplishmentsTool:Django
A web portal, where guest speakers can sign up for invitation to take seminars in college.
Also developed admin functionality for speaker approval, deletion etc.

Weather Monitoring System
                  A hardware system that monitors various aspects of the weather.
                
AccomplishmentsTools:Arduino, various sensors
A system capable of measuring and monitoring temperature, AQI, humidity and current light
                    intensity.