
Multi Constrained Code-Switched Text Modelling
Guide: Prof. Preethi Jyothi, CSALT Lab | B.Tech. Project at IIT Bombay in collaboration with Google Research India | 2022 -
- Currently working on a generalised framework for code-switched text generation, with multiple possible constraints on formality, politeness, semantic similarity, etc – test tasks on English-Hindi and English-Spanish
- Exploring multiple MLM techniques based on word language to improve code-switched text generataion model.
- Working on Frequency MLM technique which improves the text generation without the word language information

Snapshot Metadata Garbage Collection
Software Engineering Internship | Rubrik, Bangalore, India | 2022
- Worked on a GC algorithm for a snapshot tool in a cloud data backup and recovery system using Go Lang & Korg.
- Developed a mechanism to track expired critical data snapshots which were blocking 30-40% of the database table.
- Ideated & implemented a framework to periodically find unnecessary snapshots and remove them from the database

Protein Language Modeling with ESM1b-e2e
Prof. Martin J. Lercher, Alexander Kroll | Research Internship at HHU, Dusseldorf, Germany | 2021
- Worked on a BERT based model on PyTorch to create high dimensional representations for enzyme sequences for the prediction of enzyme-substrate binding (binary classification task)
- Adapted ESM model to build an end-to-end model which outperformed the existing SOTA models for the task
- Improved accuracy for the task was achieved from 79.8% to 87.5% with the new representations that were created with parallel training on 8 GPUs for 10 epochs

Video Denoising using Low-Rank Matrix Completion
Guide: Prof. Ajit Rajwade | Course Project: Advanced Image Processing
- Applied Adaptive Median Filtering algorithm for removing impulsive noise from the spatial domain of the video
- Implemented Three Step Cross Search on MATLAB for organising similar patches across the temporal domain
- Adapted Fixed Point Iterative Algorithm for reducing Poisson and Gaussian noise from the image sequence

Blockchain: Simulation of P2P Cryptocurreny Network
Guide: Prof. Vinay J. Ribeiro | Course Project: Introduction to Blockchains, Cryptocurrencies, and Smart Contract
- Build a discrete-event simulator for P2P cryptocurrency network on python to study the blockchain network
- Implemented selfish mining attack proposed by Eyal and Sirer in the paper ”Majority is not Enough”
- Build a layer-2 Decentralised APPlication (DAPP) on top of Etherium blockchain using Solidity and Ganache

Cache Replacement for Graph Application
Guide: Prof. Biswa | Course Project: Computer Architecture
- Worked on P-OPT algorithm proposed in the paper "Practical Optimal Cache Replacement for Graphs"
- Used Champsim cache simulator to extract data and implement the algorithm for a fixed graph benchmark
- Achived an improvement in LLC load hit rate by a factor of 3x as compared to LRU replacement policy

SnapMath, Image to LaTeX convertor
Institute Technical Summer Project | Institute Technical Council
- Implemented a CNN and LSTM based model on PyTorch for generating LaTeX expression of the input equation
- Adapted the OpenAI problem statement and used im2latex-100k dataset to achieve a BLUE-4 score of 38.82
- Deployed the model on a Django based interactive web application and integrated the Django-TeX parser

Online Coding and Development Environment
Guide: Prof. Amitabha Sanyal | Course Project: Software and System Labs
- Created a web platform providing an in-browser IDE and support for user organised coding competitions
- Used Django, Angular and PostgreSQL to create an interactive user interface with secure user-authentication
- Implemented user directory separation, multiple language support and isolation using Docker for security.

Video from Single Exposure Coded Snapshot
Guide: Prof. Ajit Rajwade | Course Project: Advanced Image Processing
- Adapted publication from ICCV’11 to reconstruct spatial and temporal domain of the video from coded snapshot
- Implemented Orthogonal Matching Pursuit algorithm for sparse reconstruction to achieve RMSE of 0.03301

Image Compression using Quad-Tree
Guide: Prof. Ajit A. Diwan | Course Project: Data Structures and Algorithm
- Created a Region quad-tree class in C++ to store grey-scale images with highly optimised space complexity
- Implemented optimised algorithms to allow for intersection, overlap, resize, complement, and extraction of images

Testing Transport Layer Protocols
Guide: Prof. Vinay Ribeiro | Course Project: Computer Networks
- Implemented client and server using Socket Programming in C, to send files using different variants of TCP
- Used Bash to automate experiments and generate plots for comparing throughput, delay, and packet loss
- Recorded network traffic using Wireshark and analysed window scaling graphs for TCP Cubic and TCP Reno

Robust Mastermind Player
Guide: Prof. Ashutosh Gupta | Course Project: Logic for Computer Science
- Encoded moves of the mastermind game into an SAT problem and solved using conflict driven clause learning
- Implemented a solver in Python using z3py library which was robust to the other player lying up to 30% of the time

AtDoc: Online Doctor Consultation Service
Database and Backend Internship
- Lead the backend development in a telemedicine startup to schedule non-contact consultations for patients
- Implemented a database for storing encrypted information of doctors and patients on AWS using PostgreSQL
- Worked on a Django based backend enabling end-to-end features including in-app audio and video calling