Multi Constrained Code-Switched Text Modelling

Guide: Prof. Preethi Jyothi, CSALT Lab | B.Tech. Project at IIT Bombay in collaboration with Google Research India | 2022 -

  • Currently working on a generalised framework for code-switched text generation, with multiple possible constraints on formality, politeness, semantic similarity, etc – test tasks on English-Hindi and English-Spanish
  • Exploring multiple MLM techniques based on word language to improve code-switched text generataion model.
  • Working on Frequency MLM technique which improves the text generation without the word language information

Snapshot Metadata Garbage Collection

Software Engineering Internship | Rubrik, Bangalore, India | 2022

  • Worked on a GC algorithm for a snapshot tool in a cloud data backup and recovery system using Go Lang & Korg.
  • Developed a mechanism to track expired critical data snapshots which were blocking 30-40% of the database table.
  • Ideated & implemented a framework to periodically find unnecessary snapshots and remove them from the database

Protein Language Modeling with ESM1b-e2e

Prof. Martin J. Lercher, Alexander Kroll | Research Internship at HHU, Dusseldorf, Germany | 2021

  • Worked on a BERT based model on PyTorch to create high dimensional representations for enzyme sequences for the prediction of enzyme-substrate binding (binary classification task)
  • Adapted ESM model to build an end-to-end model which outperformed the existing SOTA models for the task
  • Improved accuracy for the task was achieved from 79.8% to 87.5% with the new representations that were created with parallel training on 8 GPUs for 10 epochs

Video Denoising using Low-Rank Matrix Completion

Guide: Prof. Ajit Rajwade | Course Project: Advanced Image Processing

  • Applied Adaptive Median Filtering algorithm for removing impulsive noise from the spatial domain of the video
  • Implemented Three Step Cross Search on MATLAB for organising similar patches across the temporal domain
  • Adapted Fixed Point Iterative Algorithm for reducing Poisson and Gaussian noise from the image sequence

Blockchain: Simulation of P2P Cryptocurreny Network

Guide: Prof. Vinay J. Ribeiro | Course Project: Introduction to Blockchains, Cryptocurrencies, and Smart Contract

  • Build a discrete-event simulator for P2P cryptocurrency network on python to study the blockchain network
  • Implemented selfish mining attack proposed by Eyal and Sirer in the paper ”Majority is not Enough”
  • Build a layer-2 Decentralised APPlication (DAPP) on top of Etherium blockchain using Solidity and Ganache

Cache Replacement for Graph Application

Guide: Prof. Biswa | Course Project: Computer Architecture

  • Worked on P-OPT algorithm proposed in the paper "Practical Optimal Cache Replacement for Graphs"
  • Used Champsim cache simulator to extract data and implement the algorithm for a fixed graph benchmark
  • Achived an improvement in LLC load hit rate by a factor of 3x as compared to LRU replacement policy

SnapMath, Image to LaTeX convertor

Institute Technical Summer Project | Institute Technical Council

  • Implemented a CNN and LSTM based model on PyTorch for generating LaTeX expression of the input equation
  • Adapted the OpenAI problem statement and used im2latex-100k dataset to achieve a BLUE-4 score of 38.82
  • Deployed the model on a Django based interactive web application and integrated the Django-TeX parser

Online Coding and Development Environment

Guide: Prof. Amitabha Sanyal | Course Project: Software and System Labs

  • Created a web platform providing an in-browser IDE and support for user organised coding competitions
  • Used Django, Angular and PostgreSQL to create an interactive user interface with secure user-authentication
  • Implemented user directory separation, multiple language support and isolation using Docker for security.

Video from Single Exposure Coded Snapshot

Guide: Prof. Ajit Rajwade | Course Project: Advanced Image Processing

  • Adapted publication from ICCV’11 to reconstruct spatial and temporal domain of the video from coded snapshot
  • Implemented Orthogonal Matching Pursuit algorithm for sparse reconstruction to achieve RMSE of 0.03301

Image Compression using Quad-Tree

Guide: Prof. Ajit A. Diwan | Course Project: Data Structures and Algorithm

  • Created a Region quad-tree class in C++ to store grey-scale images with highly optimised space complexity
  • Implemented optimised algorithms to allow for intersection, overlap, resize, complement, and extraction of images

Testing Transport Layer Protocols

Guide: Prof. Vinay Ribeiro | Course Project: Computer Networks

  • Implemented client and server using Socket Programming in C, to send files using different variants of TCP
  • Used Bash to automate experiments and generate plots for comparing throughput, delay, and packet loss
  • Recorded network traffic using Wireshark and analysed window scaling graphs for TCP Cubic and TCP Reno

Robust Mastermind Player

Guide: Prof. Ashutosh Gupta | Course Project: Logic for Computer Science

  • Encoded moves of the mastermind game into an SAT problem and solved using conflict driven clause learning
  • Implemented a solver in Python using z3py library which was robust to the other player lying up to 30% of the time

AtDoc: Online Doctor Consultation Service

Database and Backend Internship

  • Lead the backend development in a telemedicine startup to schedule non-contact consultations for patients
  • Implemented a database for storing encrypted information of doctors and patients on AWS using PostgreSQL
  • Worked on a Django based backend enabling end-to-end features including in-app audio and video calling