Karan Pathak

When something is important enough, you do it even if the odds are not in your favor

Elon Musk

TECHNICAL WORK EXPERIENCE

  • DataTurtles LLP

  • Chief Technology Officer

Jul 2020 – Jul 2022 | Noida, India
  • Managed and delivered custom software solutions for clients to increase their business growth.
  • Led a four-person team of full-stack developers & UI/UX designers to build software solutions using ExpressJS, ReactJS, PostgreSQL, AWS, and Vercel.
  • Analyzed business requirements and designed cost-efficient end-to-end scalable customer engagement systems such as WhatsApp chatbot, Lead & Email marketing tool, etc.
  • Collaborated with a 3-person team to develop and deploy a Gardening E-Commerce Marketplace solution on AWS Fargate using AWS ECS service.
  • Developed a forecasting model using LSTM in PyTorch for a Fortune 1000 company to reduce raw material storage costs by 23%.
  • Created a Software Development Kit in Python (similar to AWS Boto3) for an AI-Cloud Platform to automate & manage the computing, AI, pipelining, data storage, and model deployment on the platform through the SDK.

  • Schlumberger Limited

  • Data Analytics Engineer

  • |Sub-Surface Team

Jul 2018 – Jun 2020 | Pune, India
  • Worked on automating Seismic Fault interpretation by building deep convolutional semantic segmentation models in Keras to achieve human-level accuracy and reduce manual efforts from 3 months to 40 hours.
  • Co-developed an automated Seabed detection solution using a weakly supervised learning convolutional model which outperformed the state-of-the-art solution by 4%.
  • Collaborated with a data engineer to develop a distributed prediction service (deployed in GCP) using Zarr & PySpark to reduce prediction time from 68 hours to 14 hours for a 1 PB file.
  • Developed a post-processor for the Seismic Fault interpretation pipeline using a 3D U-Net in Keras to remove spatiotemporal seismic noises and increase the overall prediction accuracy by 7%.


  • Schlumberger Limited

  • Software Engineer

  • |Data Ingestion Team

Jul 2017 – Jun 2018 | Pune, India
  • Converted a monolithic application, to parse & store DLIS files (petroleum log file format) in a petroleum Logstore, into a micro-services application using ExpressJS & GCP (compute engine & Pub/Sub). Decreased the processing time from 3 mins to 35 seconds for a 1 GB file.
  • Built a client library in Python for accessing well logs from a petroleum well logs storage system.

  • ADmyBRAND

  • Data Science Intern

  • |8hoarding

15th August 2016 - 30th September 2016

Worked for an advertisement management startup. Internship was focused on data extraction and data cleaning for media hoarding booking product named “8hoarding”.

  • Built web scrapers to extract hoarding information from different websites.
  • Developed complex scrapers to handle AJAX pagination on websites.
  • Used SFrame python module for data cleaning.

TECHNICAL PROJECTS

FloodMetaSegNet - Flood Damage Extent Detection Using Satellite Images and Meta Attributes

  • Simon Fraser University
Oct 2022 - present
  • Co-developed FloodMetaSegNet (deep convolutional semantic segmentation model with meta channel) using PyTorch, GeoPandas, Rasterio and Shapely to determine the extent of building damage caused by floods using satellite images and elevation attributes.
  • Applied transfer learning using renowned flood damage datasets to overcome the data shortage.
  • Increased Dice score by 80% and Tversky score by 30% as compared with segmentation models without meta attributes.

Gardening E-Commerce Marketplace

  • DataTurtles LLP
Oct 2021 - Jul 2022
  • Collaborated with a 3-person team to develop a Gardening E-Commerce Marketplace solution using ReactJS, ExpressJS, PostgreSQL, and Prisma (ORM).
  • Deployed the solution to AWS Fargate using AWS ECS service to handle a variable concurrent customer load.

Error logging module

  • DataTurtles LLP
Jun 2022 - Jun 2022
  • Designed and co-developed a NodeJS package to log information and errors from NodeJS cloud applications into different services like Axiom, GCP Cloud Logging, AWS CloudWatch, etc., which reduced the development time.
  • Used in all the NodeJS cloud applications in the company.

Email Marketing Campaign Tool

  • DataTurtles LLP
May 2022 - May 2022
  • Designed a cost-efficient email marketing campaign system to manage customer leads and schedule email sending based on the customer’s persona without landing emails in the spam folder.
  • Co-created microservices in ExpressJS to upload leads to the PostgreSQL database, manage email scheduling, and take actions based on customer responses.
  • Increase B2B product sales by 38%.

WhatsApp Chatbot for Customer Engagement

  • DataTurtles LLP
Apr 2022 - May 2022
  • Designed a rule-based Shopify-WhatsApp chatbot to enhance customer engagement experience like impulsive buying, customer inquiry resolution, etc.
  • Co-developed the application in ExpressJS and deployed it on Vercel (Serverless compute platform) and increased B2C product sales by 45%.

AI-Platform SDK

  • DataTurtles LLP
Feb 2022 - Mar 2022
  • Designed and created a Software Development Kit in Python to automate & manage the compute, data storage, and model deployment on the AI-Platform through the SDK.
  • Helped platform's clients in running CPU & GPU-based data workloads and inference pipelines in development and production scenarios.
  • Deployed and served AI models as API using BentoML

Chemical Raw Material Inventory Optimization using Time-Series forecasting

  • DataTurtles LLP
Dec 2021 - Feb 2022
  • Applied an LSTM model in PyTorch to forecast the consumption of individual chemicals in a complex series of chemical processes.
  • Reduced the raw material inventory storage costs by 23% by achieving a mean MAPE score of 8.7%.

Alzheimer Research with Stall Catchers – Classify Stalled Blood Vessels In Brain Images

  • Personal
Jul 2020 - Aug 2020
  • Applied a CNN-LSTM network with a time-distributed layer on the Alzheimer video dataset to classify if a blood vessel is stalled or not in PyTorch.
  • Achieved the best Mathews Correlation Coefficient of 0.548 with EfficientNet-B2 spatial encoder.

Fault geobody detection

  • Deep Learning Application
  • |Schlumberger
December 2018 - Jan 2020

Global model for fault geobody detection using deep supervised learning

  • Built a supervised 2D deep CNN to detect fault geobody in seismic signals.
  • Trained on multiple surveys with different distributions to achieve a global model.
  • Used Unet as the model architecture.

Seismic Noise removal using V-Net

Built a supervised 3D CNN model to enhance fault detection by removing noise (small faults, false positives and non-laterally pervasive faults) from global fault model predictions.

Salt body detection using network-based deep transfer learning

  • Deep Learning Application
  • |Schlumberger
July 2018 – November 2018
  • Built a model to detect the Top of Salt geobody using very few training data.
  • Used Tiramisu architecture for semantic segmentation.
  • Performed transfer learning by training only the last few layers of the decoder of a model pre-trained on seismic fault data and froze the remaining layers of the model.

Sea-Floor detection using weakly supervised deep learning

  • Deep Learning Research Application
  • |Schlumberger
July 2019 – September 2019
  • Designed and developed a unique process for accurately detecting seafloor.
  • Built a global model using weakly supervised learning to predict seafloor and applied a series of smart post processors on top of the model’s prediction to increase the overall accuracy.
  • Filed provisional patent. U.S. patent no.: 62/914,608

Real-Time Hand Gesture-based Mouse Control

  • Vellore Institute of Technology
Feb 2017 - May 2017
  • Developed a novel rotation and scale-invariant pipeline for controlling mouse pointers using a webcam. Demo
  • Defined custom hand gestures to imitate all the current mouse functionalities, including the drag-and-drop feature.

Client Library for Logstore

  • Software Engineering
  • |Schlumberger
March 2018 – June 2018

Logstore is a distributed logs storage system used to store well logs of an Oil Field. I worked on building client library for logtore, and integrated logstore with different microservices developed by my team using the client library.

DLIS parser

  • Software Engineering
  • |Schlumberger
October 2017 – January 2018

A DLIS file is a log file created by sensors that monitor petroleum wells. I worked on creating a DLIS parser for extracting log headers and data from a DLIS file and store the information extracted in the logstore.

Clustering using Brainstorm Optimization

  • Machine Learning
  • |Coursework
March 2016 – May 2016

Applied Brainstorm optimization, a swarm intelligence technique, to find optimal cluster clusters in protein dataset. Used Dunn index and DB index for internal cluster validity indices.

Smart Gui for linux terminal

  • Operating Systems
  • |Coursework
February 2015 – April 2015

Developed a smart terminal GUI for Debian Linux Kernel 3.14-kali. Created new commands using linux system calls. Stored the user’s history to filter top five frequently used commands and displayed it on the GUI.

Code: View Code

Text search using Trie

  • Data Structure & Algorithms
  • |Coursework
August 2014 - October 2014

Developed a cross-platform desktop app in C++ and Qt for searching text in a given paragraph using Trie data structure.

Code: View Code

KUcart Ecommerce

  • Web Development
  • |Coursework
March 2015 – April 2015

A simple E-commerce website.

Code: View Code

Publications

  • Co-authored a patent FEATURE DETECTION IN SEISMIC DATA while I was working on seafloor detection project in Schlumberger. Link to Patent

Conferences

  • Presented a paper titled “Hand Gesture Recognition and FingerTip Detection using Neural Networks” in World Summit On Advances in Science, Engineering and Technology conference at the University of Cambridge, U.K. in 2018

Presentations

  • Presented my research work on Fault and Seafloor detection using deep learning in TECHDAY conference (internal) at Schlumberger Pune
  • Gave a presentation on Demystifying Data Science at Decodr Technologies
  • Gave a presentation on Starting a Career in Data Science at IEEE ComSoc SBC, UKFCET

Achievements

  • Quater Finalist in Swadeshi Microprocessor Challenge, Aatma Nirbhar Bharat Abhiyan, Govt. of India. Certificate
  • Ranked 8th in ICPR 2020 conference - Pollen Grain Classification Challenge Certificate
  • Certificate of Appreciation for presenting Presented my paper titled “Hand Gesture Recognition and FingerTip Detection using Neural Networks” in World Summit On Advances in Science, Engineering and Technology conference at the University of Cambridge, U.K. in 2018. Certificate
  • Ranked 232nd/6334 (Top 4%) in HackerEarth Deep Learning Challenge — Auto-tag Images of the Gala
  • Silver Medal — Ranked 674th/9151 (Top 8%) in Week of Code 29
  • Bronze Medal — Ranked 1568th/9026 (Top 18%) in Week of Code 21
  • Certificate of Appreciation for presenting my research work on Fault and Seafloor detection using deep learning in TECHDAY conference (internal) at Schlumberger Pune Certificate
  • Cleared 1st round of CodeVita Season IV - an online competitive coding competition conducted by TCS.

Education

Simon Fraser University

Master of Science in Professional Computer Science (VisualComputing specialization)

Cumulative GPA: 3.78 / 4.33

AREA OF INTEREST
  • Distributed systems and algorithms
  • Semantic segmentation
  • Generative adversarial network
RELATED COURSES
    • Distributed and Cloud Systems
    • Practices in visual computing 1
    • Practices in visual computing 2
    • Data Science in Business
    • Machine Learning
September 2022 - Present

Vellore Institute of Technology

Bachelor of Technology in Computer Science and Engineering

Cumulative GPA: 8.92 / 10

AREA OF INTEREST
  • Data Structures and Algorithms
  • Computer vision
  • Reinforcement learning
  • Self-driving cars
RELATED COURSES
  • Data Science
    • Agent based intelligent systems
    • Soft computing
    • Computer vision
    • Data warehousing and data mining
  • Mathematics
    • Multivariable calculus and differential equations
    • Differential and difference equations
    • Discrete mathematical equations
    • Linear algebra
    • Numerical Analysis
    • Graph theory and its applications
    • Applied probability, statistics and reliability
  • Data Structures and Algorithms
    • Data Structures and Algorithms
    • Algorithm design and analysis
CAPSTONE PROJECT DEMO : https://www.youtube.com/watch?v=gcHJ179Yop4
July 2013 - August 2017

Somerville School

Science Student

Class XII CBSE - AISSCE: 85.2% (Maths 90%, Physics 91%, Computer Science 90%)

Class X CBSE - AISSE: 9.0/10 (Maths:10, Science: 10, Social Science:10)

April 2006 - February 2012

Certifications

  • Coursera

Deep Learning specialization by deeplearning.ai
  • Neural Networks and Deep Learning see credentials
  • Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization see credentials
  • Structuring Machine Learning Projects see credentials
  • Convolutional Neural Networks
TensorFlow in Practice specialization by deeplearning.ai
  • Introduction to TensorFlow for Artificial Intelligence, Machine Learning, and Deep Learning see credentials
  • Convolutional Neural Networks in TensorFlow
Courses
  • DataCamp
  • edX

Courses