Skip to content
View MT23083's full-sized avatar
  • Indraprastha Institiute Of Information Technology Delhi

Block or report MT23083

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MT23083/README.md

Sarthak Sharma

Profile Banner

Profile

Welcome to my GitHub profile! I am Sarthak Sharma, a dedicated software developer with a keen interest in Android development, machine learning, and contributing to open-source projects. Currently, I am pursuing my Master's in Computer Science and Engineering at IIIT Delhi. I am passionate about leveraging technology to innovate and solve real-world problems. My journey in the tech world has been shaped by hands-on experience in developing diverse projects that span from multimodal data analysis to Android application development and machine learning model optimization.

Contact

Feel free to reach out to me for collaboration, questions, or just to say hi!

Email LinkedIn GitHub

Key Details

  • Name: Sarthak Sharma
  • Location: [New Delhi, India]
  • Email: [email protected]
  • Education: MTech. CSE, Indraprastha Institute of Information Technology, Delhi (2023 - 2025)

Skills

Python C++ Java Kotlin JavaScript SQL AWS Docker TensorFlow PyTorch

Publications

1. "AVR: Synergizing Foundation Models for Audio-Visual Humor Detection"

  • Conference Name: Interspeech 2024
  • Authors: Sarthak Sharma*, Orchid Chetia Phukan*, Drishti Singh, Arun Balaji Buduru, Rajesh Sharma
  • Repository: Arxiv Link

Projects

1. Multimodal Humor Detection (Accepted in Interspeech 2024 Demonstration Track)

  • Description:
    • Ensembles the power of Audio Visual Data to generate humorous notions for the machine to understand.
    • Utilizes Pre-Trained AST (Audio Spectogram Transformer) and VideoMAE models for embedding generation.
    • Develops a CNN-based fusion model using TensorFlow and PyTorch.
    • LanguageBind model used as an alternate approach but less accurate than VideoMAE and AST models.
    • Experimented with several CNN and LSTM models, deployed the most accurate one over a Tkinter application.
  • Technologies: TensorFlow, PyTorch, Tkinter
  • Repository: GitHub Link

2. Estimated Time of Arrival Prediction using Doordash Dataset

  • Description:
    • Leverages the power of data-driven insights to predict accurate delivery ETA (Estimated Time of Arrival) using DoorDash dataset present on Kaggle.
    • Utilized the attributes already present in the dataset and applied feature engineering to get novel attributes to train the model instead of feeding the initial data.
    • Experimented with various Machine Learning techniques such as XGBoost, Random Forest Regressor, LightGBM, and among them XGBoost tends to be better among these regressors.
    • Metrics used were Mean Absolute Error(MAE), Mean Squared Error(MSE)
  • Technologies: PyTorch, sklearn
  • Repository: GitHub Link

3. TravelVista: Personalized Travel Planner

  • Description:
    • Developed a personalized travel web application for tailored travel plans.
    • Augmented City and Hotel Datasets for better model comprehension.
    • Explored RAG, Langchain, and other LLMs for relevant travel suggestions.
  • Technologies: Python, Information Retrieval, LLM, NodeJS
  • Repository: GitHub Link

4. Linux Shell in C/C++

  • Description:
    • Implemented Linux commands such as cd, mv, rm, cp, and ls in C++.
    • Added multithreading for recursive options, measured performance improvements.
  • Technologies: C/C++
  • Repository: GitHub Link

5. Resume Parser

  • Description:
    • Developed a Resume Parser to extract key entities from resumes.
    • Utilized BERT for embedding extraction and custom models for entity identification.
    • Deployed using Flask for functional information extraction.
  • Technologies: Python, Information Extraction, BERT, Flask
  • Repository: GitHub Link

6. LLM Fine Tuning for Text Summarization

  • Description:
    • Fine-tuned GPT2 and T5 pre-trained models for text summarization.
    • Used Amazon Fine Food Reviews Dataset for context-based fine-tuning.
    • Achieved significant results with T5 Model for shorter reviews.
  • Technologies: TensorFlow, PyTorch, Python, T5 Model, GPT2 Model
  • Repository: GitHub Link

Thank you for visiting my profile! Let's build something amazing together.

Popular repositories Loading

  1. Library_System Library_System Public

    C++

  2. Text_Summarizer Text_Summarizer Public

    Jupyter Notebook

  3. Windows-Form-Application Windows-Form-Application Public

    Visual Basic .NET

  4. MT23083 MT23083 Public

  5. TravelVista TravelVista Public

    Jupyter Notebook

  6. Multimodal_AVR Multimodal_AVR Public

    Jupyter Notebook