Sarthak Sharma MT23083

Sarthak Sharma

Profile

Welcome to my GitHub profile! I am Sarthak Sharma, a dedicated software developer with a keen interest in Android development, machine learning, and contributing to open-source projects. Currently, I am pursuing my Master's in Computer Science and Engineering at IIIT Delhi. I am passionate about leveraging technology to innovate and solve real-world problems. My journey in the tech world has been shaped by hands-on experience in developing diverse projects that span from multimodal data analysis to Android application development and machine learning model optimization.

Contact

Feel free to reach out to me for collaboration, questions, or just to say hi!

Key Details

Name: Sarthak Sharma
Location: [New Delhi, India]
Email: [email protected]
Education: MTech. CSE, Indraprastha Institute of Information Technology, Delhi (2023 - 2025)

Skills

Publications

1. "AVR: Synergizing Foundation Models for Audio-Visual Humor Detection"

Conference Name: Interspeech 2024
Authors: Sarthak Sharma*, Orchid Chetia Phukan*, Drishti Singh, Arun Balaji Buduru, Rajesh Sharma
Repository: Arxiv Link

Projects

1. Multimodal Humor Detection (Accepted in Interspeech 2024 Demonstration Track)

Description:
- Ensembles the power of Audio Visual Data to generate humorous notions for the machine to understand.
- Utilizes Pre-Trained AST (Audio Spectogram Transformer) and VideoMAE models for embedding generation.
- Develops a CNN-based fusion model using TensorFlow and PyTorch.
- LanguageBind model used as an alternate approach but less accurate than VideoMAE and AST models.
- Experimented with several CNN and LSTM models, deployed the most accurate one over a Tkinter application.
Technologies: TensorFlow, PyTorch, Tkinter
Repository: GitHub Link

2. Estimated Time of Arrival Prediction using Doordash Dataset

Description:
- Leverages the power of data-driven insights to predict accurate delivery ETA (Estimated Time of Arrival) using DoorDash dataset present on Kaggle.
- Utilized the attributes already present in the dataset and applied feature engineering to get novel attributes to train the model instead of feeding the initial data.
- Experimented with various Machine Learning techniques such as XGBoost, Random Forest Regressor, LightGBM, and among them XGBoost tends to be better among these regressors.
- Metrics used were Mean Absolute Error(MAE), Mean Squared Error(MSE)
Technologies: PyTorch, sklearn
Repository: GitHub Link

3. TravelVista: Personalized Travel Planner

Description:
- Developed a personalized travel web application for tailored travel plans.
- Augmented City and Hotel Datasets for better model comprehension.
- Explored RAG, Langchain, and other LLMs for relevant travel suggestions.
Technologies: Python, Information Retrieval, LLM, NodeJS
Repository: GitHub Link

4. Linux Shell in C/C++

Description:
- Implemented Linux commands such as cd, mv, rm, cp, and ls in C++.
- Added multithreading for recursive options, measured performance improvements.
Technologies: C/C++
Repository: GitHub Link

5. Resume Parser

Description:
- Developed a Resume Parser to extract key entities from resumes.
- Utilized BERT for embedding extraction and custom models for entity identification.
- Deployed using Flask for functional information extraction.
Technologies: Python, Information Extraction, BERT, Flask
Repository: GitHub Link

6. LLM Fine Tuning for Text Summarization

Description:
- Fine-tuned GPT2 and T5 pre-trained models for text summarization.
- Used Amazon Fine Food Reviews Dataset for context-based fine-tuning.
- Achieved significant results with T5 Model for shorter reviews.
Technologies: TensorFlow, PyTorch, Python, T5 Model, GPT2 Model
Repository: GitHub Link

Thank you for visiting my profile! Let's build something amazing together.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly