GitHub - ishitapaliwal03/ECommerce_Transactions_Dataset_Analysis

eCommerce Transactions Data Science Assignment

This repository contains the solution to the Data Science Intern Assignment involving an eCommerce transactions dataset. The project includes exploratory data analysis (EDA), a lookalike model, and customer segmentation tasks.

Project Structure

The project is divided into the following tasks:

1. Exploratory Data Analysis (EDA)

Objective: Analyze the provided dataset to extract meaningful business insights.
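As an illustration of the kind of aggregation this task involves, here is a minimal sketch, not the analysis from the notebook. It uses a tiny made-up sample with the documented Transactions.csv columns and computes monthly revenue and per-customer spend. The sample values and the variable names `monthly_revenue` and `per_customer` are assumptions for illustration only.

```python
import pandas as pd

# Made-up sample rows using the documented Transactions.csv columns (not real data).
transactions = pd.DataFrame({
    "TransactionID": ["T1", "T2", "T3", "T4"],
    "CustomerID": ["C0001", "C0002", "C0001", "C0003"],
    "TransactionDate": pd.to_datetime(
        ["2024-01-05", "2024-01-20", "2024-02-03", "2024-02-17"]
    ),
    "TotalValue": [100.0, 250.0, 50.0, 25.0],
})

# Total revenue per calendar month.
monthly_revenue = (
    transactions
    .groupby(transactions["TransactionDate"].dt.to_period("M"))["TotalValue"]
    .sum()
)

# Total spend and number of transactions per customer.
per_customer = transactions.groupby("CustomerID")["TotalValue"].agg(["sum", "count"])

print(monthly_revenue)
print(per_customer)
```

The same pattern extends to region or category breakdowns once the transactions are joined with the customer and product tables.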

2. Lookalike Model

Objective: Build a model to recommend 3 similar customers for the first 20 customers (C0001–C0020) based on their profile and transaction history.
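One common way to approach this objective is cosine similarity over standardized per-customer features. The sketch below assumes that approach; the README does not state which similarity measure the notebook uses. The feature names (`total_spend`, `n_transactions`, `avg_quantity`), the sample values, and the helper `top_lookalikes` are hypothetical.

```python
import pandas as pd
from sklearn.metrics.pairwise import cosine_similarity
from sklearn.preprocessing import StandardScaler

# Hypothetical per-customer features (made-up values, not the real data).
features = pd.DataFrame(
    {
        "total_spend": [500.0, 520.0, 80.0, 90.0, 300.0],
        "n_transactions": [10, 11, 2, 2, 6],
        "avg_quantity": [2.0, 2.1, 1.0, 1.1, 1.5],
    },
    index=["C0001", "C0002", "C0003", "C0004", "C0005"],
)

def top_lookalikes(features, customer_ids, k=3):
    """Return the k most similar customers (ID, score) for each requested ID."""
    # Standardize so no single feature dominates the similarity.
    scaled = StandardScaler().fit_transform(features)
    sim = pd.DataFrame(
        cosine_similarity(scaled), index=features.index, columns=features.index
    )
    result = {}
    for cid in customer_ids:
        scores = sim.loc[cid].drop(cid)  # a customer is never its own lookalike
        best = scores.nlargest(k)
        result[cid] = [(other, round(float(s), 3)) for other, s in best.items()]
    return result

lookalikes = top_lookalikes(features, ["C0001", "C0003"])
for cid, matches in lookalikes.items():
    print(cid, matches)
```

For the assignment, `customer_ids` would be the first 20 customers, and the resulting pairs could be written out with `pandas.DataFrame.to_csv`.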

3. Customer Segmentation / Clustering

Objective: Segment customers using clustering techniques to group them based on profiles and transaction information.
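A minimal sketch of one way to do this, assuming K-Means and the Davies-Bouldin index as the evaluation metric. The README does not say which clustering algorithm the notebook uses, so K-Means here is an assumption. The data is synthetic (two columns standing in for total spend and transaction count), and `db_by_k` and `best_k` are illustrative names.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import davies_bouldin_score
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-in for customer features: three loose groups of 30 customers.
X = np.vstack([
    rng.normal(loc=[100, 2], scale=[10, 0.5], size=(30, 2)),
    rng.normal(loc=[500, 8], scale=[30, 1.0], size=(30, 2)),
    rng.normal(loc=[1000, 15], scale=[50, 1.5], size=(30, 2)),
])
X_scaled = StandardScaler().fit_transform(X)

# Fit K-Means for several cluster counts and record the Davies-Bouldin index.
db_by_k = {}
for k in range(2, 7):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X_scaled)
    db_by_k[k] = davies_bouldin_score(X_scaled, labels)

# Lower Davies-Bouldin values indicate more compact, better-separated clusters.
best_k = min(db_by_k, key=db_by_k.get)
print(db_by_k)
print("lowest DB index at k =", best_k)
```

Scaling the features first matters because K-Means relies on Euclidean distance, so a large-valued column such as total spend would otherwise dominate.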

Dataset Description

The dataset consists of three files:

Customers.csv
Contains customer profiles, including:
- CustomerID: Unique identifier for each customer.
- CustomerName: Name of the customer.
- Region: Customer's continent.
- SignupDate: Date when the customer signed up.

Products.csv
Contains product details, including:
- ProductID: Unique product identifier.
- ProductName: Name of the product.
- Category: Product category.
- Price: Price in USD.

Transactions.csv
Contains transaction details, including:
- TransactionID: Unique transaction identifier.
- CustomerID: ID of the customer making the transaction.
- ProductID: ID of the purchased product.
- TransactionDate: Date of the transaction.
- Quantity: Quantity purchased.
- TotalValue: Total value of the transaction.
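The three files share the CustomerID and ProductID keys, so they can be joined into one row per transaction. The sketch below shows one way to do that with pandas. The helper `load_dataset` is hypothetical, and the in-memory CSV text is made-up sample data that stands in for the real files so the example runs on its own; in practice you would pass the file paths instead.

```python
import io
import pandas as pd

def load_dataset(customers_src, products_src, transactions_src):
    """Read the three files and join them into one row per transaction."""
    customers = pd.read_csv(customers_src, parse_dates=["SignupDate"])
    products = pd.read_csv(products_src)
    transactions = pd.read_csv(transactions_src, parse_dates=["TransactionDate"])
    # Left joins keep every transaction even if a key has no match.
    return (
        transactions
        .merge(customers, on="CustomerID", how="left")
        .merge(products, on="ProductID", how="left")
    )

# Made-up stand-ins for Customers.csv, Products.csv, and Transactions.csv.
customers_csv = io.StringIO(
    "CustomerID,CustomerName,Region,SignupDate\n"
    "C0001,Alice Example,Asia,2023-01-15\n"
)
products_csv = io.StringIO(
    "ProductID,ProductName,Category,Price\n"
    "P001,Sample Book,Books,12.50\n"
)
transactions_csv = io.StringIO(
    "TransactionID,CustomerID,ProductID,TransactionDate,Quantity,TotalValue\n"
    "T00001,C0001,P001,2024-03-01,2,25.00\n"
)

df = load_dataset(customers_csv, products_csv, transactions_csv)
print(df[["TransactionID", "Region", "Category", "TotalValue"]])
```

With the real data, the call would look like `load_dataset("Customers.csv", "Products.csv", "Transactions.csv")`, assuming the files sit in the working directory.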

Getting Started

Prerequisites

Python 3.x
Required libraries:
- pandas
- numpy
- scikit-learn
- matplotlib
- seaborn

Setup

Clone this repository:
```
git clone <repository_url>
```
Install the required libraries:
```
pip install pandas numpy scikit-learn matplotlib seaborn
```

Running the Notebooks

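Open the notebooks (Ishita_Paliwal_EDA.ipynb, ishita_paliwal_Lookalike.ipynb, and Ishita_Paliwal_Clustering.ipynb) in Jupyter Notebook or a compatible IDE.
Run the cells in order, following the instructions in each notebook, to reproduce the results.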

Results

EDA

The EDA results are documented in Ishita_Paliwal_EDA.pdf, highlighting key business insights derived from the dataset.

Lookalike Model

The top 3 similar customers recommended for each of the first 20 customers (C0001–C0020) are saved in Ishita_Paliwal_Lookalike.csv.

Clustering

The clustering results, along with the Davies-Bouldin (DB) Index value and visualizations, are detailed in Ishita_Paliwal_Clustering.pdf.

Repository Files

- Ishita_Paliwal_Clustering.ipynb
- Ishita_Paliwal_Clustering.pdf
- Ishita_Paliwal_EDA.ipynb
- Ishita_Paliwal_EDA.pdf
- Ishita_Paliwal_Lookalike.csv
- README.md
- ishita_paliwal_Lookalike.ipynb
