Creating a model using SPSS Modeler on DSX

Machine learning flow is a graphical representation of data, by using the Flow Editor to prepare or shape data, train or deploy a model, or transform data and export it back to a database table or file in object storage.

Technology: SPSS Modeler, feature selection, auto classifier, data audit, field operations, data visualization

Prerequisites

Sign up to IBM Data Science Experience (DSX): https://datascience.ibm.com/
Sign in to DSX
On DSX, create a new project
- Click Projects, and select View All Projects
- Click the New button to create a new project
- On the New Project page, input Bank Churn as the project name
- In the Target container field, input churn as the container name
- Click the Create button
Download dataset bank-churn.csv from Github
- Use a new browser tab to access dataset: https://github.com/mlhubca/lab/blob/master/bank-churn/bank-churn.csv
- Right-click the Raw button on the toolbar, and select Save Link As... or Save Content As... (depending on your browser)
Upload dataset bank-churn.csv to your project
- On DSX, open your project
- Click the Add to project dropdown and select Data asset from the dropdown menu
- On your right-hand panel, select the Load tab
- Drop file bank-churn.csv to the box or browse file bank-churn.csv and add the file to the project

Steps

Creating a new flow

Add a new flow using New flow button or from the "Add to project" dropdown, select "SPSS Modeler flow"
On the Create Flow page,
- Specify a name, e.g. Bank Churn Flow
- Select IBM SPSS Modeler Runtime
- Click "Create Flow"

Loading data

Drag and drop node bank-churn.csv from the Files list to the flow
Click Palette icon (first icon on the toolbar) to show node palette

Checking data quality

Add Data Audit node from the Outputs list on the palette
Connect file bank-churn.csv node to Data Audit node
Run Data Audit node to generate output

Filtering data

Add Filter node from the Field Operations list on the palette
Connect file bank-churn.csv node to Filter node
Open Filter node, select columns CUST_ID, TwitterID and CHURN_LABLE (to be filtered)

Setting metadata

Add node Type node from the Field Operations list on the palette
Connect file Filter node to Type node
Open Type node, add all columns to the Types list
Locate CHURN field, and
- Change Measure from Range to Flag
- Change Role from Input to Target

Selecting features

Add Feature Selection node from the Modeling list on the palette
Connect Feature Selection node to node "Type" (note that the Feature Selection node name is being changed to CHURN)
Run node CHURN (Feature Selection). When the execution completes, a new model node CHURN is created
Add Data Audit node from the Outputs list on the palette
Connect the new model node CHURN to Data Audit node
Run Data Audit node to generate output

Splitting data

Add Partition node from the Field Operations list on the palette
Connect the new model node CHURN to Partition node
Open Partition node, change the Training and Test partition to the ratio of 80/20.

Building model - Auto classifier

Add Auto Classifier node from the Modeling list on the palette
Connect Auto Classifier to node Partition(note that the Auto Classifier node name is being changed to CHURN automatically)
Run node CHURN (Auto Classifier). When the execution completes, a new model node CHURN is created automatically.

Aanlyzing model performance

Add Analysis node from the Outputs list on the palette
Connect the new model nodel CHURN to Analysis node
Run Analysis node to generate output

Executing the flow

Run the whole flow by clicking the Run button on the toolbar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spss-bank-churn.md

spss-bank-churn.md

Creating a model using SPSS Modeler on DSX

Prerequisites

Creating a new flow

Loading data

Checking data quality

Filtering data

Setting metadata

Selecting features

Splitting data

Building model - Auto classifier

Aanlyzing model performance

Executing the flow

Files

spss-bank-churn.md

Latest commit

History

spss-bank-churn.md

File metadata and controls

Creating a model using SPSS Modeler on DSX

Prerequisites

Creating a new flow

Loading data

Checking data quality

Filtering data

Setting metadata

Selecting features

Splitting data

Building model - Auto classifier

Aanlyzing model performance

Executing the flow