-
Notifications
You must be signed in to change notification settings - Fork 84
Results: Model Performances in GPML
- Number of vertices (accounts): 27,770
- Alert patterns: 100 triangles as fraud patterns and 1000 fan-in/out patterns with 5 vertices as false alerts
-
Sparse (56,021 transactions)
-
Medium (96,463 transactions)
-
Dense (160,970 transactions)
-
239,385 transactions
-
276,359 transactions
In dense transaction networks, the combinations which yield the largest precision is undirected cycle feature, but the recall drops to around 0.2. The best number of edges is around 160K.
- Number of vertices (accounts): 27,770
- Number of edges (transactions): 160K
- Fraud pattern: Triangle (3-length cycle)
- False alert pattern: Fan-in/out with 5 accounts
-
50 fraud patterns (500 false alerts)
-
100 fraud patterns (1,000 false alerts)
-
200 fraud patterns (2,000 false alerts)
The recall values are more stable with more fraud patterns (more than 0.8 with 200 fraud patterns). The graph feature combinations of undirected cycles and egonet gain the precision and F1-score.
- Number of vertices (accounts): 27,770
- Number of edges (transactions): 160K
-
Triangle (3-length cycles)
-
Square (4-length cycles)
-
Pentagon (5-length cycles)
-
Triangle
-
Square
-
Pentagon
The recall values are stable with the triangle fraud patterns. With the square and pentagon fraud patterns, the recall values drop to 0.4-0.5 for some features.
- Number of vertices (accounts): 27,770
- Number of edges (transactions): 160K
- False alert patterns: 1000 fan-in/out patterns with 3 vertices
-
100 Pentagons (5-length cycles)
-
100 Fan-in/out (5 vertices)
-
200 Pentagons (5-length cycles)
-
200 Fan-in/out (5 vertices)
In fan-in/out fraud patterns, there are trade-offs between precision and recall values, and recall values dropped to 0.2-0.4 in some feature combinations.