Date of Submission

Fall 12-16-2020

Degree Type

Thesis

Degree Name

Master of Science in Computer Science (MSCS)

Department

Computer Science

Committee Chair/First Advisor

Dr. Dan lo

Track

Big Data

Chair

Dr. Dan Lo

Committee Member

Dr. Yong Shi

Committee Member

Dr. Hossain Shahriar

Related Publications

Gardner, C, Lo, D(2019). Tiered Financial Fraud Detection Utilizing Precision Stratified Random Forest Assembly. In: 2019 IEEE 5th International Conference on Big Data Intelligence and Computing

Abstract

Imbalanced datasets have been a unique challenge for machine learning, requiring specialized approaches to correctly classify the minority class. Financial fraud detection involves using highly imbalanced datasets with a class imbalance of up to .01% frauds to 99.99% regular transactions. It is essential to identify all frauds in financial fraud detection, even if some classifications' precision is low. I developed a random forest assembly that separates fraudulent transactions into tiers of precision. With this approach, 96% of fraudulent transactions are identified, showing an 8% increase in recall when compared to standard approaches. 59% of fraud classifications' precision increases by 10% up to 98% by optimizing several random forests on different fitness functions. These models are then combined to act as a sieve with increasing tolerance for low precision classifications. The effectiveness of random forest for financial fraud detection is also improved through feature extraction techniques. Random forest is weak at detecting patterns between interdepended features. This problem is address through unsupervised feature extraction. I will demonstrate a new random forest architecture PCA-embedded random forest, which increased random forest performance.

Download

Included in

Data Science Commons

COinS

Master of Science in Computer Science Theses

Classifying Imbalanced Financial Fraud Data Utilizing Enhanced Random Forest Algorithm

Date of Submission

Degree Type

Degree Name

Department

Committee Chair/First Advisor

Track

Chair

Committee Member

Committee Member

Related Publications

Abstract

Included in

Search

Authors

Browse

Links

Useful Links

Master of Science in Computer Science Theses

Classifying Imbalanced Financial Fraud Data Utilizing Enhanced Random Forest Algorithm

Author

Date of Submission

Degree Type

Degree Name

Department

Committee Chair/First Advisor

Track

Chair

Committee Member

Committee Member

Related Publications

Abstract

Included in

Share

Search

Authors

Browse

Links

Useful Links