Confidence-Based Cheat Detection Through Constrained Order Inference of Temporal Sequences
Department
Computer Science
Document Type
Article
Publication Date
6-1-2023
Abstract
For particular domains, duplication may be indicative of cheating or an adversarial act intended to skew data. For Sony's PlayStation Network (PSN) that services the world's most popular gaming platform, we observe cheating through duplication of user data in the context of trophies/achievements. This particular domain is representative of the challenges of increasingly prevalent temporal data, where conventional similarity and distance-based deduplication techniques struggle in the context of deduplication. We leverage the Adaptive Sorted Neighborhood Method (ASNM) for temporal domains by applying ASNM, inferring attribute metadata, and performing inference of temporal ordering requirements using subsequence discovery techniques Longest Common Subsequence (LCS) and Needleman-Wunsch (NW). For records of a shared type, we split each record's time-ordered events into constrained and unconstrained sequences. Through both a binary classification and confidence-based approach, we indicate suspicious (errant) records that do not adhere to the inferred constrained order and may indicate a record as a duplicate if its unconstrained order matches that of another record. ASNM, ASNM+LCS and ASNM+NW were evaluated against a labeled dataset of 22,794 records from PSN trophy data where duplication may be indicative of cheating. ASNM+LCS resulted in an F1 of 0.949 using the confidence-based approach, outperforming ASNM and ASNM+NW. ASNM's best performance was an F1 of 0.708 at the 0.99 similarity threshold; ASNM+NW's best performance was an F1 of 0.942 using the confidence-based approach. The significant performance improvement costs little overhead as ASNM+LCS and ASNM+NW averaged only 3.79% and 5.75% additional runtime, respectively.
Journal Title
International Journal of Semantic Computing
Journal ISSN
1793351X
Volume
17
Issue
2
First Page
223
Last Page
247
Digital Object Identifier (DOI)
10.1142/S1793351X23600036