Presenters

Khoa NguyenFollow

Disciplines

Artificial Intelligence and Robotics | Programming Languages and Compilers

Abstract (300 words maximum)

Recently, the amount of encrypted malicious network traffic masquerading as normal traffic of data has increased greatly. This poses a concern for the user’s security and privacy. Moreover, malicious traffic rates have been reported to skyrocket during the COVID-19 pandemic. Therefore, we should adopt new methods to tackle such unpleasant traffic detection problems as soon as possible.

Regular security solutions depending on common analysis like deep packet inspection have been proven to be less effective while detecting malware using Artificial Intelligence (AI)–based solutions are becoming more popular. These solutions are believed to be less expensive, faster, and more secure since no traffic interceptor is required. Thus, the target of this research is to detect malware traffic flows by extracting new features from multiple popular public sources with well-known machine-learning and deep-learning algorithms such as KNN, Random Forest, and CNNs. These are among the best artificial intelligence algorithms that are expected to produce high (as high as 95%) malware detection rates. The system first extracts relevant features including packet count, size, and protocol type. They are inserted into machine-learning and deep-learning models for detection. The models are trained on a large dataset mixed with benign and malicious traffic to accurately detect the encrypted malicious traffic flows.

The conclusion discusses the malicious network traffic detection rates of different feature sets tested by multiple machine learning and deep learning algorithms and the challenges that might occur in the process, including the need for high-quality training data and the possibility of encountering false positives and false negatives. Further research in this area will emphasize improving the model’s detection rates and addressing these challenges.

Academic department under which the project should be listed

CCSE - Computer Science

Loading...

Media is loading
 

Primary Investigator (PI) Name

Liang Zhao

Share

COinS
 

Encrypted Malicious Network Traffic Detection using Machine Learning and Deep Learning

Recently, the amount of encrypted malicious network traffic masquerading as normal traffic of data has increased greatly. This poses a concern for the user’s security and privacy. Moreover, malicious traffic rates have been reported to skyrocket during the COVID-19 pandemic. Therefore, we should adopt new methods to tackle such unpleasant traffic detection problems as soon as possible.

Regular security solutions depending on common analysis like deep packet inspection have been proven to be less effective while detecting malware using Artificial Intelligence (AI)–based solutions are becoming more popular. These solutions are believed to be less expensive, faster, and more secure since no traffic interceptor is required. Thus, the target of this research is to detect malware traffic flows by extracting new features from multiple popular public sources with well-known machine-learning and deep-learning algorithms such as KNN, Random Forest, and CNNs. These are among the best artificial intelligence algorithms that are expected to produce high (as high as 95%) malware detection rates. The system first extracts relevant features including packet count, size, and protocol type. They are inserted into machine-learning and deep-learning models for detection. The models are trained on a large dataset mixed with benign and malicious traffic to accurately detect the encrypted malicious traffic flows.

The conclusion discusses the malicious network traffic detection rates of different feature sets tested by multiple machine learning and deep learning algorithms and the challenges that might occur in the process, including the need for high-quality training data and the possibility of encountering false positives and false negatives. Further research in this area will emphasize improving the model’s detection rates and addressing these challenges.