TY - GEN
T1 - Grooming detection using fuzzy-rough feature selection and text classification
AU - Zuo, Zheming
AU - Li, Jie
AU - Anderson, Philip
AU - Yang, Longzhi
AU - Naik, Nitin
PY - 2018/10/12
Y1 - 2018/10/12
N2 - Online child grooming detection has recently attracted intensive research interests from both the machine learning community and digital forensics community due to its great social impact. The existing data-driven approaches usually face the challenges of lack of training data and the uncertainty of classes in terms of the classification or decision boundary. This paper proposes a grooming detection approach in an effort to address such uncertainty based on a data set derived from a publicly available profiling data set. In particular, the approach firstly applies the conventional text feature extraction approach in identifying the most significant words in the data set. This is followed by the application of a fuzzy-rough feature selection approach in reducing the high dimensions of the selected words for fast processing, which at the same time addressing the uncertainty of class boundaries. The experimental results demonstrate the efficiency and efficacy of the proposed approach in detecting child grooming.
AB - Online child grooming detection has recently attracted intensive research interests from both the machine learning community and digital forensics community due to its great social impact. The existing data-driven approaches usually face the challenges of lack of training data and the uncertainty of classes in terms of the classification or decision boundary. This paper proposes a grooming detection approach in an effort to address such uncertainty based on a data set derived from a publicly available profiling data set. In particular, the approach firstly applies the conventional text feature extraction approach in identifying the most significant words in the data set. This is followed by the application of a fuzzy-rough feature selection approach in reducing the high dimensions of the selected words for fast processing, which at the same time addressing the uncertainty of class boundaries. The experimental results demonstrate the efficiency and efficacy of the proposed approach in detecting child grooming.
UR - http://www.scopus.com/inward/record.url?scp=85053663117&partnerID=8YFLogxK
UR - https://ieeexplore.ieee.org/document/8491591
U2 - 10.1109/FUZZ-IEEE.2018.8491591
DO - 10.1109/FUZZ-IEEE.2018.8491591
M3 - Conference publication
AN - SCOPUS:85053663117
T3 - IEEE International Conference on Fuzzy Systems
BT - 2018 IEEE International Conference on Fuzzy Systems, FUZZ 2018 - Proceedings
PB - IEEE
T2 - 2018 IEEE International Conference on Fuzzy Systems, FUZZ 2018
Y2 - 8 July 2018 through 13 July 2018
ER -