This repository contains the source code and datasets used for the research paper titled "A Novel Machine Learning & Deep Learning Approach Based Dataset Efficacy Study in Predicting Mental Health Outcomes from Internet Gaming Disorder and Cyberbullying". [https://doi.org/10.6084/m9.figshare.27266961]
This repository provides the code for predicting mental health outcomes associated with Internet Gaming Disorder (IGD) and Cyberbullying using machine learning and deep learning models. Models like Logistic Regression, Random Forest, Ensemble Models, CNNs, and LSTMs are implemented to detect patterns from behavioral data.
To run this code, you'll need the following dependencies:
- Python 3.x
- TensorFlow
- scikit-learn
- pandas
- numpy
- matplotlib
- imbalanced-learn
You can install the required dependencies using the provided requirements.txt file.
pip install -r requirements.txtThe repository contains preprocessed datasets for both Cyberbullying detection and IGD. The following datasets are included:
aggression_parsed_dataset.csvattack_parsed_dataset.csvkaggle_parsed_dataset.csvtoxicity_parsed_dataset.csvtwitter_parsed_dataset.csvtwitter_racism_parsed_dataset.csvtwitter_sexism_parsed_dataset.csvyoutube_parsed_dataset.csv
GamingStudy_data.csv(This is the dataset used for predicting Internet Gaming Disorder based on user behavior data.)
-
Clone the repository:
git clone https://github.com/BryanSJamesDev/IGD-cyberbullying-detection-AI cd IGD-cyberbullying-detection-AI -
Install the required dependencies:
pip install -r requirements.txt
-
Cyberbullying Prediction:
- Open the
Cyberbullying.ipynbnotebook and run the cells in order to train and evaluate the deep learning models on the provided datasets.
- Open the
-
Internet Gaming Disorder Prediction:
- Open the
Gamestudy.ipynbnotebook and run the cells to analyze IGD data using models like LSTM and CNN to detect patterns in gaming behavior.
- Open the
- Place the datasets in the
data/directory before running the code. The default path for loading datasets is set to this folder.
jupyter notebook Cyberbullying.ipynb-
Cyberbullying Detection:
- CNN and Random Forest models achieve accuracy of around 91% to 93%.
- The ensemble model yields the best performance with an accuracy of 93%.
-
IGD Detection:
- The LSTM model achieves 91.6% accuracy in detecting IGD from gaming behavioral data.
The notebooks will output the model performance metrics, including confusion matrices, precision, recall, F1-scores, and accuracy.