IGD and Cyberbullying Detection: A Deep Learning Approach

This repository contains the source code and datasets used for the research paper titled "A Novel Machine Learning & Deep Learning Approach Based Dataset Efficacy Study in Predicting Mental Health Outcomes from Internet Gaming Disorder and Cyberbullying" [https://doi.org/10.6084/m9.figshare.27266961].

Overview

This repository provides code for predicting mental health outcomes associated with Internet Gaming Disorder (IGD) and cyberbullying using machine learning and deep learning models. The implemented models include Logistic Regression, Random Forest, ensemble models, CNNs, and LSTMs, which are trained to detect patterns in behavioral data.
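This README does not say how the ensemble combines its member models. As a rough illustration only, the sketch below assumes hard (majority) voting over the class labels predicted by several classifiers. The function name `majority_vote` and the toy predictions are hypothetical and are not taken from the notebooks.

```python
from collections import Counter

def majority_vote(per_model_predictions):
    """Combine label predictions from several models by hard voting.

    per_model_predictions: one list of predicted labels per model,
    all of the same length (one label per sample).
    """
    combined = []
    # zip(*...) groups the predictions that all models made for one sample.
    for votes in zip(*per_model_predictions):
        # most_common(1) returns the label with the highest vote count.
        label, _count = Counter(votes).most_common(1)[0]
        combined.append(label)
    return combined

# Toy predictions from three hypothetical models on four samples.
toy_predictions = [
    [1, 0, 0, 1],  # e.g. a logistic regression model
    [1, 1, 0, 0],  # e.g. a random forest
    [0, 1, 0, 1],  # e.g. a CNN
]
print(majority_vote(toy_predictions))
```

With an odd number of binary classifiers there are no ties; other ensemble strategies (soft voting, stacking) would need a different combination step.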

Requirements

To run this code, you'll need the following dependencies:

- Python 3.x
- TensorFlow
- scikit-learn
- pandas
- numpy
- matplotlib
- imbalanced-learn

You can install the required dependencies using the provided requirements.txt file.

pip install -r requirements.txt
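Before opening the notebooks, it can help to confirm that the packages listed above are importable. The following is a minimal sketch using only the standard library; the helper name `find_missing` is hypothetical. Note that scikit-learn is imported as `sklearn` and imbalanced-learn as `imblearn`.

```python
from importlib.util import find_spec

# Map each package listed in Requirements to its import name.
REQUIRED_MODULES = {
    "TensorFlow": "tensorflow",
    "scikit-learn": "sklearn",
    "pandas": "pandas",
    "numpy": "numpy",
    "matplotlib": "matplotlib",
    "imbalanced-learn": "imblearn",
}

def find_missing(modules=REQUIRED_MODULES):
    """Return the package names whose import module cannot be found."""
    return [pkg for pkg, mod in modules.items() if find_spec(mod) is None]

if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are installed.")
```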

Datasets

The repository contains preprocessed datasets for both Cyberbullying detection and IGD. The following datasets are included:

Cyberbullying Datasets:

- aggression_parsed_dataset.csv
- attack_parsed_dataset.csv
- kaggle_parsed_dataset.csv
- toxicity_parsed_dataset.csv
- twitter_parsed_dataset.csv
- twitter_racism_parsed_dataset.csv
- twitter_sexism_parsed_dataset.csv
- youtube_parsed_dataset.csv

IGD Dataset:

- GamingStudy_data.csv: user behavior data used for predicting Internet Gaming Disorder.

Installation

Clone the repository:

git clone https://github.com/BryanSJamesDev/IGD-cyberbullying-detection-AI
cd IGD-cyberbullying-detection-AI

Install the required dependencies:
```
pip install -r requirements.txt
```

Running the Code

Cyberbullying Prediction:
- Open the Cyberbullying.ipynb notebook and run the cells in order. They train and evaluate the deep learning models on the provided datasets.

Internet Gaming Disorder Prediction:
- Open the Gamestudy.ipynb notebook and run the cells in order. They analyze the IGD data using models such as LSTM and CNN to detect patterns in gaming behavior.

Datasets Structure

Place the datasets in the data/ directory before running the code. The default path for loading datasets is set to this folder.

Example (launching a notebook):

jupyter notebook Cyberbullying.ipynb
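To check that the files are where the notebooks expect them, a small script like the one below can list any dataset that is absent from data/. It is a sketch only: the helper name `missing_datasets` is hypothetical, and the file names are copied from the Datasets section above.

```python
from pathlib import Path

# File names as listed in the Datasets section of this README.
EXPECTED_FILES = [
    "aggression_parsed_dataset.csv",
    "attack_parsed_dataset.csv",
    "kaggle_parsed_dataset.csv",
    "toxicity_parsed_dataset.csv",
    "twitter_parsed_dataset.csv",
    "twitter_racism_parsed_dataset.csv",
    "twitter_sexism_parsed_dataset.csv",
    "youtube_parsed_dataset.csv",
    "GamingStudy_data.csv",
]

def missing_datasets(data_dir="data", expected=EXPECTED_FILES):
    """Return the expected file names that are not present in data_dir."""
    root = Path(data_dir)
    return [name for name in expected if not (root / name).is_file()]

if __name__ == "__main__":
    missing = missing_datasets()
    if missing:
        print("Not found in data/:")
        for name in missing:
            print("  " + name)
    else:
        print("All datasets found in data/.")
```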

Observed Results

Cyberbullying Detection:
- The CNN and Random Forest models achieve accuracies of roughly 91% to 93%.
- The ensemble model performs best, with an accuracy of 93%.

IGD Detection:
- The LSTM model achieves 91.6% accuracy in detecting IGD from gaming behavioral data.

The notebooks output model performance metrics, including confusion matrices, precision, recall, F1-scores, and accuracy.
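For readers unfamiliar with these metrics, the sketch below shows how they relate to one another for a binary classifier. It uses plain Python and made-up labels, not results from this repository; the function name `binary_metrics` is hypothetical. scikit-learn offers equivalent functions in `sklearn.metrics`, though this README does not state which calls the notebooks use.

```python
def binary_metrics(y_true, y_pred):
    """Compute confusion-matrix counts and derived metrics for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    # Guard each ratio against a zero denominator.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0

    return {
        # Rows are true classes (0, 1); columns are predicted classes (0, 1).
        "confusion_matrix": [[tn, fp], [fn, tp]],
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "accuracy": accuracy,
    }

# Made-up labels for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
print(binary_metrics(y_true, y_pred))
```

On imbalanced data, accuracy alone can be misleading, which is why precision, recall, and F1 are reported alongside it.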
