This repository is part of the machine learning division repository, which contains datasets, models, and the results of the trained models.
The KidLink Dataset is composed of publicly accessible datasets available for personal use, except for academic paper publications. These datasets include:
- Indonesian Hate Speech by Ika Alfina
- Sentimen Analisis Bahasa Indonesia by Rizal Setya Perdana
- Hate Speech and Abusive Language by Okky Ibrohim
- Kalimat Kasar Bahasa Indonesia on Kaggle
- Indonesia Twitter Comments on Kaggle
Before using this model, make sure you have the following software installed:
- Python version 3.10.12
- The Python packages listed in the
requirements.txt
file
- Clone this repository to your local device:
git clone https://github.com/kidlink-bangkit/capstone-nlp-project.git
- Go to the repository directory:
cd casptone-nlp-project
- (Optional) Create and activate the virtual environment:
python3 -m venv env
source env/bin/activate
- Install the required packages:
pip install -r requirements.txt