ENGtoID
English to Indonesian seq2seq translation model implementation for the final 2024 ICT303 assignment.
TLDR Summary
This project contains code which fetches a dataset, filters out bad data, tokenizes the data, saves language token vocabularies, then trains an LSTM seq2seq model on the data with optional teacher forcing.
You may view the final notebook here
-
Notifications
You must be signed in to change notification settings - Fork 0
daverlon/ENGtoID
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
ENGtoID Seq2Seq LSTM Translation Model for ICT303
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published