An LLM cookbook for building your own from scratch, all the way from gathering data to training a model
This repository features a custom-built decoder-only large language model (LLM) with a total of 37 million parameters 🔥. The model is trained to ask questions from a given context.
An experimental AI and NLP project based on the Transformer architecture
Transformers Intuition
Generate captions for images using a CNN encoder and LSTM decoder structure
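For readers new to this pattern, here is a minimal sketch of a CNN encoder / LSTM decoder captioner in PyTorch. The backbone choice (ResNet-18), dimensions, and names are illustrative assumptions, not code from the repository above.

```python
import torch
import torch.nn as nn
from torchvision import models

class CaptionModel(nn.Module):
    def __init__(self, vocab_size, embed_dim=256, hidden_dim=512):
        super().__init__()
        # CNN encoder: a ResNet-18 backbone (weights omitted for a
        # self-contained demo) with its classifier head replaced by a
        # projection into the embedding space.
        backbone = models.resnet18(weights=None)
        backbone.fc = nn.Linear(backbone.fc.in_features, embed_dim)
        self.encoder = backbone
        # LSTM decoder: conditioned on the image feature, which is fed
        # as the first "token" of the sequence.
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, vocab_size)

    def forward(self, images, captions):
        feats = self.encoder(images).unsqueeze(1)   # (B, 1, E)
        tokens = self.embed(captions)               # (B, T, E)
        inputs = torch.cat([feats, tokens], dim=1)  # prepend image feature
        out, _ = self.lstm(inputs)
        return self.head(out)                       # (B, T+1, vocab)

# Usage: logits for a batch of 2 images and 10-token captions.
model = CaptionModel(vocab_size=1000)
logits = model(torch.randn(2, 3, 224, 224), torch.randint(0, 1000, (2, 10)))
print(logits.shape)  # torch.Size([2, 11, 1000])
```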
Implementation of the GPT-3 paper: Language Models are Few-Shot Learners
An LLM-based tool for generating cheese advertisements
DNA sequence generation and classification using transformers
This project aims to simplify texts from research papers using advanced natural language processing (NLP) techniques, making them more accessible to a broader audience
An explainable and simplified version of the OLMo model
Code and dataset used to train dialect adapters for decoder models.
A multimodal vision model that takes in an image and a prompt query, and outputs the answer
A text summarizer for the Arabic language
Decoder model for language modelling
A decoder-based semantic parser that can be tested on four benchmark datasets (ATIS, GeoQuery, Jobs640, and Django)
A miniGPT inspired by the original nanoGPT released by Andrej Karpathy. This notebook walks through the decoder part of the transformer architecture, with the details outlined.
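Since the notebook above walks through the transformer's decoder, a minimal sketch of its core step, masked (causal) self-attention, may help orient readers. All dimensions and names here are illustrative assumptions, not the notebook's actual code.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalSelfAttention(nn.Module):
    def __init__(self, embed_dim=64, num_heads=4, max_len=128):
        super().__init__()
        self.num_heads = num_heads
        self.qkv = nn.Linear(embed_dim, 3 * embed_dim)
        self.proj = nn.Linear(embed_dim, embed_dim)
        # Lower-triangular mask: each position may attend only to itself
        # and earlier positions, which is what makes this a decoder block.
        mask = torch.tril(torch.ones(max_len, max_len)).bool()
        self.register_buffer("mask", mask)

    def forward(self, x):
        B, T, C = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Split into heads: (B, heads, T, head_dim).
        q, k, v = (t.view(B, T, self.num_heads, -1).transpose(1, 2)
                   for t in (q, k, v))
        att = (q @ k.transpose(-2, -1)) / math.sqrt(q.size(-1))
        att = att.masked_fill(~self.mask[:T, :T], float("-inf"))
        out = F.softmax(att, dim=-1) @ v
        out = out.transpose(1, 2).reshape(B, T, C)
        return self.proj(out)

# Usage: a batch of 2 sequences of length 16.
x = torch.randn(2, 16, 64)
print(CausalSelfAttention()(x).shape)  # torch.Size([2, 16, 64])
```

In a full miniGPT-style model, this attention layer is stacked with a feed-forward layer, residual connections, and layer normalization to form each decoder block.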