That’s what the data said:
An NLP Analysis of Script Lines from the US TV-Show "The Office"

Project Goal

Our objective is to apply various traditional and methorn methods of NLP in order to gain interesting insights into the show and its characters by only looking at "what the data says". More specific, we analyze characters, relationships, sentiments and topics to identify speaking styles and developments. We want to provide additional insights both for fans and for people who did not watch the show.

Find our used data here.

This repository also contains scripts to train models to generate scenes (such as the scene above) and to classify the speaker of a line.

Use our models

We uploaded the fine-tuned models to HuggingFace to make them easy accessible for everyone. There you can find the Speaker Classification and Scene Generation models and directly test them via Inference API.

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
data		data
notebooks		notebooks
plots		plots
slides		slides
NLP_The_Office_Paper.pdf		NLP_The_Office_Paper.pdf
README.md		README.md
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

That’s what the data said:
An NLP Analysis of Script Lines from the US TV-Show "The Office"

Project Goal

Use our models

Read more in our blog articles

About

Releases

Packages

Contributors 3

Languages

timo282/NLP-The-Office

Folders and files

Latest commit

History

Repository files navigation

That’s what the data said: An NLP Analysis of Script Lines from the US TV-Show "The Office"

Project Goal

Use our models

Read more in our blog articles

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

That’s what the data said:
An NLP Analysis of Script Lines from the US TV-Show "The Office"

Packages