Support for specialized transcript notation styles (e.g. GAT 2 and/or Jefferson) #2732

zackbatist · 2025-01-13T16:18:10Z

zackbatist
Jan 13, 2025

First off, just want to express my gratitude to the developers and the community for supporting this project. It's made my work a lot easier. That being said, I think one way it could be even better is by implementing support for detailed transcript notation styles such as GAT 2 or the Jefferson system. These systems are really important in research involving transcribed interviews, especially conversation analysis, and I imagine some aspects of this can be easily automated using LLMs.

Here's a basic summary of some notations from the GAT 2 system that I frequently use in my work. I also included additional examples and information about implementation at the end of this post.

Notation	Description
(.)	A full stop inside brackets denotes a micro pause, a notable pause but of no significant length.
(0.2)	A number inside brackets denotes a timed pause. This is a pause long enough to time and subsequently show in transcription.
CAPITALS	Where capital letters appear, it denotes that something was said loudly or even shouted.
[ ]	Square brackets denote a point where overlapping speech occurs.
{ }	Underlined text where overlaid laughter occurs.
(( ))	Non-verbal vocal actions and events encased within two rounded brackets.
(unclear)	Intelligible or unclear speech are denoted with a "unclear" placed within rounded brackets.
--	Double hyphens, usually at the end of a word or line, indicate an abrupt cutoff.

I know that not all of these things can be reliably covered (non-verbal vocal actions such as coughing or laughter), but I imagine some can be, including calculating the duration of pauses, detecting loud or shouted speech, or detecting speaker overlap.

I did a fairly comprehensive search for resources on implementing LLMs to support this kind of annotation, which turned up nothing so far. So aside from simply serving as a feature request, maybe others can chime in with strategies they used to modify outputs to add additional details like the ones I describe here.

Overlaps and simultaneous speech

Opening square brackets are inserted at exactly the point in speaking where the overlap starts, and closing square brackets, where it ends. In both Jefferson and GAT, the respective brackets are aligned with each other within the text. Note that the exact alignment is difficult to represent in markdown, so appears unaligned here.

Subject 1: Are you going too?

Subject 2: No, I have to [work.

Subject 1: How about a] drink to celebrate [the day?

Subject 2: That] would be great.

Laughter

With "ha-ha laughter" the approximate number and phonetic laughter syllables are transcribed, i.e. HA HA HA HA. With overlaid laughter, this is represented through annotation conventions, such as curly brackets (as in the following example).

Subject 1: What do you do?

Subject 2: HA HA HA HA HA AHH

Subject 1: I want to know, what do you do?

Subject 2: {Transcribe music.} Read books. {Swim at the river. Go out at night.}

Non-verbal vocal actions and events

Non-verbal vocal actions and events are denoted with two rounded brackets (( )). If the non-verbal action cannot be attributed to any one speaker the notion is entered as a new line in the transcript with its own timestamp.

Subject 1: Hello ((coughs)) I am ready.

((recording device beeps))

Subject 2: Great.

Intelligibility

Intelligible or unclear speech are denoted with a "unclear" placed within rounded brackets, (unclear). GAT 2 has suggestions for uncertainties/alternatives in speech, however adding in assumptions may lead to bias.

Subject 1: Are you sleeping?

Subject 2: (unclear) I was.

Subject 1: Oh never mind then.

zackbatist · 2025-01-13T16:49:45Z

zackbatist
Jan 13, 2025
Author

I just found out about GailBot which is a project-in-progress meant to facilitate generation of Jefferson style annotated transcripts. It's not open source but I submitted a license request.

0 replies

FoxyLevin · 2025-01-26T02:17:59Z

FoxyLevin
Jan 26, 2025

How did it work out, any news on GAT 2 or could you get a license for Gailbot? A friend (social scientific researcher) had an injury and thus needs an quite similar solution..

2 replies

zackbatist Jan 26, 2025
Author

I submitted a request on the Gailbot website but haven't heard back yet. I haven't found any other great alternatives either, unfortunately. Sorry I can't be of more help to you or your friend.

FoxyLevin Jan 27, 2025

Let us know if something comes up.. thx!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for specialized transcript notation styles (e.g. GAT 2 and/or Jefferson) #2732

{{title}}

Replies: 2 comments 2 replies

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Support for specialized transcript notation styles (e.g. GAT 2 and/or Jefferson) #2732

zackbatist Jan 13, 2025

Overlaps and simultaneous speech

Laughter

Non-verbal vocal actions and events

Intelligibility

Replies: 2 comments · 2 replies

zackbatist Jan 13, 2025 Author

FoxyLevin Jan 26, 2025

zackbatist Jan 26, 2025 Author

FoxyLevin Jan 27, 2025

zackbatist
Jan 13, 2025

Replies: 2 comments 2 replies

zackbatist
Jan 13, 2025
Author

FoxyLevin
Jan 26, 2025

zackbatist Jan 26, 2025
Author