| Table of Contents | ||
|---|---|---|
|
Overview of the Reliability Transcription Process for Connected Speech Samples
To ensure the reliability of transcriptions for connected speech samples, the following structured process will be implemented:
Randomized Sample Selection:
A subset equivalent to 10% of the total transcription samples will be randomly selected for reliability testing. This selection will be project-specific (for example DrSantos_Spa_WABs_FTLD_2024, Kesha_nfvPPA_lvPPA_2024).
...
Connected Speech reliability supervisor: usually project lead of the research project, responsible to assign, track sample completion, compare the samples and run reliability analysis.
Connected Speech transcriber 1: first transcriber
Connected Speech transcriber 2: second transcriber, reliability supervisor assigns the samples they have to transcribe
...
Create transcription (.cha) file by copying the template that exists within this folder: 4. Picnic Scene_Rely (transcriber 2)
Copy the whisper output CODE001_BACC001_PicnicScene_Spa_Pre_20230914.txt and paste it in the template (.cha)
Fill out some particular fields in the headers of the file (@) (language, participant code, etc.,)
Segment in utterances following the transcription protocol rules CHAT
Code with the transcription protocol rules
Use CLAN to detect typos or spelling mistakes (command CHECK and command MOR)
Save in 4. Picnic Scene_Rely (transcriber 2) and make sure naming is correct by adding your initials (e.g. Ana Quinonez, AQ): CODE001_BACC001_CatRescue_Spa_Pre_20230914_Coded_AQ.cha
Running Rely Initial Comparison for Reliability (reliability supervisor)
Run transcriber 1 and transcriber 2 Coded_Rely samples through RELY
Save RELY output to this folder: 5. Picnic Scene_Rely output. The name of the file should be the original file name (excluding Coded_Rely_Initials), as provided here: BISE010_PreTX_PicnicDescription_Castellano.rely
Add score in Smartsheet CS Data Analysis in the 2 columns (report)reliability reports (1.3 SPANISH CS Reliability, 2.3 CATALAN CS Reliability)
The selected samples will be assessed for transcription accuracy using the CLAN software and using Rely (How to use RELY on CLAN). Two key metrics will be evaluated:
...