3. Whisper Transcription Process (Research Assistants)
How to Process Audios Through Whisper
What is Whisper? - Whisper is an advanced automatic speech recognition (ASR) system developed by OpenAI that converts spoken language from audio recordings into written text. It supports transcription in multiple languages and can also translate speech into English. When executed on the Texas Advanced Computing Center (TACC), Whisper efficiently processes large-scale audio datasets to produce precise, time-aligned transcriptions—making it an invaluable tool for research, analysis, and data processing workflows.
Clipping and reclipping should always be completed in the lab. If you must work remotely, it’s essential to follow these steps since we are handling patient data:
Connect through the UT VPN
Work in a private, secure location
1. Update Status of Whisper Processing in Smartsheet | Update the status of whisper transcriptions in Smartsheet report the files belong in such as these: Make sure to mark them depending on the step you are on:
“Pre-whisper” means that the file has already been transcribed manually before Whisper began to be used. | |
2. Checking and correcting Audio File Names and that files are saved in the correct folder Before Running Whisper | Before processing your audio files with Whisper, it’s important to verify that all file names are correct and consistent. Proper naming helps avoid errors during transcription and keeps your workflow organized. Make sure that each audio file follows the expected naming convention (for example, | |
3. Access TACC Account | Access the shared credentials on 1password to enter TACC Analysis Portal
| |
3.1 Option A USING THE LABS laptop | Use the MAC laptop found in the main lab space, if not on table, ask Sonia to retrieve it for you If you are accessing TACC for the first time, please go here: https://tap.tacc.utexas.edu/ If you have already logged into TACC before, please access the Analysis Portal here: https://tap.tacc.utexas.edu/jobs/ | |
3.1 Option B USING your own device | If you are accessing it for the first time you need a TACC token. Send a message on the channel requestion for the TACC token tagging the team and @Kesha Pugalenthi . The TACC token is a number that gets sent to us privately and it has an expiration time, please make sure to stay on this site until you have entered the TACC Token
Please do not sign out of the TACC account after running whisper in your device so this step isn’t needed the next time you run whisper |
Request for TACC token
|
3.2 | Submit a job on TACC by clicking on the dropdowns and selecting:
| |
4. Enter Jupiter | If there are available nodes (picture A), you will be able to enter Jupiter right away. In that case, follow these steps:
| A B |
5. Set up Folder |
| |
6. Upload Audio Files |
|
|
7. Go to the Terminal |
| |
8. Type the Command (If running several files at once) | Once you are in the terminal, follow these steps (if running several audio files):
|
|
8. Type the command (if running single file) | If running 1 single file, follow these steps:
Type command below, then press enter: The commands in bold change depending on:
|
|
9. Whisper Running | Depending on the number of files, they may take some time to run. You will see it transcribing in real time and will know when it's done running when you see this at the bottom of the terminal (see picture). | |
10. Find Output Files | Once the files are finished running, follow the next steps:
| A |
11. Download Output Files |
Type command below, then press enter: zip -r BISE004_Output.zip BISE004_Output The commands in bold change depending on:
| |
12. Clear Cache from TACC | Run command to clear cache. This is an important step because TACC will not allow you to start in the future if the home directory exceeds 9GB (the cache directory is in the home directory).
| |
13. Log Out |
| A |
14. Upload Files to Box | Go to this link: https://utexas.box.com/s/ghd8ho1ciko1n2le94u796cnhcf57rgw
|
|
15. Delete Files from TACC | Once the files have been uploaded to Box, delete them from TACC. | |
16. Unfinished Whisper Files | A printed calendar of schedules for the Whisper and Clipping Team is now available on the main lab’s big desk. This calendar should be used to verify who is available to assume Whisper tasks if a shift concludes before all files have been processed. The digital version of the calendar can be accessed through the provided link. At the end of each day, the last individual using Whisper is responsible for saving all completed files, properly shutting down the laptop, and storing it in the white cabinet located next to the snacks. | Digital Smartsheet: https://utexas.app.box.com/file/2043269077900?s=eizx5tb1ijkjpr35gv5jib5efeqr64xj |