Transcription Speech to Text (imported from uploaded file)

digital-humanities-Whisper_speech_recognition/main-workflow

Author(s)
Daniela Schneider
version Version
1
last_modification Last updated
Apr 8, 2026
license License
CC-BY-4.0
galaxy-tags Tags
whisper
transcription
audio
video

Features
Tutorial
hands_on Transcribing Audio and Video files with Automated Speech Recognition

Workflow Testing
Tests: ✅
Results: Not yet automated
FAIRness purl PURL
https://gxy.io/GTN:
RO-Crate logo with flask Download Workflow RO-Crate
Launch in Tutorial Mode question
galaxy-download Download
flowchart TD
  0["ℹ️ Input Dataset\nVideo or Audio File"];
  style 0 stroke:#2c3143,stroke-width:4px;
  1["Speech to Text with Diarization"];
  0 -->|output| 1;
  2["Allocate Moderator"];
  1 -->|output_txt| 2;
  3["Allocate Kennedy"];
  2 -->|outfile| 3;
  4["Allocate Nixon"];
  3 -->|outfile| 4;
  5["Allocate Journalist"];
  4 -->|outfile| 5;
  6["Extract Passages spoken by Nixon"];
  5 -->|outfile| 6;
  7["Extract Passages spoken by Kennedy"];
  5 -->|outfile| 7;
  8["Delete Speaker Name for Nixon"];
  6 -->|output| 8;
  7c0ead7c-a704-4bc8-babf-b3213bbb3273["Output\nNixon-transcribed-cleaned"];
  8 --> 7c0ead7c-a704-4bc8-babf-b3213bbb3273;
  style 7c0ead7c-a704-4bc8-babf-b3213bbb3273 stroke:#2c3143,stroke-width:4px;
  9["Delete Speaker Name for Kennedy"];
  7 -->|output| 9;
  5f9b4108-c084-4099-967a-c00e6ccfbf5c["Output\nKennedy-transcribed-cleaned"];
  9 --> 5f9b4108-c084-4099-967a-c00e6ccfbf5c;
  style 5f9b4108-c084-4099-967a-c00e6ccfbf5c stroke:#2c3143,stroke-width:4px;

Inputs

Input Label
Input dataset Video or Audio File

Outputs

From Output Label
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 Replace Delete Speaker Name for Nixon
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 Replace Delete Speaker Name for Kennedy

Tools

Tool Links
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 View in ToolShed
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.5+galaxy2 View in ToolShed
toolshed.g2.bx.psu.edu/repos/bgruening/whisperx/whisperx/3.4.2+galaxy1 View in ToolShed

To use these workflows in Galaxy you can either click the links to download the workflows, or you can right-click and copy the link to the workflow which can be used in the Galaxy form to import workflows.

Importing into Galaxy

Below are the instructions for importing these workflows directly into your Galaxy server of choice to start using them!
Hands On: Importing a workflow
  1. Click on galaxy-workflows-activity Workflows in the Galaxy activity bar (on the left side of the screen, or in the top menu bar of older Galaxy instances). You will see a list of all your workflows
  2. Click on galaxy-upload Import at the top-right of the screen
  3. Provide your workflow
    • Option 1: Paste the URL of the workflow into the box labelled “Archived Workflow URL”
    • Option 2: Upload the workflow file in the box labelled “Archived Workflow File”
  4. Click the Import workflow button

Below is a short video demonstrating how to import a workflow from GitHub using this procedure:

Video: Importing a workflow from URL

Version History

Version Commit Time Comments
3 993990f70 2026-03-25 14:23:34 Add files via upload
2 d8ab37596 2026-03-25 13:38:13 Add files via upload
1 e3cab8732 2026-03-24 14:35:06 Add files via upload

For Admins

Installing the workflow tools

wget https://training.galaxyproject.org/training-material/topics/digital-humanities/tutorials/Whisper_speech_recognition/workflows/main_workflow.ga -O workflow.ga
workflow-to-tools -w workflow.ga -o tools.yaml
shed-tools install -g GALAXY -a API_KEY -t tools.yaml
workflow-install -g GALAXY -a API_KEY -w workflow.ga --publish-workflows