Podcast transcription with Modal
note
To see video of this tutorial
In this tutorial, you'll build a pipeline with Dagster that:
- Automatically detects newly published podcasts
- Transcribes them on GPUs using Modal
- Notifies you with a summary
Prerequisites
To follow the steps in this guide, you'll need:
- Basic Python knowledge
- Python 3.9+ installed on your system. For more information, see the Installation guide.
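As a quick way to confirm the Python prerequisite, the following is a minimal sketch that compares the running interpreter against the 3.9 minimum stated above. The helper name `meets_minimum` is hypothetical and used only for illustration; it is not part of Dagster.

```python
import sys

# Minimum version taken from the prerequisites above.
MIN_VERSION = (3, 9)


def meets_minimum(version_info=sys.version_info, minimum=MIN_VERSION):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    status = "OK" if meets_minimum() else "too old for this tutorial"
    print(f"Python {sys.version_info.major}.{sys.version_info.minor}: {status}")
```

Run it with the same interpreter you plan to use for the virtual environment, since a system may have several Python versions installed.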
Step 1: Set up your Dagster environment
First, set up a new Dagster project.
- Clone the Dagster repo and navigate to the project:
cd examples/project_dagster_modal_pipes
- Create and activate a virtual environment:
  - macOS
    uv venv dagster_tutorial
    source dagster_tutorial/bin/activate
  - Windows
    uv venv dagster_tutorial
    dagster_tutorial\Scripts\activate
- Install Dagster and the required dependencies:
uv pip install -e ".[dev]"
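After the install step, it can be useful to confirm that the packages actually landed in the active virtual environment. The sketch below uses the standard library's `importlib.metadata` to look up installed versions. The helper `installed_version` is hypothetical, and the package name `modal` is an assumption based on the tutorial title; the project's real dependency list is not shown here.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "dagster" comes from this guide; "modal" is an assumed package name.
    for name in ("dagster", "modal"):
        print(f"{name}: {installed_version(name) or 'not installed'}")
```

If a package is reported as not installed, check that the virtual environment is activated before re-running the install command.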
Step 2: Launch the Dagster webserver
To make sure Dagster and its dependencies were installed correctly, navigate to the project root directory and start the Dagster webserver:
dagster dev
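Once `dagster dev` is running, you can check from a second terminal that the webserver is accepting connections. This is a minimal sketch that assumes the default address of `http://localhost:3000`; adjust the port if you started the server on a different one. The helper `is_listening` is hypothetical and only tests that a TCP connection can be opened, not that the UI has fully loaded.

```python
import socket


def is_listening(host="127.0.0.1", port=3000, timeout=1.0):
    """Return True if a TCP connection to (host, port) succeeds within `timeout`."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # Port 3000 is assumed to be the webserver's default.
    if is_listening():
        print("Webserver is reachable at http://localhost:3000")
    else:
        print("Nothing is listening on port 3000; is `dagster dev` running?")
```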
Next steps
- Continue this tutorial with the Modal application