Implement MVP podcast generation #34

Alameen688 · 2025-10-06T02:47:39Z

Summary

Adds an MVP “Podcast” feature with local/S3 storage, a custom player, and RAG‑driven content.

What’s Included

Podcast generation
- Modes: Dialogue (Teacher/Student) and Presentation (Narrator)
- Voice selection per role (validated against supported voices)
- Optional focus topics and limiting to selected documents
- Stores audio as MP3 (local or S3) and saves transcript in DB
- Delete podcasts (media + DB row)
UI
- Podcast tab: generate form + document filters
- Custom player: play/pause, ±15s seek, progress, volume, delete
Retrieval
- Pinecone upsert batching to stay under the 2MB request limit
- Pinecone filters by course_id and optional document_ids
- Structured logs for RAG and podcast flows
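
As a rough illustration of the two Pinecone items above, here is a minimal sketch, not the code in this PR. It groups vectors so that each batch's JSON-encoded size stays under a byte budget, and builds a metadata filter using Pinecone's `$eq` and `$in` operators. The helper names, the safety margin, and the `document_id` metadata key are assumptions made for the example.

```python
import json
from typing import Any, Dict, Iterator, List, Optional

MAX_REQUEST_BYTES = 2 * 1024 * 1024  # the 2MB request limit mentioned above
SAFETY_MARGIN = 0.9  # assumed headroom for request overhead


def batch_vectors(
    vectors: List[Dict[str, Any]],
    max_bytes: int = int(MAX_REQUEST_BYTES * SAFETY_MARGIN),
) -> Iterator[List[Dict[str, Any]]]:
    """Yield groups of vectors whose combined JSON size stays under max_bytes.

    A single vector larger than max_bytes still gets a batch of its own.
    """
    batch: List[Dict[str, Any]] = []
    batch_bytes = 0
    for vector in vectors:
        size = len(json.dumps(vector).encode("utf-8"))
        # Close the current batch if adding this vector would exceed the budget.
        if batch and batch_bytes + size > max_bytes:
            yield batch
            batch, batch_bytes = [], 0
        batch.append(vector)
        batch_bytes += size
    if batch:
        yield batch


def build_filter(
    course_id: str, document_ids: Optional[List[str]] = None
) -> Dict[str, Any]:
    """Build a metadata filter for course_id and, optionally, document_ids."""
    flt: Dict[str, Any] = {"course_id": {"$eq": course_id}}
    if document_ids:
        # "document_id" is an assumed metadata key, not confirmed by this PR.
        flt["document_id"] = {"$in": list(document_ids)}
    return flt
```

Each yielded batch would then be passed to a separate upsert call, and the filter dict would be passed as the query's filter argument.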

How To Use

Go to a course → Podcast tab.
Enter Title (required), choose Mode/Voices.
(Optional) set Focus Topics and select specific Documents.
Generate → Play or Delete.

Config

.env
- PODCAST_STORAGE=local|s3
- PODCAST_LOCAL_DIR=/app/podcasts
- S3_BUCKET_NAME, AWS_REGION, credentials
- PODCAST_TEACHER_VOICE, PODCAST_STUDENT_VOICE
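
To make the relationship between these variables concrete, here is a hedged sketch of how the storage settings could be validated at startup. It is not the loader used in this PR: the class and function names are hypothetical, and the fallbacks (`local`, `/app/podcasts`) are assumptions taken from the example values above.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class PodcastStorageConfig:
    storage: str  # "local" or "s3"
    local_dir: str
    s3_bucket: Optional[str]
    aws_region: Optional[str]


def load_podcast_storage_config(
    env: Mapping[str, str] = os.environ,
) -> PodcastStorageConfig:
    """Read and validate the podcast storage settings from an env mapping."""
    storage = env.get("PODCAST_STORAGE", "local").strip().lower()
    if storage not in ("local", "s3"):
        raise ValueError(f"PODCAST_STORAGE must be 'local' or 's3', got {storage!r}")
    bucket = env.get("S3_BUCKET_NAME") or None
    region = env.get("AWS_REGION") or None
    # S3 storage needs a bucket and a region; credentials are resolved elsewhere.
    if storage == "s3" and not (bucket and region):
        raise ValueError("PODCAST_STORAGE=s3 requires S3_BUCKET_NAME and AWS_REGION")
    local_dir = env.get("PODCAST_LOCAL_DIR", "/app/podcasts")
    return PodcastStorageConfig(storage, local_dir, bucket, region)
```

Failing fast on a misconfigured `s3` setup avoids discovering the problem only when the first podcast is generated.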

Verification

Generate in both modes; confirm audio plays and transcript spacing is correct.
Limit to selected documents; confirm targeted context.
Delete podcast; verify media and DB cleanup.

Changes Unrelated to Podcast (and Why)

These changes are not directly part of the podcast feature, but I needed them to get the app running on my local machine. They include some debugging logs to trace the flow of data, as I also had issues getting chat to work.

72a94ee — fix prestart errors
- backend/app/alembic/versions/10368f38610b_fix_delete_document_error.py
  - Removed incorrect chat table re‑creation to prevent DuplicateTable during alembic upgrade.
- backend/scripts/prestart.sh
  - Ensures migrations + initial data run deterministically on container start.
b3b3373 — hold fixes to chat
- backend/app/api/routes/chat.py, backend/app/services/chat_service.py, backend/app/services/chat_cache.py
  - Added structured logging, cache checks, and safer streaming to make chat more observable and robust.
- backend/app/services/rag_service.py, backend/app/api/routes/documents.py
  - Fixed Pinecone filtering (by course_id), added diagnostics; later leveraged by podcast RAG.
- backend/app/schemas/public.py
  - Corrected ChatPublic schema to avoid validation errors when returning chat history.
- frontend/src/app/layout.tsx
  - Hydration fix to reduce dev error noise: added suppressHydrationWarning on <html> for next-themes. I think it is safe here because the client toggles the theme class and SSR defaults match, so we’re not hiding real mismatches.
- frontend/src/components/quiz/quiz-attempts.tsx
  - React key fix to silence list key warning.

backend/app/api/routes/documents.py

deluakin · 2025-10-06T03:20:42Z

backend/app/api/routes/podcasts.py

+
+@router.get("/{course_id}", response_model=PodcastsPublic)
+def list_podcasts(course_id: uuid.UUID, session: SessionDep, current_user: CurrentUser) -> Any:
+    pods = session.exec(select(Podcast).where(Podcast.course_id == course_id)).all()


Some pagination might be useful here

Yes. We can add this as a feature in another PR on both the backend and frontend.

backend/app/api/routes/podcasts.py

michaelgichia

This is really impressive, and I will be testing it out in a few hours. I've left a few comments to address before it is merged.

backend/app/api/routes/documents.py

backend/app/api/routes/podcasts.py

backend/app/services/podcast_service.py

frontend/src/lib/podcast-service.ts

michaelgichia · 2025-10-06T03:35:41Z

frontend/src/runtime-config.ts


-  const baseURL =
-    process.env.NEXT_PUBLIC_BACKEND_BASE_URL ?? 'http://localhost:8000'
+  const baseURL = isServer


You are probably running into this error because you are not using the client.

#15 (comment)

I can remove this from this PR, but this was an issue I ran into even before implementing podcast. I couldn't log in or sign up. (PS: I'm running the project with Docker.)

.env.example

frontend/src/components/podcast.tsx

backend/app/api/routes/podcasts.py

deluakin · 2025-10-06T03:44:39Z

backend/app/api/routes/podcasts.py

+    teacher_voice = body.teacher_voice if body and body.teacher_voice else settings.PODCAST_TEACHER_VOICE
+    student_voice = body.student_voice if body and body.student_voice else settings.PODCAST_STUDENT_VOICE
+    narrator_voice = body.narrator_voice if body and body.narrator_voice else settings.PODCAST_TEACHER_VOICE


None of this is really necessary if we use an enum type and specify a default value.
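
One possible shape for that suggestion, sketched with stdlib dataclasses so it runs without dependencies (the same idea applies to a Pydantic or SQLModel schema). The `Voice` members and the `GeneratePodcastRequest` name are hypothetical placeholders, not the voices or schema in this PR. Note that static enum defaults would replace the env-driven `settings.PODCAST_*_VOICE` fallbacks shown above.

```python
from dataclasses import dataclass
from enum import Enum


class Voice(str, Enum):
    # Hypothetical voice names, used only for illustration.
    VOICE_A = "voice_a"
    VOICE_B = "voice_b"


@dataclass
class GeneratePodcastRequest:
    title: str
    # Defaults live on the schema, so the route needs no per-field fallback.
    teacher_voice: Voice = Voice.VOICE_A
    student_voice: Voice = Voice.VOICE_B
    narrator_voice: Voice = Voice.VOICE_A

    def __post_init__(self) -> None:
        # Coerce raw strings to Voice; unsupported values raise ValueError.
        self.teacher_voice = Voice(self.teacher_voice)
        self.student_voice = Voice(self.student_voice)
        self.narrator_voice = Voice(self.narrator_voice)
```

With this in place, the route can read `body.teacher_voice` directly, and validation against supported voices happens when the request object is built.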

deluakin

Good start, but some improvements will be needed.

Alameen688 · 2025-10-06T07:53:08Z

frontend/src/client/types.gen.ts

-     * Total Submitted
-     */
-    total_submitted: number;
-    /**
-     * Total Correct
-     */
-    total_correct: number;
-    /**
-     * Score Percentage


FYI: the deletions in the autogenerated files happened automatically after I ran the client generation script #34 (comment)

Alameen688 · 2025-10-06T08:24:01Z

@deluakin @michaelgichia Thanks for your review!
I'm not familiar with the entire stack, but your comments have been helpful. I have updated most (if not all) of it to match the expected structure/convention. Kindly review again.

michaelgichia

A few comments, and this should be good. Testing now.

michaelgichia · 2025-10-06T08:46:41Z

backend/app/api/routes/documents.py

                "id": embedding_uuid,
                "values": embedding,
                "metadata": {
+                    "course_id": str(document.course_id),


Thanks for fixing this. I was curious why it wasn't working.

michaelgichia · 2025-10-06T08:48:13Z

backend/app/api/routes/podcasts.py

+
+
+@router.get("/course/{course_id}", response_model=PodcastsPublic)
+def list_podcasts(course_id: uuid.UUID, session: SessionDep, _current_user: CurrentUser) -> PodcastsPublic:


Suggested change

def list_podcasts(course_id: uuid.UUID, session: SessionDep, _current_user: CurrentUser) -> PodcastsPublic:

def list_podcasts(course_id: uuid.UUID, session: SessionDep, _current_user: CurrentUser, skip: int = 0, limit: int = 50) -> PodcastsPublic:
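
To spell out the intended semantics of `skip` and `limit`, here is a small stand-alone sketch over an in-memory list. The `paginate` helper and the `MAX_LIMIT` cap are assumptions for illustration. In the route itself, the equivalent would presumably be chaining `.offset(skip).limit(limit)` onto the existing `select(Podcast)` statement.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")

MAX_LIMIT = 100  # assumed upper bound so a client cannot request everything


def paginate(items: Sequence[T], skip: int = 0, limit: int = 50) -> List[T]:
    """Return at most `limit` items, starting after the first `skip` items."""
    skip = max(skip, 0)  # treat negative offsets as zero
    limit = min(max(limit, 0), MAX_LIMIT)  # clamp to the range [0, MAX_LIMIT]
    return list(items[skip : skip + limit])
```

Clamping on the server keeps a missing or oversized `limit` from turning the list endpoint back into an unbounded query.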

michaelgichia · 2025-10-06T08:53:34Z

frontend/src/app/api/v1/podcasts/audio/[podcastId]/route.ts

+}
+
+export const config = {
+  runtime: 'nodejs',


Suggested change

runtime: 'nodejs',

runtime: 'nodejs',

maxDuration: 300,

michaelgichia · 2025-10-06T08:55:17Z

frontend/src/app/api/v1/podcasts/audio/[podcastId]/route.ts

+      console.error('[PodcastAudio] Audio stream error:', error)
+    }
+
+    return Response.json({ error: 'Failed to stream audio' }, { status: 500 })


Return the error like this, otherwise, the client will fail because it expects a specific payload structure:

const status: number = get(
  error as Record<string, never>,
  'response.status',
  500,
)
const body: ErrorResponse = get(
  error as Record<string, never>,
  'response.data.detail',
  {
    detail: 'Internal Server Error',
  },
)
return NextResponse.json(body, {status})

I figured it didn't matter because I wasn't using the ErrorBox component, which relies on error.detail, but I can add it in case of future use.

michaelgichia · 2025-10-06T08:56:15Z

frontend/src/components/create-course/upload-documents.tsx

 import {CourseWithDocuments} from '@/client'
 import FileCard from '@/components/ui/file-card'
-import {getCourse} from '@/lib/courses'
+import {getCourse} from '@/actions/courses'


Why the change?

for consistency, as I noticed you were importing getCourse directly from '@/actions/courses' in the other parts of the dashboard

michaelgichia · 2025-10-06T08:57:23Z

frontend/src/components/project-settings.tsx


 import {CourseWithDocuments} from '@/client'
-import {getCourse} from '@/lib/courses'
+import {getCourse} from '@/actions/courses'


Why the change?

#34 (comment)

.env.example

michaelgichia · 2025-10-06T09:00:12Z

We point the PRs to dev first.

Alameen688 · 2025-10-06T11:13:35Z

frontend/src/app/(routes)/(dashboard)/dashboard/layout.tsx

    // Configure axios client per request
    client.setConfig({
-      baseURL: process.env.NEXT_PUBLIC_BACKEND_BASE_URL as string,
+      baseURL: process.env.NEXT_INTERNAL_BACKEND_BASE_URL as string,


@michaelgichia After merging to dev, I noticed this was changed to NEXT_PUBLIC_BACKEND_BASE_URL, but I had to revert it to NEXT_INTERNAL_BACKEND_BASE_URL for things to run locally.
Not sure if this is fine.

Alameen688 added 3 commits October 5, 2025 05:57

fix prestart errors

72a94ee

hold fixes to chat

b3b3373

implement mvp podcast

d9a78a1

Alameen688 requested review from d-beloved and michaelgichia October 6, 2025 02:47