A newer version of the Gradio SDK is available:
6.0.1
metadata
title: Meeting Minutes Generator
emoji: π
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
license: mit
π Meeting Minutes Generator
Upload meeting audio and get professionally formatted minutes instantly!
π Features
- Audio Transcription using Groq Whisper Large v3
- Smart Formatting using Groq GPT-OSS-120B
- Instant Results in clean Markdown format
- No Login Required - privacy-focused
- Free to Use - no hidden costs
π― How to Use
- Upload your meeting audio file (MP3, WAV, M4A, WEBM)
- Click "Generate Minutes"
- Wait ~30-40 seconds for processing
- Copy your formatted minutes
π Supported Files
- Formats: MP3, WAV, M4A, WEBM, FLAC
- Max Size: 25 MB (~25 minutes at 128kbps)
- Languages: English (optimized)
π οΈ Tech Stack
- Backend: FastAPI
- Frontend: Gradio
- Transcription: Groq Whisper Large v3 (temperature: 0.1)
- Minutes Generation: Groq GPT-OSS-120B (120B parameters, temperature: 0.1)
π» Local Setup
If you want to run this locally:
- Clone the repository
- Install dependencies:
pip install -r requirements.txt - Add your
GROQ_API_KEYto.envfile - Run:
python app.py - Open: http://localhost:7860
π Privacy & Security
- β No data is stored or logged
- β Audio files are processed in memory only
- β Temporary files are deleted after processing
- β Your meetings stay completely private
- β Open source - audit the code yourself
π Output Format
The generated minutes include:
- Meeting metadata (date, attendees)
- Summary (2-3 sentence overview)
- Key discussions (structured agenda items)
- Action items (with owners and deadlines when mentioned)
- Open issues/concerns
- Additional context (when relevant)
π Use Cases
Perfect for:
- Team standups and sprint reviews
- Client meetings and consultations
- Board meetings and executive sessions
- Educational lectures and seminars
- Medical consultations (keep HIPAA in mind)
- Legal depositions (with appropriate consent)
β‘ Performance
- Transcription: ~30 seconds for 10-minute audio
- Minutes Generation: ~5 seconds
- Total Processing: ~35-40 seconds end-to-end
π§ Technical Details
API Endpoints (Backend)
POST /transcribe- Audio to text transcriptionPOST /generate-minutes- Text to formatted minutesGET /- Health check
Models Used
- Whisper Large v3: 1550M parameters, multilingual
- GPT-OSS-120B: 120B parameters, optimized for structured output
Processing Pipeline
Audio Upload β Validation β Whisper Transcription β
LLM Formatting β Markdown Output β Copy/Download
π€ Contributing
This is an MVP. Future enhancements could include:
- Speaker diarization (who said what)
- Multi-language support (Hindi, Spanish, etc.)
- PDF/DOCX export
- Meeting history with search
- Real-time transcription
- Calendar integration
π¨βπ» Built By
Made with β€οΈ by Jai
π License
MIT License - Free to use and modify
β οΈ Note: This app requires a Groq API key. Add yours in the HF Spaces Settings β Repository Secrets with the name GROQ_API_KEY.
---
## π Files Summary for HF Upload
**Upload these 6 files to HF Spaces:**
1. β
`app.py` (NEW - just gave you)
2. β
`backend.py` (your existing file)
3. β
`frontend.py` (your existing file)
4. β
`requirements.txt` (your existing file)
5. β
`README.md` (updated version above)
6. β
`.gitignore` (NEW - just gave you)
---
## β
Final Checklist
**In Cursor, verify you now have:**
hackathon/ βββ app.py β NEW βββ backend.py β Existing βββ frontend.py β Existing βββ requirements.txt β Existing βββ README.md β Updated βββ .gitignore β NEW