Commit Graph

22 Commits

Author SHA1 Message Date
d1e1fedcae fix: Resolve issues #7, #8, #9 - moviepy, transformers, Whisper OOM
Issue #7: Handle moviepy 2.x removing verbose param from write_audiofile

Issue #8: Pin transformers<5.0.0 to fix summarization pipeline task registry

Issue #9: Add Whisper model memory warnings and OOM error handling
2026-03-03 02:10:52 -05:00
70c5d32413 feat: Add streaming Ollama support, model caching, and UI improvements
- Add streaming summarization via Ollama API (stream_summarize_with_ollama)

- Cache ML models with @st.cache_resource (diarization, NER, translation, Whisper)

- Add temp file cleanup for extracted audio

- Add system capabilities detection (FFmpeg, GPU info)

- Add get_video_duration utility

- Improve validation with FFmpeg check

- Rewrite app.py with streaming support and UI enhancements

- Clean up redundant comments and unused imports across all utils
2026-02-18 10:26:09 -05:00
ce398ae1d4 fix: Update moviepy import for v2.x compatibility (closes #6)
moviepy 2.x removed the moviepy.editor submodule. Import AudioFileClip directly from moviepy with a fallback for moviepy 1.x users. Also close the audio clip after writing to prevent resource leaks.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-13 00:15:07 -05:00
168bf5f573 fix: Free up disk space in GitHub Actions runner
- Add disk cleanup step to remove unused packages (~30GB freed)
- Remove arm64 platform build (reduces disk usage significantly)
- Add cleanup step between CPU and GPU builds
- Fixes 'No space left on device' error during Docker build
2026-01-05 17:34:07 -05:00
efee0b0abe fix: Resolve protobuf dependency conflict for pyannote.audio 4.x
- Update protobuf from <5.0.0 to >=5.0.0 (required by opentelemetry-proto)
- Update streamlit minimum version to >=1.30.0 (protobuf 5.x compatible)
- Update regular Dockerfile to match GPU dockerfile structure
- Install PyTorch CPU version in regular Dockerfile for consistency
2026-01-05 11:30:56 -05:00
4dd3c7600e fix: Remove torch from requirements.txt and use flexible versions
- Remove torch/torchaudio/torchvision from requirements.txt (installed separately in Docker)
- Use >= instead of == for most packages to avoid version conflicts
- Install numpy before other requirements
- Add setuptools and wheel to pip upgrade step
2026-01-05 11:25:12 -05:00
78e9df31e6 fix: Use flexible PyTorch versions and fix Docker build order
- Don't pin exact torch/torchaudio/torchvision versions (use >=2.1.0)
- Install CUDA PyTorch first in Docker before other requirements
- Upgrade pip before installations to avoid compatibility issues
- Let pip resolve latest compatible versions from cu118 index
2026-01-05 10:50:20 -05:00
c5cc7c2969 fix: Update PyTorch/torchaudio to 2.1.0 for SpeechBrain 1.0 diarization compatibility
- Update torch from 2.0.1 to 2.1.0
- Update torchaudio from 2.0.2 to 2.1.0 (fixes 'NoneType' object has no attribute 'eval' error)
- Update torchvision from 0.15.2 to 0.16.0
- Update pytorch-lightning from 2.0.6 to 2.1.0
- Add explicit speechbrain==1.0.0 dependency
- Update transformers and tokenizers for compatibility
- Update protobuf version constraint

Fixes diarization error: speechbrain.pretrained was deprecated and redirected to speechbrain.inference in SpeechBrain 1.0, but required torchaudio >= 2.1.0
2026-01-05 09:57:20 -05:00
81f40726e3 Refactor keyword extraction logic to convert stopwords to a list for compatibility with scikit-learn; add checks for empty keywords list in timestamp extraction function. 2025-09-10 23:33:57 -04:00
f04853eba9 Enhance file selection to support additional audio format (M4A) and update README.md to reflect new supported formats for video and audio recordings. 2025-08-05 11:18:36 -04:00
3346b0df0f Refactor Dockerfile and requirements.txt for improved dependency management; update README.md with clearer installation options and instructions for prebuilt images. 2025-07-23 14:45:19 -04:00
dcf13c1423 Enhance README.md with Docker installation instructions and update Ollama API endpoint to be configurable via environment variable. 2025-07-17 00:05:23 -04:00
c2ee2394d2 Enhance file selection to support multiple video formats (MP4, AVI, MOV, MKV) and provide user guidance on supported formats. 2025-07-16 23:50:32 -04:00
ce9bb9c2e2 Fix summarization issues and improve GPU handling. Update .gitignore for venv 2025-04-30 12:09:10 -04:00
9ca396d6fa Update README.md 2025-03-03 12:28:32 -05:00
56e947cc71 Update README.md 2025-03-03 12:07:30 -05:00
524f0d6a6c Update README.md
New implemented modules
2025-03-03 08:26:39 -05:00
7ea098bd05 Add installation scripts and update documentation for Phase 3 features 2025-03-01 20:37:52 -05:00
a653ac7f28 Update README.md 2025-01-28 17:05:50 -05:00
a78d2a6053 Update README.md 2025-01-28 17:00:49 -05:00
3c44257d8a Updated with code 2025-01-28 17:00:03 -05:00
2b37a68c56 Initial commit 2025-01-28 16:57:23 -05:00