- Update torch from 2.0.1 to 2.1.0 - Update torchaudio from 2.0.2 to 2.1.0 (fixes 'NoneType' object has no attribute 'eval' error) - Update torchvision from 0.15.2 to 0.16.0 - Update pytorch-lightning from 2.0.6 to 2.1.0 - Add explicit speechbrain==1.0.0 dependency - Update transformers and tokenizers for compatibility - Update protobuf version constraint Fixes diarization error: speechbrain.pretrained was deprecated and redirected to speechbrain.inference in SpeechBrain 1.0, but required torchaudio >= 2.1.0
54 lines
1.5 KiB
Plaintext
54 lines
1.5 KiB
Plaintext
# OBS Recording Transcriber Dependencies
|
|
# Core dependencies with pinned compatible versions
|
|
streamlit==1.26.0
|
|
moviepy==1.0.3
|
|
openai-whisper==20231117
|
|
requests>=2.28.0
|
|
humanize>=4.6.0
|
|
|
|
# PyTorch ecosystem - updated for SpeechBrain 1.0 compatibility
|
|
# torchaudio >= 2.1.0 is REQUIRED for diarization to work properly
|
|
torch==2.1.0
|
|
torchaudio==2.1.0
|
|
torchvision==0.16.0
|
|
|
|
# Transformers ecosystem - compatible versions
|
|
transformers==4.35.0
|
|
tokenizers>=0.14.0
|
|
|
|
# ML dependencies with compatible versions
|
|
numpy==1.24.3
|
|
scipy==1.10.1
|
|
scikit-learn==1.3.0
|
|
|
|
# Audio processing and ML models
|
|
# Pin speechbrain for stability with pyannote.audio
|
|
speechbrain==1.0.0
|
|
pyannote.audio==3.1.1
|
|
pytorch-lightning==2.1.0
|
|
|
|
# Other dependencies
|
|
iso639>=0.1.4
|
|
protobuf>=3.20.0,<5.0.0
|
|
matplotlib>=3.5.0
|
|
soundfile>=0.10.3
|
|
ffmpeg-python>=0.2.0
|
|
|
|
# Optional: Ollama Python client (uncomment to install)
|
|
# ollama
|
|
|
|
# Installation notes:
|
|
# 1. For Windows users, you may need to install PyTorch separately:
|
|
# pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
|
|
#
|
|
# 2. For tokenizers issues, try installing Visual C++ Build Tools:
|
|
# https://visualstudio.microsoft.com/visual-cpp-build-tools/
|
|
#
|
|
# 3. For pyannote.audio, you'll need a HuggingFace token with access to:
|
|
# https://huggingface.co/pyannote/speaker-diarization-3.0
|
|
#
|
|
# 4. FFmpeg is required for audio processing:
|
|
# Windows: https://www.gyan.dev/ffmpeg/builds/
|
|
# Mac: brew install ffmpeg
|
|
# Linux: apt-get install ffmpeg
|