7ea098bd05c08d7edd502d60bb2584e64de6a740
OBS Recording Transcriber
Process OBS recordings or any video/audio files with AI-based transcription and summarization locally on your machine.
Features
- AI transcription using Whisper.
- Summarization using Hugging Face Transformers.
- File selection, resource validation, and error handling.
- Speaker diarization to identify different speakers in recordings.
- Language detection and translation capabilities.
- Keyword extraction with timestamp linking.
- Interactive transcript with keyword highlighting.
- Export to TXT, SRT, VTT, and ASS subtitle formats with compression options.
- GPU acceleration for faster processing.
- Caching system for previously processed files.
Installation
Easy Installation (Recommended)
Windows
- Download or clone the repository
- Run
install.batby double-clicking it - Follow the on-screen instructions
Linux/macOS
- Download or clone the repository
- Open a terminal in the project directory
- Make the install script executable:
chmod +x install.sh - Run the script:
./install.sh - Follow the on-screen instructions
Manual Installation
- Clone the repo.
git clone https://github.com/DataAnts-AI/VideoTranscriber.git
cd VideoTranscriber
- Install dependencies:
pip install -r requirements.txt
Notes:
- Ensure that the versions align with the features you use and your system compatibility.
- torch version should match the capabilities of your hardware (e.g., CUDA support for GPUs).
- For advanced features like speaker diarization, you'll need a HuggingFace token.
- See
INSTALLATION.mdfor detailed instructions and troubleshooting.
- Run the application:
streamlit run app.py
Usage
- Set your base folder where OBS recordings are stored
- Select a recording from the dropdown
- Choose transcription and summarization models
- Configure performance settings (GPU acceleration, caching)
- Select export formats and compression options
- Click "Process Recording" to start
Advanced Features
- Speaker Diarization: Identify and label different speakers in your recordings
- Translation: Automatically detect language and translate to multiple languages
- Keyword Extraction: Extract important keywords with timestamp links
- Interactive Transcript: Navigate through the transcript with keyword highlighting
- GPU Acceleration: Utilize your GPU for faster processing
- Caching: Save processing time by caching results
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Languages
TypeScript
60.5%
Python
27.6%
Rust
9.3%
Shell
2%
JavaScript
0.3%
Other
0.3%