Files
TalkEdit/README.md

75 lines
2.5 KiB
Markdown
Raw Normal View History

2025-01-28 17:00:03 -05:00
# OBS Recording Transcriber
2025-01-28 17:00:49 -05:00
Process OBS recordings or any video/audio files with AI-based transcription and summarization locally on your machine.
2025-01-28 17:00:03 -05:00
## Features
- AI transcription using Whisper.
- Summarization using Hugging Face Transformers.
- File selection, resource validation, and error handling.
- Speaker diarization to identify different speakers in recordings.
- Language detection and translation capabilities.
- Keyword extraction with timestamp linking.
- Interactive transcript with keyword highlighting.
- Export to TXT, SRT, VTT, and ASS subtitle formats with compression options.
- GPU acceleration for faster processing.
- Caching system for previously processed files.
2025-01-28 17:00:03 -05:00
## Installation
### Easy Installation (Recommended)
#### Windows
1. Download or clone the repository
2. Run `install.bat` by double-clicking it
3. Follow the on-screen instructions
#### Linux/macOS
1. Download or clone the repository
2. Open a terminal in the project directory
3. Make the install script executable: `chmod +x install.sh`
4. Run the script: `./install.sh`
5. Follow the on-screen instructions
### Manual Installation
2025-01-28 17:00:03 -05:00
1. Clone the repo.
```
git clone https://github.com/DataAnts-AI/VideoTranscriber.git
2025-01-28 17:05:50 -05:00
cd VideoTranscriber
```
2025-01-28 17:00:03 -05:00
2. Install dependencies:
```
pip install -r requirements.txt
```
2025-01-28 17:00:03 -05:00
Notes:
- Ensure that the versions align with the features you use and your system compatibility.
- torch version should match the capabilities of your hardware (e.g., CUDA support for GPUs).
- For advanced features like speaker diarization, you'll need a HuggingFace token.
- See `INSTALLATION.md` for detailed instructions and troubleshooting.
3. Run the application:
```
streamlit run app.py
```
## Usage
1. Set your base folder where OBS recordings are stored
2. Select a recording from the dropdown
3. Choose transcription and summarization models
4. Configure performance settings (GPU acceleration, caching)
5. Select export formats and compression options
6. Click "Process Recording" to start
## Advanced Features
- **Speaker Diarization**: Identify and label different speakers in your recordings
- **Translation**: Automatically detect language and translate to multiple languages
- **Keyword Extraction**: Extract important keywords with timestamp links
- **Interactive Transcript**: Navigate through the transcript with keyword highlighting
- **GPU Acceleration**: Utilize your GPU for faster processing
- **Caching**: Save processing time by caching results
2025-01-28 17:00:03 -05:00
## Contributing
Contributions are welcome! Please feel free to submit a Pull Request.