dillon_stuff/TalkEdit

Fork 0

Go to file

Your Name 7ea098bd05 Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

utils

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

.gitattributes

Initial commit

2025-01-28 16:57:23 -05:00

.gitignore

Updated with code

2025-01-28 17:00:03 -05:00

app.py

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

GEMINI_INSIGHTS.md

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

install.bat

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

install.py

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

install.sh

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

INSTALLATION.md

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

LICENSE

Initial commit

2025-01-28 16:57:23 -05:00

README.md

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

requirements.txt

Add installation scripts and update documentation for Phase 3 features

2025-03-01 20:37:52 -05:00

README.md

OBS Recording Transcriber

Process OBS recordings or any video/audio files with AI-based transcription and summarization locally on your machine.

Features

AI transcription using Whisper.
Summarization using Hugging Face Transformers.
File selection, resource validation, and error handling.
Speaker diarization to identify different speakers in recordings.
Language detection and translation capabilities.
Keyword extraction with timestamp linking.
Interactive transcript with keyword highlighting.
Export to TXT, SRT, VTT, and ASS subtitle formats with compression options.
GPU acceleration for faster processing.
Caching system for previously processed files.

Installation

Easy Installation (Recommended)

Windows

Download or clone the repository
Run install.bat by double-clicking it
Follow the on-screen instructions

Linux/macOS

Download or clone the repository
Open a terminal in the project directory
Make the install script executable: chmod +x install.sh
Run the script: ./install.sh
Follow the on-screen instructions

Manual Installation

Clone the repo.

git clone https://github.com/DataAnts-AI/VideoTranscriber.git
cd VideoTranscriber

Install dependencies:

pip install -r requirements.txt

Notes:

Ensure that the versions align with the features you use and your system compatibility.
torch version should match the capabilities of your hardware (e.g., CUDA support for GPUs).
For advanced features like speaker diarization, you'll need a HuggingFace token.
See INSTALLATION.md for detailed instructions and troubleshooting.

Run the application:

streamlit run app.py

Usage

Set your base folder where OBS recordings are stored
Select a recording from the dropdown
Choose transcription and summarization models
Configure performance settings (GPU acceleration, caching)
Select export formats and compression options
Click "Process Recording" to start

Advanced Features

Speaker Diarization: Identify and label different speakers in your recordings
Translation: Automatically detect language and translate to multiple languages
Keyword Extraction: Extract important keywords with timestamp links
Interactive Transcript: Navigate through the transcript with keyword highlighting
GPU Acceleration: Utilize your GPU for faster processing
Caching: Save processing time by caching results

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Languages

TypeScript 60.5%

Python 27.6%

Rust 9.3%

Shell 2%

JavaScript 0.3%

Other 0.3%