Commit Graph

60 Commits

Author SHA1 Message Date
4d3d8a2218 speed zones work now 2026-04-15 20:17:05 -06:00
b7a795f986 UI improvements, moved file name and moved buttons left 2026-04-15 19:54:39 -06:00
7479acd3ee forgot to add stuff 2026-04-15 18:02:25 -06:00
17874587a4 improved zone handling 2026-04-15 18:00:34 -06:00
84edddded8 removed electron 2026-04-15 17:40:27 -06:00
48d761c713 defaults to project folders; examining zones 2026-04-15 17:31:41 -06:00
024b9bd806 ai tools finished 2026-04-15 17:13:56 -06:00
d11e26cf2d improved tools for ai 2026-04-15 16:36:21 -06:00
4f90750497 volume panel; copilot instructions 2026-04-15 16:10:35 -06:00
0df967507f able to process audio with different model; new project button 2026-04-11 19:42:30 -06:00
b8ec396ebd able to modify trim zones 2026-04-11 19:13:04 -06:00
140b7a5319 fixed error 2026-04-09 01:50:19 -06:00
1d17a8f19a trying to fix bug 2026-04-09 01:36:28 -06:00
f9cd2bf579 gitignore cache 2026-04-08 21:19:52 -06:00
d80ff847d8 silence trimmer 2026-04-03 12:05:44 -06:00
8a7c94d594 delete key can remove zones 2026-04-03 11:38:58 -06:00
0237d685e5 able to drag edges of zones 2026-04-03 11:36:08 -06:00
585262c3e7 added cut and mute zones 2026-04-03 11:14:31 -06:00
d7bc6ea74d added split audio 2026-04-03 10:46:49 -06:00
f0568ed267 darker model name text 2026-04-03 10:46:26 -06:00
7c8c74d04d removed cutscript 2026-04-03 10:35:07 -06:00
addd87c45b Remove obsolete open-cutscript script 2026-04-03 10:34:04 -06:00
bb9ac53ae5 Remove CutScript submodule - can be cloned separately from https://github.com/DataAnts-AI/CutScript.git 2026-04-03 10:33:56 -06:00
c7445206cc added distil models 2026-04-03 10:25:48 -06:00
ea3f1d2b23 close sh;able to save/load projects 2026-03-30 18:36:41 -06:00
246d816f84 added close; fixed some issues 2026-03-28 15:09:56 -06:00
2ffc406b10 changed to python312 2026-03-28 12:26:45 -06:00
4a857d8cbf added api for ai; got backend working 2026-03-26 23:39:31 -06:00
164b2f87d4 got cpu based backend working; trying python/gpu solution bc faster probs 2026-03-26 00:58:57 -06:00
00ee076baa frontend changes 2026-03-25 01:41:40 -06:00
b4bcb8f3f2 i think i got step one working 2026-03-25 01:22:30 -06:00
4230ae6cb9 added features doc 2026-03-25 00:11:35 -06:00
c01db38eb3 update files 2026-03-24 23:56:08 -06:00
d134e4ab27 add CutScript submodule 2026-03-24 23:53:59 -06:00
a864b562ae initial commit 2026-03-24 23:53:59 -06:00
e5c47e31b3 Audio 2026-03-06 21:49:22 +05:30
78d34133ad Add image to README for visual enhancement
Added an image to the README to enhance visual appeal.
2026-03-03 14:45:36 -05:00
33cca5f552 Initial CutScript release - Open-source AI-powered text-based video editor
CutScript is a local-first, Descript-like video editor where you edit video by editing text.
Delete a word from the transcript and it's cut from the video.

Features:
- Word-level transcription with WhisperX
- Text-based video editing with undo/redo
- AI filler word removal (Ollama/OpenAI/Claude)
- AI clip creation for shorts
- Waveform timeline with virtualized transcript
- FFmpeg stream-copy (fast) and re-encode (4K) export
- Caption burn-in and sidecar SRT generation
- Studio Sound audio enhancement (DeepFilterNet)
- Keyboard shortcuts (J/K/L, Space, Delete, Ctrl+Z/S/E)
- Encrypted API key storage
- Project save/load (.aive files)

Architecture:
- Electron + React + Tailwind (frontend)
- FastAPI + Python (backend)
- WhisperX for transcription
- FFmpeg for video processing
- Multi-provider AI support

Performance optimizations:
- RAF-throttled time updates
- Zustand selectors for granular subscriptions
- Dual-canvas waveform rendering
- Virtualized transcript with react-virtuoso

Built on top of DataAnts-AI/VideoTranscriber, completely rewritten as a desktop application.

License: MIT
2026-03-03 06:31:04 -05:00
d1e1fedcae fix: Resolve issues #7, #8, #9 - moviepy, transformers, Whisper OOM
Issue #7: Handle moviepy 2.x removing verbose param from write_audiofile

Issue #8: Pin transformers<5.0.0 to fix summarization pipeline task registry

Issue #9: Add Whisper model memory warnings and OOM error handling
2026-03-03 02:10:52 -05:00
70c5d32413 feat: Add streaming Ollama support, model caching, and UI improvements
- Add streaming summarization via Ollama API (stream_summarize_with_ollama)

- Cache ML models with @st.cache_resource (diarization, NER, translation, Whisper)

- Add temp file cleanup for extracted audio

- Add system capabilities detection (FFmpeg, GPU info)

- Add get_video_duration utility

- Improve validation with FFmpeg check

- Rewrite app.py with streaming support and UI enhancements

- Clean up redundant comments and unused imports across all utils
2026-02-18 10:26:09 -05:00
ce398ae1d4 fix: Update moviepy import for v2.x compatibility (closes #6)
moviepy 2.x removed the moviepy.editor submodule. Import AudioFileClip directly from moviepy with a fallback for moviepy 1.x users. Also close the audio clip after writing to prevent resource leaks.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-13 00:15:07 -05:00
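
The import fallback and clip cleanup described in ce398ae1d4 could look roughly like the sketch below. It is not the code from that commit: the helper names `import_first_available` and `extract_audio` and the `clip_factory` parameter are hypothetical, introduced only for illustration. It assumes only what the message states, that moviepy 2.x exposes `AudioFileClip` at the package top level while 1.x provided it through `moviepy.editor`.

```python
import importlib


def import_first_available(candidates):
    """Return the first attribute importable from a list of (module, attribute) pairs."""
    errors = []
    for module_name, attr_name in candidates:
        try:
            module = importlib.import_module(module_name)
            return getattr(module, attr_name)
        except (ImportError, AttributeError) as exc:
            errors.append(f"{module_name}.{attr_name}: {exc}")
    raise ImportError("none of the candidates could be imported: " + "; ".join(errors))


# Try the moviepy 2.x location first, then fall back to the 1.x submodule.
MOVIEPY_CANDIDATES = [
    ("moviepy", "AudioFileClip"),
    ("moviepy.editor", "AudioFileClip"),
]


def extract_audio(video_path, audio_path, clip_factory=None):
    """Write the audio track of video_path to audio_path, always closing the clip."""
    if clip_factory is None:
        # Resolved lazily so that importing this module does not require moviepy.
        clip_factory = import_first_available(MOVIEPY_CANDIDATES)
    clip = clip_factory(video_path)
    try:
        clip.write_audiofile(audio_path)
    finally:
        # Closing releases the underlying reader process and file handles.
        clip.close()
    return audio_path
```

Resolving the class lazily keeps the version check in one place, and the `try`/`finally` guarantees the clip is closed even when writing fails.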
168bf5f573 fix: Free up disk space in GitHub Actions runner
- Add disk cleanup step to remove unused packages (~30GB freed)
- Remove arm64 platform build (reduces disk usage significantly)
- Add cleanup step between CPU and GPU builds
- Fixes 'No space left on device' error during Docker build
2026-01-05 17:34:07 -05:00
efee0b0abe fix: Resolve protobuf dependency conflict for pyannote.audio 4.x
- Update protobuf from <5.0.0 to >=5.0.0 (required by opentelemetry-proto)
- Update streamlit minimum version to >=1.30.0 (protobuf 5.x compatible)
- Update regular Dockerfile to match GPU dockerfile structure
- Install PyTorch CPU version in regular Dockerfile for consistency
2026-01-05 11:30:56 -05:00
4dd3c7600e fix: Remove torch from requirements.txt and use flexible versions
- Remove torch/torchaudio/torchvision from requirements.txt (installed separately in Docker)
- Use >= instead of == for most packages to avoid version conflicts
- Install numpy before other requirements
- Add setuptools and wheel to pip upgrade step
2026-01-05 11:25:12 -05:00
78e9df31e6 fix: Use flexible PyTorch versions and fix Docker build order
- Don't pin exact torch/torchaudio/torchvision versions (use >=2.1.0)
- Install CUDA PyTorch first in Docker before other requirements
- Upgrade pip before installations to avoid compatibility issues
- Let pip resolve latest compatible versions from cu118 index
2026-01-05 10:50:20 -05:00
c5cc7c2969 fix: Update PyTorch/torchaudio to 2.1.0 for SpeechBrain 1.0 diarization compatibility
- Update torch from 2.0.1 to 2.1.0
- Update torchaudio from 2.0.2 to 2.1.0 (fixes 'NoneType' object has no attribute 'eval' error)
- Update torchvision from 0.15.2 to 0.16.0
- Update pytorch-lightning from 2.0.6 to 2.1.0
- Add explicit speechbrain==1.0.0 dependency
- Update transformers and tokenizers for compatibility
- Update protobuf version constraint

Fixes diarization error: speechbrain.pretrained was deprecated and redirected to speechbrain.inference in SpeechBrain 1.0, but required torchaudio >= 2.1.0
2026-01-05 09:57:20 -05:00
81f40726e3 Refactor keyword extraction logic to convert stopwords to a list for compatibility with scikit-learn; add checks for empty keywords list in timestamp extraction function. 2025-09-10 23:33:57 -04:00
f04853eba9 Enhance file selection to support additional audio format (M4A) and update README.md to reflect new supported formats for video and audio recordings. 2025-08-05 11:18:36 -04:00
3346b0df0f Refactor Dockerfile and requirements.txt for improved dependency management; update README.md with clearer installation options and instructions for prebuilt images. 2025-07-23 14:45:19 -04:00
dcf13c1423 Enhance README.md with Docker installation instructions and update Ollama API endpoint to be configurable via environment variable. 2025-07-17 00:05:23 -04:00
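
A configurable Ollama endpoint, as mentioned in dcf13c1423, is commonly read from the environment with a local default. The sketch below is illustrative only: the variable name `OLLAMA_API_URL` and the helper `get_ollama_api_url` are assumptions, since the commit message does not name them. The default shown is Ollama's standard local port and its `/api/generate` route.

```python
import os

# Ollama listens on port 11434 by default; /api/generate is its completion route.
DEFAULT_OLLAMA_API_URL = "http://localhost:11434/api/generate"


def get_ollama_api_url(env=None):
    """Return the Ollama endpoint from the environment, or the local default."""
    env = os.environ if env is None else env
    # OLLAMA_API_URL is a hypothetical variable name used for illustration.
    url = env.get("OLLAMA_API_URL", "").strip()
    # Treat an empty or whitespace-only value as unset.
    return url.rstrip("/") if url else DEFAULT_OLLAMA_API_URL
```

In a Docker setup the variable would typically point at another container or the host, for example `http://host.docker.internal:11434/api/generate`.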