AI video analysis tool: Extracts audio transcripts (Whisper) & visual subtitles (Vision LLM), then generates summaries using LLMs (Gemini/Qwen). Features multi-modal processing and speed optimizations. - View it on GitHub
Star
7
Rank
1977895