by takereshui
Extracts text from video platforms and audio files using multiple ASR services, including OpenAI Whisper, Bilibili's Bcut ASR, and ByteDance's JianYing ASR. It supports downloads from YouTube, Bilibili, TikTok, and other platforms, and offers configurable transcription backends, word-level timestamps, and caching for content analysis and subtitle generation.