YT-Study

Turn YouTube learning into structured study material.

Convert videos, playlists, and URL batches into clean Markdown notes with chapter awareness, transcript fallback logic, and LLM-powered organization.

Quick Start • Features • How-It-Works • Configuration • Contributing

At A Glance

Category	Details
Primary Use	Generate reusable study notes from YouTube
Inputs	Single video URL, playlist URL, `urls.txt` batch file
Output	Markdown notes (single file or chapter-based files)
Core Stack	Python, Typer, Rich, LiteLLM, youtube-transcript-api, pytubefix
Runtime Style	Async orchestration with concurrent playlist workers
Quality	Tests, Ruff, MyPy, CI workflows

Documentation Map

Use the wiki for detailed guides:

Vision

Most high-quality educational content is now video-first, but video is a weak medium for revision:

Slow to scan.
Hard to search deeply.
Difficult to convert into repeatable study systems.

yt-study exists to close that gap.

The long-term vision is to make long-form video learning operational: not just consumable once, but reusable as a durable knowledge asset. Instead of repeatedly scrubbing timelines, you get structured notes that can be reviewed, annotated, shared, and versioned.

Design principles:

Fidelity over generic summarization Preserve critical terminology, workflow steps, and technical context.
Low-friction daily usage One command flow for setup, processing, and iteration.
Scalable processing Handle long transcripts and multi-video queues with concurrency.
Reliability under real-world conditions Fallbacks, retries, and strict engineering standards over demo-only flows.

Features

Content Ingestion

Supports video URLs, playlist URLs, and batch files (yt-study process urls.txt).
Parses common YouTube formats: watch, youtu.be, embed, shorts, and playlist links.

Transcript Reliability

Prioritizes manual transcripts in preferred language.
Falls back to auto-generated transcripts.
Can select other languages and translate when appropriate.
Includes retry behavior and IP-block detection paths.

Note Generation Quality

Uses chapter-based note generation for long standalone videos with chapters.
Uses chunked generation with overlap for large transcript contexts.
Outputs clean Markdown suitable for Obsidian, Notion import, Git docs, and revision sets.

Pipeline + UX

Async orchestration for better throughput.
Concurrent worker processing for playlists.
Rich live dashboard for status, worker activity, and run summary.

Engineering Quality

Strict typing with MyPy.
Formatting/linting with Ruff.
Automated validation through CI and PR gate workflows.

Quick Start

1. Install

pip install yt-study

2. First-Time Setup

yt-study setup

3. Process Content

Single video:

yt-study process "https://youtube.com/watch?v=VIDEO_ID"

Playlist:

yt-study process "https://youtube.com/playlist?list=PLAYLIST_ID"

Batch file:

yt-study process urls.txt

4. Useful CLI Commands

yt-study --help
yt-study config-path
yt-study version

How It Works

Input URL/File
  -> URL parsing (video/playlist detection)
  -> metadata + transcript retrieval
  -> generation strategy selection
     - chapter mode (long standalone video + chapters)
     - chunked mode (large transcript)
  -> LLM generation (provider/model from config)
  -> Markdown write to output directory

Command Reference

yt-study setup                # interactive first-run configuration
yt-study process "URL"        # process a single video or playlist URL
yt-study process urls.txt     # process a text file of URLs
yt-study config-path          # show ~/.yt-study/config.env location
yt-study version              # print installed CLI version
yt-study --help               # full command/options help

Usage Patterns

Course Playlist Capture

Use yt-study after finishing a lecture playlist to convert each session into searchable notes.

Research Queue

Maintain urls.txt as a personal watch-and-learn queue and generate notes in batches.

Team Knowledge Sync

Run on shared technical videos and commit generated Markdown into an internal docs repo.

Troubleshooting

YouTube IP Block Detected

When YouTube rate-limits your IP, transcript/metadata requests can fail.

Try:

Wait and retry later.
Reduce MAX_CONCURRENT_VIDEOS in ~/.yt-study/config.env (for example 1 or 2).
Retry from a different network/IP.

Missing API Key For Model

If the selected model is missing its provider key, processing will stop early.

Fix:

Run yt-study setup and reconfigure the provider/model.
Verify expected key in ~/.yt-study/config.env.
Use yt-study config-path to confirm the active config location.

Transcript Not Available

Some videos have disabled transcripts or no usable subtitles.

What to try:

Use -l en or provide preferred languages with -l.
Confirm the video has captions available on YouTube.
Try another video if captions are disabled by the creator.

Processing Fails For Some Playlist Items

Large playlists can have mixed availability (private/deleted/restricted videos).

What happens:

yt-study continues processing and shows failed items in summary.

FAQ

Is yt-study free?

The CLI is open-source. LLM provider usage may cost money depending on your API plan.

Which model should I start with?

Start with a fast model from your preferred provider, then move to higher-quality models for final notes.

Does yt-study support batch processing?

Yes. Provide a text file with one URL per line:

yt-study process urls.txt

Where are logs stored?

Session log files are written to:

~/.yt-study/logs/

If home is not writable, logs fall back to a local ./logs directory.

Where can I read extended docs?

Use the wiki pages linked in Documentation Map, including FAQ.

Configuration

Config file location:

~/.yt-study/config.env

Common settings:

MODEL (example: gemini/gemini-2.5-pro)
MAX_CONCURRENT_VIDEOS (default: 5)
Provider keys such as GEMINI_API_KEY, OPENAI_API_KEY, ANTHROPIC_API_KEY
Generation options including temperature and token controls

Configuration loading behavior:

Reads ~/.yt-study/config.env first.
Applies environment variable overrides after file load.
Syncs provider API keys into process environment for downstream SDK usage.

Full reference: Configuration Wiki

Output Organization

Typical output shape:

output/
  Video Title/
    Video Title.md
  Long Video With Chapters/
    01_Intro.md
    02_Core Concept.md
    03_Implementation.md

Developer Experience

make sync      # install locked dependencies
make ci        # CI-equivalent checks
make all       # local full pass (with autofix)
make help      # list all make targets

Project layout:

src/yt_study/
  cli.py
  config.py
  setup_wizard.py
  llm/
  pipeline/
  prompts/
  ui/
  youtube/

Contributing

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.github		.github
scripts/hooks		scripts/hooks
src/yt_study		src/yt_study
tests		tests
wiki @ 6f3149b		wiki @ 6f3149b
.env.example		.env.example
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
AGENTS.md		AGENTS.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

License

whoisjayd/yt-study

Folders and files

Latest commit

History

Repository files navigation

YT-Study

Turn YouTube learning into structured study material.

At A Glance

Documentation Map

Vision

Features

Content Ingestion

Transcript Reliability

Note Generation Quality

Pipeline + UX

Engineering Quality

Quick Start

1. Install

2. First-Time Setup

3. Process Content

4. Useful CLI Commands

How It Works

Command Reference

Usage Patterns

Course Playlist Capture

Research Queue

Team Knowledge Sync

Troubleshooting

YouTube IP Block Detected

Missing API Key For Model

Transcript Not Available

Processing Fails For Some Playlist Items

FAQ

Is yt-study free?

Which model should I start with?

Does yt-study support batch processing?

Where are logs stored?

Where can I read extended docs?

Configuration

Output Organization

Developer Experience

Contributing

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 7

Contributors 3

Uh oh!

Languages