Caption trainer

caption-trainer

Tools for training image captioning and vision-language models. The project started as a BLIP fine-tuning setup and has grown into a broader toolkit covering inference, agentic caption pipelines, and iterative caption improvement.

caption-train