Caption Creator
v11.2

Creator: MM744

About this model

Caption Creator is a portable Windows app for generating high-quality text from images. Create captions, tags, JSON, YAML, Illustrious prompts, or fully custom outputs for image datasets, LoRA training, AI prompting, and folder organization.

Run everything locally with built-in GGUF models, or connect your own vision model through LM Studio or Ollama. Your images stay on your computer.

Highlights:

  • Multiple output types: Captions, Tags, JSON, YAML, Illustrious, and Custom prompts.

  • Single or batch workflow: Process one image, many images, or queue multiple jobs.

  • Local-first generation: Use bundled models, LM Studio, or Ollama vision models.

  • Model management: Download, select, delete, load, and eject models from the app.

  • Professional workflow controls: Max words, trigger words, prompt enrichment, Low-VRAM mode, custom output folder, and original filename preservation.

  • Fast result actions: Copy output, open the run folder, or export the current run as a ZIP archive.

  • Modern desktop UI: Frameless dark interface with live status, progress, previews, gallery output, and an About panel.

How to Use:

  1. Download and unpack the latest release.

  2. Launch Caption Creator.exe.

  3. Open Model / VRAM Configuration.

    • Download a built-in model that matches your GPU, or

    • choose Custom (LM Studio) / Custom (Ollama) and select a running vision model.

  4. Choose Single Image or Batch Processing, then add images by clicking, dragging, or pasting in Single Image mode.

  5. Pick an output type: Captions, Tags, JSON, YAML, Illustrious, or Custom.

  6. Adjust optional settings, then click Generate or add the job to the Queue.

  7. Copy the result, open the output folder, or save the run as a ZIP archive.
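For readers curious what the Custom (Ollama) option in step 3 involves, the following is a minimal sketch of a captioning request to a locally running Ollama vision model. It does not describe Caption Creator's internals, which are not documented here. It assumes Ollama is listening on its default port (11434). The model name `llava` and the helper names `build_caption_request` and `send_caption_request` are illustrative choices, not anything taken from the app.

```python
import base64
import json
import urllib.request

# Default address of a local Ollama server (assumption: port not changed).
OLLAMA_GENERATE_URL = "http://localhost:11434/api/generate"


def build_caption_request(image_bytes: bytes, model: str, prompt: str) -> dict:
    """Build the JSON body for a non-streaming Ollama /api/generate call.

    Ollama expects images as a list of base64-encoded strings.
    """
    return {
        "model": model,
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for one complete JSON reply
    }


def send_caption_request(body: dict, url: str = OLLAMA_GENERATE_URL) -> str:
    """POST the body to Ollama and return the generated text.

    Hypothetical helper: requires a running Ollama server with the
    requested vision model already pulled.
    """
    request = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


# Example usage (needs a local image file and a running server):
#   with open("example.png", "rb") as f:
#       body = build_caption_request(f.read(), "llava", "Describe this image.")
#   print(send_caption_request(body))
```

Because the request goes to localhost, the image never leaves the machine, which matches the local-first behavior described above.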
