Skip to main content

Prompt Engineering

Introduction


Creating an image that matches a certain expectation is not as trivial as one might expect. The solution space of the AI model is insanely high. To limit this a well engineered prompt can be used to steer the AI in the right direction. This practice is called Prompt Engineering.

Workflow

To get as close to the imagined concept and as efficiently as possible, the following workflow is proposed:

  1. Database selection:
    1. Checkpoint selection: Each model is trained for specific output. Use the right one!
    2. Add additional LoRa files to improve specific features (download at: https://civitai.com/models) 
  2. Prompt creation:
    1. Use prompt perfector to get to a good descriptive prompt
    2. Use img2txt (CLIP) to retrieve a prompt from an example image that can be used to get detailed prompts for specific features
    3. Use a fixed seed, in order to same results for the same prompt.
  3. Choose settings:
    1. Choose proper sampling method. Each method has it's own benefits and downsides.
    2. Use relative low resolution, but at the desired aspect ratio. Exact resolution is depending on the used database models (e.g. 512x512, 768x768, 1024x1024 px).
  4. Image generation:
    1. Iterate the prompt until the concept is mainly displayed.
  5. Finalize the details:
    1. Move to the img2img tab to use inpainting to improve certain aspects
  6. Upscale image:
    1. Move to Extras tab and set higher resolution and iterations. Fine-tune the prompt until all details are correct.
  7. Save image and save prompt & settings.

Prompt perfecters

Using ChatGPT to optimize the prompt is efficient, because it is not always clear how the AI will respond to a certain prompt. There are templates available to help kickstart the prompt perfectioning via ChatGPT.

Prompt guides

Prompt resources/examples