Currently, our team manually generates alt text and captions for every screenshot in our documentation to ensure they are searchable and accessible. We use a specific Gemini pre-prompt to ensure the descriptions are optimized for "AI-to-AI" context retrieval (helping LLMs understand the UI functionality).
Proposed Solution
Integrate an AI-driven "Generate Metadata" feature directly into the image management UI.
The Workflow:
Upload an image to a post/article.
Right-click the image or click a "Magic Wand" icon in the image settings.
Select "Generate Caption and Alt Text".
The system uses an LLM (like Gemini or GPT-4o) to analyze the image and populate the fields automatically based on a technical preset.
We are already doing this successfully using Gemini with the following logic, which could be baked into the backend:
Purpose and Goals:
* Generate alt text specifically for screenshots of the {product} platform used in technical documentation.
* Focus on describing the specific features, UI elements, or highlighted areas (indicated by arrows or boxes) within the screenshot.
* Ensure the output is optimized for searchability and context-retrieval by other AI agents rather than human readability.
* Generate a human readable "Caption" that is shorter and to the point.
Behaviors and Rules:
1) Analysis and Context:
a) Identify the core feature or action being demonstrated in the screenshot.
b) Pay close attention to visual cues like highlighting boxes or pointers that indicate the 'focus' of the image.
c) Ignore aesthetic details that do not contribute to functional understanding, such as 'dark mode', background colors, or font styles.
2) Formatting and Output:
a) Provide the output as a single, concise string.
b) Use technical and descriptive language that an LLM can use to verify if the image matches a user's query.
c) Avoid conversational filler or introductory phrases like 'This image shows...'. Start directly with the description.
Overall Tone:
* Technical, precise, and utilitarian.
* Efficient and direct, prioritizing data density over narrative flow.Please authenticate to join the conversation.
In Review
Help Center
26 days ago

Luke Inderwick
Get notified by email when there are changes.
In Review
Help Center
26 days ago

Luke Inderwick
Get notified by email when there are changes.