Overview

LLM Actions in Agent Studio give you built-in access to large language model (LLM) capabilities directly within your Compound Actions and Conversational Processes. These actions support tasks such as summarization, reasoning, classification, data extraction, and content generation, so you can build workflows without custom integrations.

This documentation focuses on two key LLM Actions: generate_text_action and generate_structured_value_action. These actions are designed to help you process unstructured data, generate insights, and structure outputs efficiently.

generate_text_action

Description

The generate_text_action invokes an LLM to produce free-form text output based on user-provided input. This action is ideal for tasks requiring natural language generation, such as summarizing documents, generating responses, or performing step-by-step reasoning. Use this action when you need unstructured text results, like drafting emails, explaining concepts, or brainstorming ideas.

Input Parameters

Field	Type	Required	Description
`system_prompt`	`string`	❌	Defines the model’s behavior or instructions. For example, “Act as a helpful assistant that summarizes technical articles.”
`user_input`	`string`	✅	The primary context or query for the LLM to process.
`image`	`file`	❌	An image file to include as visual input for the LLM. Must be a `File` object retrieved via a File Slot in a Conversational Process. Supported formats: PNG, JPEG/JPG, WEBP, non-animated GIF. See the Model Reference for image-capable models.
`file`	`file`	❌	A document file to include as input for the LLM. Must be a `File` object retrieved via a File Slot in a Conversational Process. Only PDF (`.pdf`) files are supported. See the File Input Limitations callout for size constraints, and the Model Reference for compatible models.
`model`	`string`	❌	Specifies the LLM model to use. See the Model Reference for available options. Defaults to `gpt-4o-mini-2024-07-18`.
`temperature`	`number`	❌	Controls the randomness of the output. Higher values make the output more random; lower values make it more focused and deterministic.
`reasoning_effort`	`string`	❌	Optional reasoning effort argument. Can be set to one of `"minimal"` (only `gpt-5` models), `"low"`, `"medium"`, or `"high"`. Must be left empty for non-reasoning models such as `gpt-4.1`.

Output

Field	Type	Description
`generated_output`	`string`	The LLM-generated text response.

Usage Examples

Here are practical examples demonstrating various LLM abilities. Each includes a sample request schema for integration into a Compound Action.

Example 1: Text Summarization

Summarize a lengthy article or user query into a concise overview.

YAML - Compound Action

- action:
    action_name: mw.generate_text_action
    input_args:
      system_prompt: '''Summarize the following text in 3-5 bullet points, focusing on key takeaways.'''
      user_input: data.article_content  # e.g., a long blog post fetched from an API
      model: '''gpt-4o-mini'''
      temperature: 0.7
    output_key: summary_output

Conversational Process

system_prompt: '''Summarize the following text in 3-5 bullet points, focusing on key takeaways.'''
user_input: data.article_content  # e.g., a long blog post fetched from an API
model: '''gpt-4o-mini'''
temperature: 0.7

Example 2: Content Generation

Generate creative or instructional content, such as drafting a user email.

YAML - Compound Action

- action:
    action_name: mw.generate_text_action
    input_args:
      system_prompt: '''Write a professional email response based on the user's complaint.'''
      user_input: data.user_complaint  # e.g., "My order is delayed by two weeks."
    output_key: email_draft

Conversational Process

system_prompt: '''Write a professional email response based on the user's complaint.'''
user_input: data.user_complaint  # e.g., "My order is delayed by two weeks."

Example 3: Step-by-Step Reasoning

Guide the LLM through logical reasoning for problem-solving.

YAML - Compound Action

- action:
    action_name: mw.generate_text_action
    input_args:
      system_prompt: '''Solve the problem step by step, explaining your reasoning.'''
      user_input: '''What is the next number in the sequence: 2, 4, 8, 16?'''
      reasoning_effort: '''high'''
      model: '''gpt-5-2025-08-07'''
    output_key: reasoning_output

Conversational Process

system_prompt: '''Solve the problem step by step, explaining your reasoning.'''
user_input: '''What is the next number in the sequence: 2, 4, 8, 16?'''
reasoning_effort: '''high'''
model: '''gpt-5-2025-08-07'''

Example 4: Image OCR

Extract text from an uploaded image, such as a receipt. The image parameter maps to a File object collected via a File Slot.

YAML - Compound Action

- action:
    action_name: mw.generate_text_action
    input_args:
      system_prompt: '''Extract all text from the provided image exactly as it appears.'''
      user_input: '''Extract the text from this receipt image.'''
      image: data.receipt
      model: '''gpt-4o'''
    output_key: ocr_output

Conversational Process

system_prompt: '''Extract all text from the provided image exactly as it appears.'''
user_input: '''Extract the text from this receipt image.'''
image: data.receipt
model: '''gpt-4o'''

Example 5: Document Q&A

Answer a question against an uploaded document, such as a PDF policy. The file parameter maps to a File object collected via a File Slot.

YAML - Compound Action

- action:
    action_name: mw.generate_text_action
    input_args:
      system_prompt: '''Answer the user's question using only the content of the attached document.'''
      user_input: '''What is the reimbursement limit described in this policy?'''
      file: data.policy_document
      model: '''gpt-4o'''
    output_key: document_answer

Conversational Process

system_prompt: '''Answer the user's question using only the content of the attached document.'''
user_input: '''What is the reimbursement limit described in this policy?'''
file: data.policy_document
model: '''gpt-4o'''

generate_structured_value_action

Description

The generate_structured_value_action calls an LLM to extract or generate data in a predefined structured format (JSON schema). This is particularly useful for classification, entity extraction, or transforming unstructured input into queryable data.

Apply this action for tasks where output consistency is critical, such as tagging content, extracting key-value pairs, or categorizing user inputs.

Input Parameters

Field	Type	Required	Description
`payload`	`object`	✅	The data or text to analyze.
`output_schema`	`object`	✅	JSON Schema defining the expected output structure.
`system_prompt`	`string`	❌	Defines the model’s behavior or instructions. For example, “Act as a helpful assistant that summarizes technical articles.”
`image`	`file`	❌	An image file to include as visual input for the LLM. Must be a `File` object retrieved via a File Slot in a Conversational Process. Supported formats: PNG, JPEG/JPG, WEBP, non-animated GIF. See the Model Reference for image-capable models.
`file`	`file`	❌	A document file to include as input for the LLM. Must be a `File` object retrieved via a File Slot in a Conversational Process. Only PDF (`.pdf`) files are supported. See the File Input Limitations callout for size constraints, and the Model Reference for compatible models.
`model`	`string`	❌	Specifies the LLM model to use. Defaults to `"gpt-4o-mini-2024-07-18"`. IMPORTANT: This action is only compatible with `gpt-4o-mini-2024-07-18` and later and `gpt-4o-2024-08-06` and later.
`strict`	`string`	❌	Enforces schema adherence. Can be either `true` or `false`; defaults to `false`.
`output_schema_name`	`string`	❌	LLM-facing name for the schema. Defaults to `extracted_value`.
`output_schema_description`	`string`	❌	Description of the schema for the LLM.
`reasoning_effort`	`string`	❌	Optional reasoning effort argument. Can be set to one of `"minimal"` (only `gpt-5` models), `"low"`, `"medium"`, or `"high"`. Must be left empty for non-reasoning models such as `gpt-4.1`.

`additionalProperties: false` must always be set in objects.

`additionalProperties` controls whether an object may contain keys and values that are not defined in the JSON Schema. Setting it to `false` disallows any such extra keys.
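Because the requirement applies to every object in a schema, including nested ones, it can help to lint a schema before using it. The following is a minimal Python sketch, not part of Agent Studio: the function name `objects_missing_additional_properties` and the `line_items` schema are hypothetical, and the walk only covers the `object` and `array` keywords used in this page's examples.

```python
from typing import Any, List


def objects_missing_additional_properties(schema: Any, path: str = "$") -> List[str]:
    """Return paths of object schemas that do not set additionalProperties to False."""
    missing: List[str] = []
    if not isinstance(schema, dict):
        return missing
    if schema.get("type") == "object":
        # Only an explicit False satisfies the rule; a missing key does not.
        if schema.get("additionalProperties") is not False:
            missing.append(path)
        for name, sub in schema.get("properties", {}).items():
            missing.extend(
                objects_missing_additional_properties(sub, f"{path}.properties.{name}")
            )
    elif schema.get("type") == "array":
        missing.extend(
            objects_missing_additional_properties(schema.get("items"), f"{path}.items")
        )
    return missing


# Hypothetical schema: the top-level object is fine, the nested item object is not.
schema = {
    "type": "object",
    "properties": {
        "merchant": {"type": "string"},
        "line_items": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {"name": {"type": "string"}},
                "required": ["name"],
            },
        },
    },
    "required": ["merchant", "line_items"],
    "additionalProperties": False,
}

print(objects_missing_additional_properties(schema))
# ['$.properties.line_items.items']
```

An empty list means every object in the schema sets `additionalProperties` to `false`.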

Output

Field	Type	Description
`generated_output`	`object`	Structured data matching the provided schema.
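To make "matching the provided schema" concrete, here is a small Python sketch that checks a value against the subset of JSON Schema used in the examples on this page (`object`, `array`, `string`, `number`, `enum`, `required`, `additionalProperties`). The `validate` helper is hypothetical and deliberately simplified; it is not how the action itself enforces the schema, and a full JSON Schema validator covers many more keywords.

```python
from typing import Any, List


def validate(value: Any, schema: dict, path: str = "$") -> List[str]:
    """Return a list of mismatches between value and a small JSON Schema subset."""
    errors: List[str] = []
    kind = schema.get("type")
    if kind == "object":
        if not isinstance(value, dict):
            return [f"{path}: expected object"]
        props = schema.get("properties", {})
        for key in schema.get("required", []):
            if key not in value:
                errors.append(f"{path}: missing required key '{key}'")
        if schema.get("additionalProperties") is False:
            for key in value:
                if key not in props:
                    errors.append(f"{path}: unexpected key '{key}'")
        for key, sub in props.items():
            if key in value:
                errors.extend(validate(value[key], sub, f"{path}.{key}"))
    elif kind == "array":
        if not isinstance(value, list):
            return [f"{path}: expected array"]
        for i, item in enumerate(value):
            errors.extend(validate(item, schema.get("items", {}), f"{path}[{i}]"))
    elif kind == "string":
        if not isinstance(value, str):
            errors.append(f"{path}: expected string")
    elif kind == "number":
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            errors.append(f"{path}: expected number")
    if "enum" in schema and value not in schema["enum"]:
        errors.append(f"{path}: value not in enum")
    return errors


# Same schema as the sentiment classification example on this page.
sentiment_schema = {
    "type": "object",
    "properties": {
        "sentiment": {"type": "string", "enum": ["positive", "negative", "neutral"]},
        "confidence": {"type": "number"},
    },
    "required": ["sentiment", "confidence"],
    "additionalProperties": False,
}

print(validate({"sentiment": "positive", "confidence": 0.85}, sentiment_schema))
# []
print(validate({"sentiment": "mixed", "confidence": 0.85, "note": "n/a"}, sentiment_schema))
# ["$: unexpected key 'note'", '$.sentiment: value not in enum']
```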

Usage Examples

The examples below illustrate classification and extraction tasks. Each includes a sample request schema for integration into a Compound Action or Conversational Process.

Example 1: Topic Classification

Classify a research abstract into predefined topics.

YAML - Compound Action

- action:
    action_name: mw.generate_structured_value_action
    input_args:
      payload: data.research_paper_abstract
      system_prompt: '''Given a research paper abstract and a list of topic options, output up to 5 topics that accurately apply to the paper.'''
      # data.topic_tag_options is a list of allowed topics, e.g., ["AI", "ML", "NLP"]
      output_schema: >-
        {
            "type": "object",
            "properties": {
                "topic_tags": {
                    "type": "array",
                    "items": {
                        "type": "string",
                        "enum": data.topic_tag_options
                    }
                }
            },
            "required": ["topic_tags"],
            "additionalProperties": false
        }
      strict: true
    output_key: classified_topics

Conversational Process

payload: data.research_paper_abstract
system_prompt: '''Given a research paper abstract and a list of topic options, output up to 5 topics that accurately apply to the paper.'''
# data.topic_tag_options is a list of allowed topics, e.g., ["AI", "ML", "NLP"]
output_schema: >-
    {
        "type": "object",
        "properties": {
            "topic_tags": {
                "type": "array",
                "items": {
                    "type": "string",
                    "enum": data.topic_tag_options
                }
            }
        },
        "required": ["topic_tags"],
        "additionalProperties": false
    }
strict: true

Expected Output

generated_output: {
  "topic_tags": ["LLM Capabilities", "Reinforcement Learning (RL)", "Reasoning"]
}

Example 2: Entity Extraction

Extract named entities like names, dates, and locations from text.

YAML - Compound Action

- action:
    action_name: mw.generate_structured_value_action
    input_args:
      payload: data.user_message  # e.g., "John Doe will arrive in New York on October 15, 2025."
      system_prompt: '''Extract entities such as persons, locations, and dates from the text.'''
      output_schema: >-
        {
            "type": "object",
            "properties": {
                "persons": {"type": "array", "items": {"type": "string"}},
                "locations": {"type": "array", "items": {"type": "string"}},
                "dates": {"type": "array", "items": {"type": "string"}}
            },
            "required": ["persons", "locations", "dates"],
            "additionalProperties": false
        }
      reasoning_effort: '''low'''
      model: '''gpt-5-2025-08-07'''
    output_key: extracted_entities

Conversational Process

payload: data.user_message  # e.g., "John Doe will arrive in New York on October 15, 2025."
system_prompt: '''Extract entities such as persons, locations, and dates from the text.'''
output_schema: >-
    {
        "type": "object",
        "properties": {
            "persons": {"type": "array", "items": {"type": "string"}},
            "locations": {"type": "array", "items": {"type": "string"}},
            "dates": {"type": "array", "items": {"type": "string"}}
        },
        "required": ["persons", "locations", "dates"],
        "additionalProperties": false
    }
reasoning_effort: '''low'''
model: '''gpt-5-2025-08-07'''

Expected Output

generated_output: {
  "persons": ["John Doe"],
  "locations": ["New York"],
  "dates": ["October 15, 2025"]
}

Example 3: Sentiment Classification

Classify text sentiment with confidence scores.

YAML - Compound Action

- action:
    action_name: mw.generate_structured_value_action
    input_args:
      payload: data.customer_review
      system_prompt: '''Analyze the sentiment of the review and output the category with a confidence score.'''
      output_schema: >-
        {
            "type": "object",
            "properties": {
                "sentiment": {
                    "type": "string",
                    "enum": ["positive", "negative", "neutral"]
                },
                "confidence": {"type": "number"}
            },
            "required": ["sentiment", "confidence"],
            "additionalProperties": false
        }
    output_key: sentiment_analysis

Conversational Process

payload: data.customer_review
system_prompt: '''Analyze the sentiment of the review and output the category with a confidence score.'''
output_schema: >-
    {
        "type": "object",
        "properties": {
            "sentiment": {
                "type": "string",
                "enum": ["positive", "negative", "neutral"]
            },
            "confidence": {"type": "number"}
        },
        "required": ["sentiment", "confidence"],
        "additionalProperties": false
    }

Expected Output

generated_output: {
  "sentiment": "positive",
  "confidence": 0.85
}

Example 4: Structured Extraction from an Image

Extract structured fields from an uploaded image, such as a receipt. The image parameter maps to a File object collected via a File Slot.

YAML - Compound Action

- action:
    action_name: mw.generate_structured_value_action
    input_args:
      payload: '''Extract the merchant, total amount, and date from this receipt.'''
      image: data.receipt
      system_prompt: '''Extract structured expense details from the provided receipt image.'''
      output_schema: >-
        {
            "type": "object",
            "properties": {
                "merchant": {"type": "string"},
                "total_amount": {"type": "number"},
                "date": {"type": "string"}
            },
            "required": ["merchant", "total_amount", "date"],
            "additionalProperties": false
        }
      model: '''gpt-4o'''
    output_key: receipt_details

Conversational Process

payload: '''Extract the merchant, total amount, and date from this receipt.'''
image: data.receipt
system_prompt: '''Extract structured expense details from the provided receipt image.'''
output_schema: >-
    {
        "type": "object",
        "properties": {
            "merchant": {"type": "string"},
            "total_amount": {"type": "number"},
            "date": {"type": "string"}
        },
        "required": ["merchant", "total_amount", "date"],
        "additionalProperties": false
    }
model: '''gpt-4o'''

Expected Output

generated_output: {
  "merchant": "Blue Bottle Coffee",
  "total_amount": 12.50,
  "date": "2025-10-15"
}

Example 5: Structured Extraction from a Document

Extract structured fields from an uploaded document, such as a PDF invoice. The file parameter maps to a File object collected via a File Slot.

YAML - Compound Action

- action:
    action_name: mw.generate_structured_value_action
    input_args:
      payload: '''Extract the invoice number, vendor, and amount due from this invoice.'''
      file: data.invoice_document
      system_prompt: '''Extract structured invoice details from the attached document.'''
      output_schema: >-
        {
            "type": "object",
            "properties": {
                "invoice_number": {"type": "string"},
                "vendor": {"type": "string"},
                "amount_due": {"type": "number"}
            },
            "required": ["invoice_number", "vendor", "amount_due"],
            "additionalProperties": false
        }
      model: '''gpt-4o'''
    output_key: invoice_details

Conversational Process

payload: '''Extract the invoice number, vendor, and amount due from this invoice.'''
file: data.invoice_document
system_prompt: '''Extract structured invoice details from the attached document.'''
output_schema: >-
    {
        "type": "object",
        "properties": {
            "invoice_number": {"type": "string"},
            "vendor": {"type": "string"},
            "amount_due": {"type": "number"}
        },
        "required": ["invoice_number", "vendor", "amount_due"],
        "additionalProperties": false
    }
model: '''gpt-4o'''

Expected Output

generated_output: {
  "invoice_number": "INV-2025-0042",
  "vendor": "Acme Supplies Inc.",
  "amount_due": 1450.00
}

Model Reference

Model	Context	Max Output	Reasoning Effort	Live Search	Image & File Input	OpenAI Direct US	OpenAI Direct EU	Azure OpenAI US	Azure OpenAI EU	Azure OpenAI CA	Azure OpenAI AU	Azure OpenAI Gov
gpt-5.5-2026-04-23	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5.5	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5.4-2026-03-05	400K	128K	✓	—	✓	✓	✓	✓	—	—	—	—
gpt-5.4	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5.4-mini-2026-03-17	400K	128K	✓	—	✓	✓	✓	✓	—	—	—	—
gpt-5.4-mini	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5.4-nano-2026-03-17	400K	128K	✓	—	✓	✓	✓	✓	—	—	—	—
gpt-5.4-nano	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5.2-2025-12-11	400K	128K	✓	—	✓	✓	✓	✓	—	—	—	—
gpt-5.2	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5.1-2025-11-13	400K	128K	✓	—	✓	✓	✓	✓	✓	—	—	✓
gpt-5.1-chat-latest	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5.1	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5-2025-08-07	400K	128K	✓	—	✓	✓	✓	✓	✓	✓	✓	✓
gpt-5	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5-mini-2025-08-07	400K	128K	✓	—	✓	✓	✓	✓	✓	✓	✓	✓
gpt-5-mini	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-5-nano-2025-08-07	400K	128K	✓	—	✓	✓	✓	✓	✓	✓	✓	✓
gpt-5-nano	400K	128K	✓	—	✓	✓	✓	—	—	—	—	—
o4-mini-2025-04-16	200K	100K	✓	—	✓	✓	✓	✓	✓	✓	✓	✓
o4-mini	200K	100K	✓	—	✓	✓	✓	—	—	—	—	—
gpt-4.1-2025-04-14	1M	32K	—	—	✓	✓	✓	✓	✓	✓	✓	✓
gpt-4.1	1M	32K	—	—	✓	✓	✓	—	—	—	—	—
gpt-4.1-mini-2025-04-14	1M	32K	—	—	✓	✓	✓	✓	✓	✓	✓	✓
gpt-4.1-mini	1M	32K	—	—	✓	✓	✓	—	—	—	—	—
gpt-4.1-nano-2025-04-14	1M	32K	—	—	✓	✓	✓	✓	✓	✓	✓	✓
gpt-4.1-nano	1M	32K	—	—	✓	✓	✓	—	—	—	—	—
o3-2025-04-16	200K	100K	—	—	✓	✓	✓	✓	✓	✓	✓	✓
o3	200K	100K	—	—	✓	✓	✓	—	—	—	—	—
o3-mini-2025-01-31	200K	100K	—	—	—	✓	✓	✓	✓	✓	✓	✓
o3-mini	200K	100K	—	—	—	✓	✓	—	—	—	—	—
o1-2024-12-17	200K	100K	—	—	✓	✓	✓	✓	✓	✓	✓	✓
o1	200K	100K	—	—	✓	✓	✓	—	—	—	—	—
gpt-4o-2024-11-20	128K	16K	—	—	✓	✓	✓	✓	✓	✓	✓	✓
gpt-4o	128K	16K	—	—	✓	✓	✓	—	—	—	—	—
gpt-4o-mini-2024-07-18	128K	16K	—	—	✓	✓	✓	✓	✓	✓	✓	✓
gpt-4o-mini	128K	16K	—	—	✓	✓	✓	—	—	—	—	—
gpt-4o-search-preview	128K	16K	—	✓	—	✓	✓	—	—	—	—	—

Image Input Limitations

Images must be supplied as a File object via a File Slot, which enforces a maximum file size of 5MB — files exceeding this limit will fail to upload. Supported image formats: PNG, JPEG/JPG, WEBP, and non-animated GIF.

File Input Limitations

Only PDF (.pdf) files are supported. Files must be supplied as a File object via a File Slot, which enforces a maximum file size of 5MB — files exceeding this limit will fail to upload.

Vision-capable models, such as gpt-4o and later, receive both the text and the page images extracted from the PDF.
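The image and file limits above can be checked before a file reaches the action. The sketch below is illustrative only: `check_upload` is a hypothetical helper, "5MB" is assumed to mean 5 × 1024 × 1024 bytes, and the check relies on the file extension alone, so it cannot tell whether a GIF is animated.

```python
from pathlib import PurePath
from typing import List

MAX_BYTES = 5 * 1024 * 1024  # assumption: "5MB" interpreted as 5 MiB
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".webp", ".gif"}
DOCUMENT_EXTENSIONS = {".pdf"}


def check_upload(filename: str, size_bytes: int, parameter: str) -> List[str]:
    """Return reasons a file would likely be rejected for the `image` or `file` parameter."""
    if parameter not in ("image", "file"):
        raise ValueError("parameter must be 'image' or 'file'")
    problems: List[str] = []
    allowed = IMAGE_EXTENSIONS if parameter == "image" else DOCUMENT_EXTENSIONS
    suffix = PurePath(filename).suffix.lower()
    if suffix not in allowed:
        problems.append(f"unsupported extension {suffix!r} for {parameter}")
    if size_bytes > MAX_BYTES:
        problems.append("file exceeds the 5MB limit")
    return problems


print(check_upload("receipt.JPG", 2_000_000, "image"))
# []
print(check_upload("policy.docx", 6 * 1024 * 1024, "file"))
# ["unsupported extension '.docx' for file", 'file exceeds the 5MB limit']
```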
