Amazon Titan Cheat Sheet
- Amazon Titan models are a family of powerful, general-purpose models pre-trained by AWS on massive datasets. They are designed to be used out of the box or fine-tuned with your own data, allowing you to adapt them for specific tasks without the need to annotate large volumes of training data.
- The Titan family consists of three core model types: Text Generation, Embeddings (Text & Multimodal), and Image Generation.
Features
- A family of foundation models developed by AWS for generative AI tasks.
- Titan Text models support summarization, writing, Q&A, and code generation.
- Titan Embeddings models convert text into vectors for semantic search, RAG, and personalization.
- Titan Image Generator creates and edits images from text prompts (available in preview/GA).
- Built-in safety filters and responsible AI practices to minimize harmful outputs.
- Fine-tune models on your own data for domain-specific needs.
- Natively accessible in Amazon Bedrock and easily integrated into AWS workflows.
- Usage-based pricing via Amazon Bedrock APIs or Console.
Use Cases
- Content Creation:
- Automated copywriting, blog post generation, and email drafting using Titan Text.
- Semantic Search:
- Building search engines that understand the meaning of a query (not just keywords) using Titan Embeddings.
- Visual Marketing:
- Generating studio-quality images for ad campaigns or e-commerce backgrounds using Titan Image Generator.
- Retrieval Augmented Generation (RAG):
- Combining Titan Text and Embeddings to answer questions based on your private company data.
- Data Cleaning:
- Removing backgrounds from product images or inpainting missing details.
Amazon Titan Text Models
-
Amazon Titan Text G1 – Premier
- LLM for text generation.
- Supports open-ended and context-aware Q&A, code generation, and summarization. It integrates with Amazon Bedrock Knowledge Base and Agents, and offers custom fine-tuning (currently in preview).
- Model ID –
amazon.titan-text-premier-v1:0 - Max tokens – 32K
- Languages – English
- Supported use cases – 32k context window, open-ended text generation, brainstorming, summarizations, code generation, table creation, data formatting, paraphrasing, chain of thought, rewrite, extraction, QnA, chat, Knowledge Base support, Agents support, Model Customization (preview).
- Inference parameters – Temperature, Top P (defaults: Temperature = 0.7, Top P = 0.9)
- Model ID –
-
Amazon Titan Text G1 – Express
- LLM for text generation.
- Designed for advanced language tasks, including open-ended text generation, chat, and Retrieval Augmented Generation (RAG). It is optimized for English, with preview support for over 100 additional languages.
- Model ID –
amazon.titan-text-express-v1 - Max tokens – 8K
- Languages – English (GA), 100 additional languages (Preview)
- Supported use cases – Retrieval augmented generation, open-ended text generation, brainstorming, summarizations, code generation, table creation, data formatting, paraphrasing, chain of thought, rewrite, extraction, QnA, and chat.
- Model ID –
-
Amazon Titan Text G1 – Lite
- A lightweight, efficient model designed for fine-tuning English tasks like summarization and copywriting.
- It offers a smaller, cost-effective option that is highly customizable for specific use cases.
- Model ID –
amazon.titan-text-lite-v1 - Max tokens – 4K
- Languages – English
- Supported use cases – Open-ended text generation, brainstorming, summarizations, code generation, table creation, data formatting, paraphrasing, chain of thought, rewrite, extraction, QnA, and chat.
- Model ID –
-
Amazon Titan Text Model Customization Hyperparameters
- Amazon Titan Text Premier model supports the following hyperparameters for model customization. The number of epochs you specify increases your model customization cost by processing more tokens. Each epoch processes the entire training dataset once.
| Hyperparameter (console) | Hyperparameter (API) | Definition | Type | Min | Max | Default |
|---|---|---|---|---|---|---|
| Epochs | epochCount | No. of iterations through the entire training dataset | integer | 1 | 5 | 2 |
| Batch size (micro) | batchSize | No. of samples processed before updating model parameters | integer | 1 | 1 | 1 |
| Learning rate | learningRate | The rate at which model parameters are updated after each batch | float | 1.00E-07 | 1.00E-05 | 1.00E-06 |
| Learning rate warmup steps | learningRateWarmupSteps | The number of iterations over which the learning rate is gradually increased to the specified rate | integer | 0 | 20 | 5 |
-
- Amazon Titan Text Lite and Amazon Titan Text Express models support the following hyperparameters for model customization. The number of epochs you specify increases your model customization cost by processing more tokens.
| Hyperparameter (console) | Hyperparameter (API) | Definition | Type | Min | Max | Default |
|---|---|---|---|---|---|---|
| Epochs | epochCount | The number of iterations through the entire training dataset | integer | 1 | 10 | 5 |
| Batch size (micro) | batchSize | The number of samples processed before updating model parameters | integer | 1 | 64 | 1 |
| Learning rate | learningRate | The rate at which model parameters are updated after each batch | float | 0.0 | 1 | 1.00E-5 |
| Learning rate warmup steps | learningRateWarmupSteps | The number of iterations over which the learning rate is gradually increased to the specified rate | integer | 0 | 250 | 5 |
-
Amazon Titan Text Embeddings V2
-
A highly efficient model optimized for RAG and semantic search.
-
It allows for flexible output dimensions to balance accuracy with storage costs.
- Model ID –
amazon.titan-embed-text-v2:0 - Max input text tokens – 8,192
- Max input text characters – 50,000
- Languages – English (100+ languages in preview)
- Output vector size – 1,024 (default), 512, 256
- Inference types – On-Demand, Provisioned Throughput
- Supported use cases – RAG, document search, reranking, classification, etc.
- Model ID –
-
V2 Request Parameters
- The request body is passed in the body field of an InvokeModel request.
inputText(Required): The text string to convert into an embedding.normalize(Optional): Boolean flag indicating whether to normalize the output embedding. Defaults to true.dimensions(Optional): Specifies the size of the output vector. Accepted values: 1024, 512, 256.embeddingTypes(Optional): A list specifying the desired return format. Accepts [“float”], [“binary”], or both. Defaults to float.
-
V2 Response Fields
embedding: An array of float values representing the vector of the input text.inputTextTokenCount: The number of tokens detected in the input text.embeddingsByType: A dictionary containing the embedding lists keyed by type (float or binary), depending on the requested embeddingTypes.
-
-
Amazon Titan Text Embeddings G1 – Text
-
The original AWS text embedding model was designed for general-purpose semantic search and clustering.
-
Note that this model does not support additional inference parameters, such as
temperatureortopP.- Model ID –
amazon.titan-embed-text-v1 - Max Input Text – 8,192 tokens.
- Output Vector Size – 1,536 (Fixed).
- Languages – Supports 25+ languages.
- Inference Types – On-Demand, Provisioned Throughput.
- Model ID –
-
G1 Request Parameters
-
inputText(Required): The text string to convert into an embedding.
-
-
G1 Response Fields
embedding: An array of values representing the embedding vector.inputTextTokenCount: The number of tokens in the input.
-
Amazon Titan Multimodal Embeddings G1 Model
-
- This model converts both text and images (or a combination of both) into a shared vector space, enabling cross-modal capabilities like “search by image” or “text-to-image” search.
- Model ID –
amazon.titan-embed-image-v1 - Max input text tokens – 256
- Languages – English
- Max input image size – 25 MB
- Output vector size – 1,024 (default), 384, 256
- Inference types – On-Demand, Provisioned Throughput
- Supported use cases – Search, recommendation, and personalization.
- Model ID –
-
Request & Response
- Request:
- Accepts
inputText(description),inputImage(base64 encoded string), or both. IncludesembeddingConfigto specify output embedding length.
- Accepts
- Response:
- Returns the
embeddingvector,inputTextTokenCount, and an optionalmessage.
- Returns the
- Request:
- This model converts both text and images (or a combination of both) into a shared vector space, enabling cross-modal capabilities like “search by image” or “text-to-image” search.
Amazon Titan Image Generator G1 Models
Generates and edits studio-quality images from natural language prompts. Both versions share a common API structure but differ in capabilities and resolution support.
-
Amazon Titan Image Generator v1 (overview)
- Enables users to generate and edit images from natural language prompts.
- Model ID –
amazon.titan-image-generator-v1 - Max input characters – 512 char
- Max input image size – 5 MB (only some specific resolutions are supported)
- Max image size using in/outpainting – 1,408 x 1,408 px px
- Max image size using image variation – 4,096 x 4,096 px
- Languages – English
- Output type – image
- Supported image types – JPEG, JPG, PNG
- Inference types – On-Demand, Provisioned Throughput
- Supported use cases – image generation, image editing, image variations
- Model ID –
- Enables users to generate and edit images from natural language prompts.
-
Amazon Titan Image Generator v2 (overview)
- Supports all Titan Image Generator v1 features with added new capabilities.
- Allows use of reference images to guide layout and composition while following text prompts.
- Includes automatic background removal for images with multiple objects.
- Model ID –
amazon.titan-image-generator-v2:0 - Max input characters – 512 char
- Max input image size – 5 MB (only some specific resolutions are supported)
- Max image size using in/outpainting, background removal, image conditioning, color palette – 1,408 x 1,408 px
- Max image size using image variation – 4,096 x 4,096 px
- Languages – English
- Output type – image
- Supported image types – JPEG, JPG, PNG
- Inference types – On-Demand, Provisioned Throughput
- Supported use cases – image generation, image editing, image variations, background removal, color guided content
- Model ID –
-
Inference Parameters & Responses
- When making an
InvokeModelcall, thebodyfield determines the task. All tasks utilize a sharedimageGenerationConfigobject (for quality, dimensions, and quantity), but require specific parameter objects for the task type. - Supported Tasks:
- When making an
| Task | Description |
|---|---|
TEXT_IMAGE |
Generates an image from a text prompt. |
INPAINTING |
Replaces masked areas of an existing image with new content. |
OUTPAINTING |
Extends the borders of an image with new content. |
IMAGE_VARIATION |
Generates variations of an input image (style/background). |
COLOR_GUIDED_GENERATION (V2 only) |
Generates images adhering to a specific hex color palette. |
BACKGROUND_REMOVAL (V2 only) |
Removes the background, leaving a transparent layer. |
TEXT_IMAGE_CONDITIONING(V2 only) |
Generates images following the layout of a reference image. |
-
- Response Body:
- Returns an
imagesarray containing the base64-encoded strings of the generated output(s) and anerrorfield if applicable.
- Returns an
- Response Body:
Pricing
- Pricing is based on Amazon Bedrock On-Demand rates.
- Titan Text Embeddings (V2): ~$0.00002 per 1,000 input tokens.
- Titan Text Embeddings (G1): ~$0.0001 per 1,000 input tokens.
- Titan Multimodal Embeddings: ~$0.0008 per 1,000 input tokens | ~$0.00006 per input image.
- Titan Image Generator (v1 & v2):
- Standard (512×512): ~$0.008 per image.
- Standard (1024×1024): ~$0.010 per image.
- Premium (1024×1024): ~$0.012 per image.
- Background Removal: ~$0.012 per image.
Amazon Titan Cheat Sheet References:
https://docs.aws.amazon.com/bedrock/latest/userguide/titan-embedding-models.html
https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-titan-embed-text.html
https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-titan-image.html
https://docs.aws.amazon.com/bedrock/latest/userguide/titan-image-models.html
https://docs.aws.amazon.com/bedrock/latest/userguide/titan-multiemb-models.html
https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-titan-embed-mm.html











