Muapi

wan2.1-lora-t2v

WAN 2.1 LoRA T2V enables users to generate videos from text prompts with custom-trained LoRA modules. Tailor the generation to specific characters, outfits, or animation styles — ideal for brand storytelling, fan content, and stylized animations.

Training

motion-controls

Motion Controls adds dynamic camera movements, speed ramps, and zoom effects to bring your images to life as smooth, engaging videos.

Image to Video

vfx

VFX delivers high-impact visual effects like explosions, particles, and cinematic overlays to transform static images into action-packed videos.

Image to Video

mmaudio-v2-video-to-video

MMAudio-v2 generates high-quality, synchronized audio from video or text inputs. Seamlessly integrate it with AI video models to create fully-voiced, expressive video content.

Video to Video

bytedance-seedream-v3

Seedream is designed for generating visually rich and artistic images from text prompts. It excels at fantasy, anime, surrealism, and vibrant color compositions — ideal for creative visuals, storyboards, and concept art.

Text to Image

wan2.2-text-to-video

Wan 2.2’s T2V mode transforms descriptive text prompts into high-quality, stylized video sequences. It excels at generating anime-style or cinematic visuals with smooth motion and strong thematic consistency.

Text to Video

hidream-i1-fast

Optimized for speed, this variant generates images in just a few steps. Ideal for previews, real-time applications, and use cases where fast results are more important than fine detail.

Text to Image

flux-pulid

Flux PuLID is an innovative image-to-image model that enables consistent face rendering across different styles or scenes—without needing any model fine-tuning. By providing a reference image (e.g., a portrait), the model generates new visuals while maintaining your subject’s identity with high fidelity.

Image to Image

seedance-lite-i2v

Seedance Lite I2V version animates static images into short videos quickly, focusing on basic motion effects and efficient processing—best suited for fast demos or mobile-friendly use.

Image to Video

pixverse-v5-i2v

PixVerse V5 delivers a major leap forward in AI-powered video creation — now featuring smoother motion, ultra-high resolution, and expanded visual effects.

Image to Video

runway-aleph-v2v

Transform any input video into a new visual style or scene while preserving motion and structure. Aleph V2V lets you apply artistic looks, cinematic lighting, or thematic changes to existing footage.

Video to Video

pixverse-v4.5-t2v

PixVerse v4.5 transforms descriptive text into vivid, high-resolution video clips. It understands complex scenes, human motion, and cinematic camera angles — great for creative storytelling, trailers, and animated concepts.

Text to Video

minimax-image-01-subject-reference

Minimax’s I2I “Subject Reference” model enables you to transform images while preserving the appearance of a subject using a single reference image. Ideal for maintaining character likeness—features, clothing, or expression—across different styles or settings.

Image to Image

veed-lipsync

Generate realistic lipsync from any audio using VEED's latest model

Video to Video

google-imagen4

Google Imagen 4 is the latest text-to-image AI model from DeepMind, designed to produce stunningly photorealistic images with crisp detail, accurate text rendering, and creative flexibility. It supports high-resolution output (up to 2K), generates visuals in seconds, and embeds SynthID watermarks for authenticity.

Text to Image

pixverse-v5-t2v

PixVerse V5 delivers a major leap forward in AI-powered video creation — now featuring smoother motion, ultra-high resolution, and expanded visual effects.

Text to Video

gpt4o-edit

Edit a specific part of an image using natural language. Ideal for object removal, replacement, or content-aware filling.

Image to Image

suno-remix-music

This API covers an audio track by transforming it into a new style while retaining its core melody. It incorporates Suno's upload capability, enabling users to upload an audio file for processing. The expected result is a refreshed audio track with a new style, keeping the original melody intact.

Text to Audio

ai-image-face-swap

Advanced facial recognition and blending algorithms enable precise face swaps while preserving skin tone, lighting, and facial geometry.

Image to Image

wan2.1-lora-i2v

Bring still images to life using WAN 2.1 LoRA I2V, which supports custom LoRA fine-tunes for identity consistency. Animate expressions, subtle movements, or full-body actions while preserving personalized features from the image and LoRA.

Training

ai-dress-change

Instantly change outfits in images using AI. Visualize different clothing styles without the need for physical trials—perfect for fashion, e-commerce, and virtual try-ons.

Image to Image

ai-background-remover

Instantly remove image backgrounds with pixel-perfect precision. Ideal for product photos, profile pictures, and creative projects.

Image to Image

veo3-text-to-video

VEO3 T2V generates cinematic videos from text prompts, capturing dynamic motion, rich scenes, and storytelling visuals in stunning detail.

Text to Video

vidu-q1-reference

Vidu Q1 enables you to generate cinematic 1080p videos using multiple visual references—up to seven images—and text prompts. Designed for consistency, it preserves character appearance, props, and backgrounds across scenes while adding new motion and narrative elements.

Image to Video

wan2.1-text-to-video

WAN 2.1 turns your written prompts into vivid, cinematic video clips. Ideal for storytelling, content creation, and visualizing abstract ideas, it supports detailed natural scenes, character motion, and dramatic camera movements — all from just text.

Text to Video

google-imagen4-fast

Imagen 4 Fast is optimized for speed and accessibility, allowing you to generate high-quality images in seconds. While slightly less detailed than the Ultra version, it excels at rapid ideation, drafts, storyboarding, and casual creativity.

Text to Image

wan2.1-image-to-video

Animate static images into expressive video sequences with WAN 2.1. Upload any image and guide its transformation into a moving scene — great for bringing art, characters, or photos to life with smooth motion and consistent style.

Image to Video

ai-product-shot

Instantly generate studio-quality product images with AI. Upload your item photo and get clean, stylized shots perfect for e-commerce, ads, and catalogs.

Image to Image

google-imagen4-ultra

Imagen 4 Ultra is Google’s flagship model, designed for photorealism, rich textures, and production-level imagery. It produces crisp, high-resolution visuals with advanced detail, lighting precision, and natural compositions.

Text to Image

latent-sync

LatentSync is a video-to-video model that generates lip sync animations from audio using advanced algorithms for high-quality synchronization.

Video to Video

seedance-lite-reference-video

Seedance Lite's Reference-to-Video feature allows you to supply up to 4 images as reference inputs. The model intelligently blends aspects from these images to generate a cohesive, high-quality video.

Image to Video

wan2.1-reference-video

WAN 2.1 is an advanced AI model that transforms one or more reference images into a coherent, animated video. By combining characters, objects, or environments from multiple images, it creates smooth motion sequences while preserving realism, style, and fine details.

Image to Video

flux-kontext-pro-t2i

Flux Kontext Pro T2I offers fast and reliable generation with creative flexibility. It supports stylized prompts, character design, and fantasy themes while maintaining clear subject coherence.

Text to Image

mmaudio-v2-text-to-audio

Convert text into natural-sounding speech using mmAudio-v2. Ideal for voiceovers, virtual assistants, and content narration with lifelike clarity and tone.

Text to Audio

ai-image-extension

Expand the edges of any image with AI. This model continues your original photo or artwork beyond its borders while matching style, lighting, and content.

Image to Image

midjourney-v7-image-to-video

Midjourney V7’s I2V breathes motion into still images, animating characters, environments, and objects with artistic transitions. Ideal for looping visual stories, concept animations, or enhancing still visuals with subtle motion.

Image to Video

gpt4o-image-to-image

Transform an input image based on a new prompt — like changing style, lighting, or composition. Useful for reinterpreting visuals while keeping structure.

Image to Image

midjourney-v7-text-to-image

Midjourney V7 produces high-quality, stylized images from text prompts. Known for its artistic flair, surreal composition, and vivid textures, it's perfect for character concepts, fantasy environments, and creative illustrations.

Text to Image

midjourney-v7-style-reference

Generate images in the distinctive aesthetic of Midjourney v7 — blending cinematic depth, photorealism or painterly rendering, rich textures, and dynamic lighting. This style reference model helps you infuse any subject with the visual storytelling, composition, and high detail fidelity that Midjourney is known for. Ideal for concept art, stylized portraits, and stunning environment scenes.

Image to Image

nano-banana

Nano Banana is an advanced AI model excelling in natural language-driven image generation and editing. It produces hyper-realistic, physics-aware visuals with seamless style transformations.

Text to Image

ideogram-character

Ideogram’s Character Reference model enables consistent character generation using just one reference image. Upload a clear character portrait—and you can place that character in unlimited scenes, styles, poses, or narratives with visual fidelity maintained across all outputs.

Image to Image

image-effects

AI Image Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning images from a image.

Image to Image

ai-video-effects

AI Video Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning videos from images.

Image to Video

flux-dev-lora

Enables text-to-image generation using custom LoRA models. Generate consistent characters, styles, or branded visuals with high quality and fast results.

Training

ai-ghibli-style

Bring your imagination to life with art inspired by the enchanting world of Studio Ghibli. This AI model generates dreamy, hand-drawn visuals with soft colors, whimsical characters, and painterly backgrounds

Image to Image

ai-image-upscaler

Transform blurry or pixelated images into high-definition visuals. Our AI Image Upscaler uses deep learning to reconstruct details and bring your visuals to life.

Image to Image

hunyuan-fast-text-to-video

Hunyuan Fast T2V provides accelerated video generation from text prompts with slightly reduced detail but excellent speed. Ideal for rapid prototyping, concept testing, and short-form ideas where time is critical.

Text to Video

veo3-fast-text-to-video

VEO3 Fast T2V creates short videos from text instantly, balancing speed and quality for quick content generation and prototyping.

Text to Video

infinitetalk-image-to-video

InfiniteTalk Image-to-Video brings still portraits and character photos to life by generating natural, realistic talking videos. You provide a single face image and a dialogue script, and the model animates lip movement, facial expressions, and subtle head gestures to match the speech.

Image to Video

infinitetalk-video-to-video

InfiniteTalk Video-to-Video enhances or transforms existing videos by syncing the subject’s lip movements and facial expressions with new dialogue or speech. Instead of starting from a still image, you provide a video clip, and the model seamlessly reanimates the speaker’s mouth and expressions to match the script.

Video to Video

ai-dance-effects

Bring your characters and worlds to life with AI Dance Effects — a creative video effect that adds playful, dynamic, and cinematic motion to your generations. AI Dance Effects lets you guide how characters move, react, and express themselves.

Video to Video

video-effects

AI Video Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning videos from images.

Image to Video

wan2.2-speech-to-video

WAN2.2 Speech-to-Video transforms a static image into a talking video by synchronizing lip movements and facial expressions with an audio input. Simply provide a character image along with a speech dialogue, and the model generates a natural, expressive video where the subject speaks your lines.

Image to Video

midjourney-v7-image-to-image

Use Midjourney V7’s I2I to refine or reinterpret existing images. Modify style, mood, lighting, or content while preserving the overall composition — great for alternate versions, art variations, or polishing concepts.

Image to Image

hidream-i1-full

The most advanced version of HiDream I1, delivering high-resolution, detailed images with superior prompt understanding. Best suited for production, content creation, and high-fidelity applications.

Text to Image

ai-anime-generator

Create stunning anime-style artwork instantly with our AI Anime Generator. Customize characters, scenes, and styles effortlessly in seconds!

Text to Image

kling-v2.1-standard-i2v

Kling 2.1 Standard (developed by Kuaishou) brings static images to life by generating smooth, realistic video clips from a single frame. It captures subtle motion, background dynamics, and camera movement to produce professional-looking animations — ideal for portraits, digital art, and cinematic illustrations.

Image to Video

sdxl-lora

The SDXL LoRA image model enhances Stable Diffusion XL with specialized fine-tuning, letting you generate images in unique styles, characters, or themes. By applying LoRA weights, you can create visuals that match a specific aesthetic, celebrity look, anime style, or custom-trained subject.

Training

runway-image-to-video

Animate any image by turning it into a video with motion effects or scene continuity. RunwayML’s I2V model transforms static visuals into short clips by extrapolating depth, movement, and temporal dynamics.

Image to Video

sync-lipsync

Generate realistic lipsync animations from audio using advanced algorithms for high-quality synchronization.

Video to Video

ideogram-v3-t2i

Ideogram v3 is an advanced text-to-image model designed for creating highly detailed and visually striking images directly from text prompts. It’s especially good for artistic compositions, design mockups, concept art, and photorealistic scenes. With strong support for text rendering inside images, it’s widely used for posters, typography-based art, and creative branding.

Text to Image

ai-video-face-swap

Replace faces in videos with stunning realism. Our AI ensures accurate expression transfer, lighting consistency, and smooth frame-by-frame blending.

Video to Video

kling-v2.1-master-t2v

Kling 2.1 Master’s T2V mode allows users to generate vivid, high-quality videos from detailed text prompts. It supports dynamic scenes, natural motion, and cinematic quality — perfect for storytelling, ads, or content creation from imagination alone.

Text to Video

seedance-pro-i2v

Seedance Pro I2V advanced model animates still images into stunning short videos, preserving intricate visual details and applying smooth motion dynamics, ideal for high-end visuals and cinematic edits.

Image to Video

seedance-pro-t2v

Seedance Pro delivers high-fidelity video generation from text, producing rich visuals, smooth camera movement, and realistic scenes. Best for storytelling, content creation, and visual production.

Text to Video

nano-banana-edit

Nano Banana is a mysterious, high-performance image model. It excels at precise, language-driven edits and consistent character preservation, allowing users to modify images with natural text commands.

Image to Image

creatify-lipsync

Realistic lipsync video - optimized for speed, quality, and consistency.

Video to Video

flux-kontext-dev-i2i

Takes an input images and transforms it based on a new prompt. Keeps structure or pose while changing style, appearance, or details.

Image to Image

flux-dev

Generate stunning visuals from simple text prompts. Flux Dev transforms your ideas into high-quality, creative images using powerful AI vision models. Perfect for design, storytelling, concept art, and marketing.

Text to Image

ideogram-v3-reframe

Ideogram V3 Reframe is a specialized image-to-image model built on Ideogram 3.0, designed to intelligently extend and adapt images across diverse aspect ratios and resolutions. Leveraging advanced AI outpainting, it preserves visual consistency while enabling creative reframing for digital, print, and video content.

Image to Image

ai-color-photo

Automatically add lifelike colors to black-and-white images. Our AI brings history to life with natural tones, accurate shading, and context-aware colorization.

Image to Image

sdxl-image

SDXL is a high-quality, large Stable Diffusion model for creating photorealistic and stylized images from text. It excels at fine detail, realistic lighting, and complex scenes.

Text to Image

flux-kontext-dev-t2i

Generates an image from a text prompt, with optional reference image for pose or style guidance. Ideal for controlled, consistent image creation using just a description.

Text to Image

veo3-fast-image-to-video

Quickly transform static images into short, motion-rich video clips with fast rendering and impressive quality — powered by Google's VEO3 on MuAPI.

Image to Video

ai-skin-enhancer

Smooth skin, reduce blemishes, and enhance complexion with natural-looking results. Perfect for portraits, selfies, and professional photo retouching.

Image to Image

ai-object-eraser

Easily remove unwanted objects, people, or text from any image using AI. Just select the area you want to erase, and the model will intelligently fill the space with realistic background matching the surrounding environment. No Photoshop skills needed.

Image to Image

veo3-image-to-video

VEO3 I2V animates static images into expressive video sequences, adding lifelike movement while preserving the original composition.

Image to Video

suno-extend-music

This API extends audio tracks while preserving the original style of the audio track. It includes Suno's upload functionality, allowing users to upload audio files for processing. The expected result is a longer track that seamlessly continues the input style.

Text to Audio

hidream-i1-dev

Optimized for speed, this variant generates images in just a few steps. Ideal for previews, real-time applications, and use cases where fast results are more important than fine detail.

Text to Image

suno-create-music

Suno generate music that turns text prompts into full songs — complete with vocals, lyrics, and instrumentation. You can describe a mood, genre, or even a specific lyric idea, and Suno creates a realistic, studio-quality track in seconds.

Text to Audio

ai-product-photography

Create professional-grade product photos using AI. Upload your item image and describe it with a prompt, and get studio-style, lifestyle, or creative backgrounds in seconds

Image to Image

runway-text-to-video

Generate short, high-quality videos from plain text prompts. RunwayML’s text-to-video model interprets your written description and animates it into a moving visual scene with realistic or stylized motion.

Text to Video

flux-kontext-pro-i2i

Flux Kontext Pro I2I variant enables transforming base images into refined artwork while keeping structure intact. It’s useful for sketch refinement, visual style changes, and creative edits such as re-dressing, relighting, or re-theming with prompt guidance.

Image to Image

gpt4o-text-to-image

Generate images from text prompts using GPT-4o's vision capabilities. Ideal for basic concept visuals, diagrams, and abstract compositions.

Text to Image

wan2.1-text-to-image

WAN 2.1 is a powerful AI model that transforms text prompts into high-resolution, photorealistic images. It excels at detailed object rendering, realistic lighting, and fine textures, making it ideal for visual content, concept art, advertising, and digital storytelling.

Text to Image

flux-kontext-max-i2i

Flux Kontext Max I2I in Max mode allows precise image enhancement and visual transformations while retaining the source layout. It’s powerful for retouching, photo-to-art workflows, concept refinement.

Image to Image

flux-schnell

Flux Schnell is a lightning-fast image generation model designed for rapid iterations. It delivers good visual quality from text prompts almost instantly, making it perfect for real-time concept testing, brainstorming, and UI-integrated experiences.

Text to Image

flux-kontext-max-t2i

Flux Kontext Max T2I delivers photorealistic or cinematic-quality images with exceptional detail. It's optimized for high-end visuals — from realistic humans to polished product renders.

Text to Image

hunyuan-text-to-video

Hunyuan T2V generates detailed and dynamic videos from text prompts with a focus on realism and coherent motion. It handles multi-object scenes, human actions, and cinematic compositions effectively, making it ideal for storytelling and visual concepts.

Text to Video

hunyuan-image-to-video

Hunyuan I2V takes a static image and generates realistic video animations by interpreting motion and context. It works well for human portraits, objects, or scenes, adding lifelike movement while maintaining the image's integrity.

Image to Video

bytedance-seededit-v3

Seededit allows precise edits to images using masks and prompt guidance. Whether you're replacing backgrounds, changing clothing, or inpainting missing areas, Seededit ensures realistic, high-quality results with semantic control.

Image to Image

seedance-lite-t2v

Seedance Lite T2V offers quick video generation from text with decent visual quality and motion. Ideal for fast previews, prototyping, or lightweight use cases where speed matters more than fine detail.

Text to Video

wan2.2-image-to-video

Wan 2.2’s I2V mode brings static visuals to life with vivid, expressive animations. It interprets motion, emotion, and background dynamics from a single image to generate smooth and cinematic short videos.

Image to Video

bytedance-seedream-v4

Seedream v4 generates stunning, high-fidelity images from text prompts. It’s designed for creativity with strong support for realism, fantasy, and artistic styles.

Text to Image

kling-v2.1-master-i2v

Kling 2.1 Master’s I2V animates a still image into a coherent video sequence. It interprets motion, environment, and context to create realistic, visually stunning video outputs — ideal for animating portraits, scenes, or concept art.

Image to Video

kling-v2.1-pro-i2v

Kling 2.1 Pro is the high-end version of Kuaishou’s video generation model, offering enhanced realism, longer motion sequences, and cinematic quality. In I2V mode, it animates static images with fluid environmental effects.

Image to Video

pixverse-v4.5-i2v

Upload an image and PixVerse v4.5 will breathe life into it with smooth camera motion, realistic effects, and animated elements. Whether it’s a portrait, landscape, or concept art, this mode turns still visuals into dynamic short videos.

Image to Video

bytedance-seedream-edit-v4

Seedream v4 Edit refines or transforms existing images based on a new prompt and a reference. Instead of masking, you provide a source image and describe how it should be altered — adjusting style, details, or replacing elements while keeping the subject consistent.

Image to Image

runway-act-two-i2v

Upload a single character image and a driving video — the model transfers facial expressions and head movements from the video onto your image, bringing it to life. It works with photos, illustrations, or stylized portraits, making them speak, blink, and move naturally. Ideal for avatars, AI presenters, digital actors, and story scenes.

Image to Video

runway-act-two-v2v

Take an existing character video and sync it with the motion from a reference video. This lets you update facial expressions, head turns, and speech gestures while keeping the original look and style. It’s perfect for reshooting performances, dubbing, or animating characters without re-rendering visuals.

Video to Video

qwen-image

Generate high-quality, detailed images from text prompts in various styles — from realistic to artistic — perfect for creative visuals, product shots, and concept art.

Text to Image

hunyuan-image-2.1

Hunyuan Image is a powerful text-to-image generation model that produces photorealistic and highly detailed visuals. It excels at creating portraits, environments, and concept art with strong consistency and realism. Designed for versatility, it supports both natural photography styles and imaginative artistic outputs.

Text to Image

vidu-v2.0-i2v

Vidu's 2.0 model delivers advanced image-based video generation with enhanced lighting, emotion dynamics, and automatic frame interpolation for polished visual content.

Image to Video

vidu-v2.0-t2v

Vidu's 2.0 model offers enhanced visual quality and comprehensive workflow support across multiple resolution options for versatile content creation.

Text to Video

midjourney-v7-omni-reference

Midjourney's Omni Reference lets you reuse characters, creatures, or styles from an existing image and place them into entirely new scenes. Simply provide a reference image (oref) and Midjourney will maintain identity, details, and visual consistency — ideal for storytelling, character design, or branding across multiple generations.

Image to Image

chroma-image

Croma Image is an advanced text-to-image generation model designed for high-quality, creative, and versatile visuals. It can produce anything from photorealistic portraits and products to imaginative concept art, fantasy illustrations, and cinematic scenes.

Text to Image

qwen-image-edit

The Qwen Edit Image Model allows you to modify existing images using text-based editing prompts. Instead of generating from scratch, you can upload a base image and describe the desired changes (e.g., replacing objects, altering colors, adding new elements).

Image to Image

luma-modify-video

Luma Modify Video lets you transform an existing video into a new creative scene while keeping the original motion and timing intact. The result is a new video with the same movements but a completely fresh look, atmosphere, or theme.

Video to Video

luma-flash-reframe

Transform and resize your videos effortlessly with Ray 2 Flash Reframe. This tool intelligently expands or adjusts your video’s aspect ratio—adding visually consistent content to the sides, top, or bottom—without altering the original subject.

Video to Video

minimax-hailuo-02-standard-t2v

Fast and lightweight text-to-video generation. Ideal for quick drafts, previews, or playful content where speed matters more than cinematic quality.

Text to Video

wan2.2-5b-fast-t2v

Wan 2.2 Fast is a lightweight, high-speed version of the Wan 2.2 model, optimized for quick text-to-video generation. It trades some cinematic detail for rapid results, making it perfect for prototyping, previews, social media clips, and quick storytelling.

Text to Video

minimax-hailuo-02-standard-i2v

Transforms an image into video with light, natural motion. Great for social media, quick animations, and previews.

Image to Video

minimax-hailuo-02-pro-i2v

Advanced image-to-video with cinematic realism. Adds dynamic camera motion, realistic physics, and atmospheric detail for storytelling.

Image to Video

minimax-hailuo-02-pro-t2v

High-fidelity text-to-video with cinematic rendering. Best for storytelling, cinematic clips, or realistic visuals with depth, atmosphere, and detail.

Text to Video

Explore Docs Discord

Dashboard Explore API Keys Billing

Explore Docs Discord

Featured AI Models

Training

wan2.1-lora-t2v

$0.3

Image to Video

motion-controls

$0.3

Motion Controls adds dynamic camera movements, speed ramps, and zoom effects to bring your images to life as smooth, engaging videos.

Image to Video

vfx

$0.3

VFX delivers high-impact visual effects like explosions, particles, and cinematic overlays to transform static images into action-packed videos.

Video to Video

mmaudio-v2-video-to-video

$0.01

MMAudio-v2 generates high-quality, synchronized audio from video or text inputs. Seamlessly integrate it with AI video models to create fully-voiced, expressive video content.

Text to Image

bytedance-seedream-v3

$0.03

Text to Video

wan2.2-text-to-video

$0.3

Text to Image

hidream-i1-fast

$0.01

Optimized for speed, this variant generates images in just a few steps. Ideal for previews, real-time applications, and use cases where fast results are more important than fine detail.

Image to Image

flux-pulid

$0.04

All Models 114 models

veo3-fast-text-to-video

VEO3 Fast T2V creates short videos from text instantly, balancing speed and quality for quick content generation and prototyping.

Text to Image$0.03

bytedance-seedream-v3

Image to Video$0.1

seedance-lite-i2v

Seedance Lite I2V version animates static images into short videos quickly, focusing on basic motion effects and efficient processing—best suited for fast demos or mobile-friendly use.

Image to Video$0.6

veo3-fast-image-to-video

Quickly transform static images into short, motion-rich video clips with fast rendering and impressive quality — powered by Google's VEO3 on MuAPI.

Image to Image$0.03

nano-banana-edit

Image to Video$0.3

wan2.2-image-to-video

Image to Video$0.18

seedance-pro-i2v

Image to Image$0.02

ai-image-face-swap

Advanced facial recognition and blending algorithms enable precise face swaps while preserving skin tone, lighting, and facial geometry.

Image to Image$0.03

bytedance-seededit-v3

Training$0.01

flux-dev-lora

Enables text-to-image generation using custom LoRA models. Generate consistent characters, styles, or branded visuals with high quality and fast results.

Text to Image$0.01

flux-dev

Text to Image$0.03

midjourney-v7-text-to-image

Image to Image$0.04

gpt4o-image-to-image

Transform an input image based on a new prompt — like changing style, lighting, or composition. Useful for reinterpreting visuals while keeping structure.

Text to Video$0.1

seedance-lite-t2v

Image to Video$0.3

motion-controls

Motion Controls adds dynamic camera movements, speed ramps, and zoom effects to bring your images to life as smooth, engaging videos.

Image to Image$0.15

ideogram-character

Image to Video$0.23

kling-v2.1-standard-i2v

Text to Image$0.03

qwen-image

Generate high-quality, detailed images from text prompts in various styles — from realistic to artistic — perfect for creative visuals, product shots, and concept art.

Image to Video$0.15

midjourney-v7-image-to-video

Image to Image$0.1

ai-dress-change

Instantly change outfits in images using AI. Visualize different clothing styles without the need for physical trials—perfect for fashion, e-commerce, and virtual try-ons.

Text to Video$0.3

wan2.2-text-to-video

Image to Video$0.15

runway-image-to-video

Text to Audio$0.09

suno-create-music

Image to Image$0.02

ai-image-upscaler

Transform blurry or pixelated images into high-definition visuals. Our AI Image Upscaler uses deep learning to reconstruct details and bring your visuals to life.

Text to Video$3

veo3-text-to-video

VEO3 T2V generates cinematic videos from text prompts, capturing dynamic motion, rich scenes, and storytelling visuals in stunning detail.

Image to Image$0.04

flux-pulid

Text to Image$0.03

nano-banana

Nano Banana is an advanced AI model excelling in natural language-driven image generation and editing. It produces hyper-realistic, physics-aware visuals with seamless style transformations.

Text to Video$0.18

seedance-pro-t2v

Seedance Pro delivers high-fidelity video generation from text, producing rich visuals, smooth camera movement, and realistic scenes. Best for storytelling, content creation, and visual production.

Text to Image$0.03

wan2.1-text-to-image

Image to Image$0.03

flux-kontext-pro-i2i

Video to Video$0.1

ai-video-face-swap

Replace faces in videos with stunning realism. Our AI ensures accurate expression transfer, lighting consistency, and smooth frame-by-frame blending.

Text to Audio$0.09

suno-remix-music

Image to Video$0.3

ai-video-effects

AI Video Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning videos from images.

Image to Image$0.02

flux-kontext-dev-i2i

Takes an input images and transforms it based on a new prompt. Keeps structure or pose while changing style, appearance, or details.

Video to Video$0.04

sync-lipsync

Generate realistic lipsync animations from audio using advanced algorithms for high-quality synchronization.

Image to Video$0.3

wan2.1-image-to-video

Video to Video$0.01

mmaudio-v2-video-to-video

MMAudio-v2 generates high-quality, synchronized audio from video or text inputs. Seamlessly integrate it with AI video models to create fully-voiced, expressive video content.

Image to Video$0.4

kling-v2.1-pro-i2v

Image to Video$3

veo3-image-to-video

VEO3 I2V animates static images into expressive video sequences, adding lifelike movement while preserving the original composition.

Image to Image$0.05

ai-product-photography

Create professional-grade product photos using AI. Upload your item image and describe it with a prompt, and get studio-style, lifestyle, or creative backgrounds in seconds

Image to Video$0.15

hunyuan-image-to-video

Image to Video$0.3

vfx

VFX delivers high-impact visual effects like explosions, particles, and cinematic overlays to transform static images into action-packed videos.

Image to Video$0.15

minimax-hailuo-02-standard-i2v

Transforms an image into video with light, natural motion. Great for social media, quick animations, and previews.

Image to Image$0.01

minimax-image-01-subject-reference

Text to Image$0.04

gpt4o-text-to-image

Generate images from text prompts using GPT-4o's vision capabilities. Ideal for basic concept visuals, diagrams, and abstract compositions.

Image to Video$0.3

video-effects

AI Video Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning videos from images.

Image to Video$0.3

pixverse-v5-i2v

PixVerse V5 delivers a major leap forward in AI-powered video creation — now featuring smoother motion, ultra-high resolution, and expanded visual effects.

Text to Image$0.06

flux-kontext-max-t2i

Flux Kontext Max T2I delivers photorealistic or cinematic-quality images with exceptional detail. It's optimized for high-end visuals — from realistic humans to polished product renders.

Image to Image$0.03

ai-image-extension

Expand the edges of any image with AI. This model continues your original photo or artwork beyond its borders while matching style, lighting, and content.

Text to Image$0.02

hidream-i1-dev

Optimized for speed, this variant generates images in just a few steps. Ideal for previews, real-time applications, and use cases where fast results are more important than fine detail.

Text to Image$0

flux-schnell

Text to Video$0.09

runway-text-to-video

Video to Video$0.2

runway-aleph-v2v

Transform any input video into a new visual style or scene while preserving motion and structure. Aleph V2V lets you apply artistic looks, cinematic lighting, or thematic changes to existing footage.

Text to Image$0.02

flux-kontext-dev-t2i

Generates an image from a text prompt, with optional reference image for pose or style guidance. Ideal for controlled, consistent image creation using just a description.

Image to Image$0.01

ai-skin-enhancer

Smooth skin, reduce blemishes, and enhance complexion with natural-looking results. Perfect for portraits, selfies, and professional photo retouching.

Training$0.3

wan2.1-lora-i2v

Text to Audio$0.01

mmaudio-v2-text-to-audio

Convert text into natural-sounding speech using mmAudio-v2. Ideal for voiceovers, virtual assistants, and content narration with lifelike clarity and tone.

Image to Image$0.06

flux-kontext-max-i2i

Image to Image$0.03

image-effects

AI Image Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning images from a image.

Text to Image$0.03

google-imagen4

Image to Image$0.06

ai-product-shot

Instantly generate studio-quality product images with AI. Upload your item photo and get clean, stylized shots perfect for e-commerce, ads, and catalogs.

Image to Video$0.3

pixverse-v4.5-i2v

Image to Image$0.01

ai-background-remover

Instantly remove image backgrounds with pixel-perfect precision. Ideal for product photos, profile pictures, and creative projects.

Text to Image$0.03

ai-anime-generator

Create stunning anime-style artwork instantly with our AI Anime Generator. Customize characters, scenes, and styles effortlessly in seconds!

Image to Video$0.3

kling-v2.1-master-i2v

Image to Image$0.05

ai-ghibli-style

Text to Image$0.02

ideogram-v3-t2i

Text to Image$0.01

hidream-i1-fast

Optimized for speed, this variant generates images in just a few steps. Ideal for previews, real-time applications, and use cases where fast results are more important than fine detail.

Image to Video$0.6

minimax-hailuo-02-pro-i2v

Advanced image-to-video with cinematic realism. Adds dynamic camera motion, realistic physics, and atmospheric detail for storytelling.

Video to Video$0.2

infinitetalk-video-to-video

Image to Image$0.03

midjourney-v7-image-to-image

Text to Video$0.02

wan2.2-5b-fast-t2v

Image to Video$0.2

wan2.2-speech-to-video

Training$0.3

wan2.1-lora-t2v

Video to Video$0.04

latent-sync

LatentSync is a video-to-video model that generates lip sync animations from audio using advanced algorithms for high-quality synchronization.

Text to Video$0.05

hunyuan-fast-text-to-video

Video to Video$0.3

ai-dance-effects

Text to Video$0.6

minimax-hailuo-02-pro-t2v

High-fidelity text-to-video with cinematic rendering. Best for storytelling, cinematic clips, or realistic visuals with depth, atmosphere, and detail.

Text to Video$0.15

hunyuan-text-to-video

Image to Video$0.3

vidu-v2.0-i2v

Vidu's 2.0 model delivers advanced image-based video generation with enhanced lighting, emotion dynamics, and automatic frame interpolation for polished visual content.

Video to Video$0.04

veed-lipsync

Generate realistic lipsync from any audio using VEED's latest model

Image to Video$0.1

seedance-lite-reference-video

Image to Image$0.01

ai-color-photo

Automatically add lifelike colors to black-and-white images. Our AI brings history to life with natural tones, accurate shading, and context-aware colorization.

Text to Image$0.03

flux-kontext-pro-t2i

Flux Kontext Pro T2I offers fast and reliable generation with creative flexibility. It supports stylized prompts, character design, and fantasy themes while maintaining clear subject coherence.

Text to Image$0.04

hidream-i1-full

The most advanced version of HiDream I1, delivering high-resolution, detailed images with superior prompt understanding. Best suited for production, content creation, and high-fidelity applications.

Text to Video$0.3

pixverse-v4.5-t2v

Text to Image$0.04

bytedance-seedream-v4

Seedream v4 generates stunning, high-fidelity images from text prompts. It’s designed for creativity with strong support for realism, fantasy, and artistic styles.

Image to Video$0.07

runway-act-two-i2v

Text to Video$1.2

kling-v2.1-master-t2v

Text to Video$0.3

pixverse-v5-t2v

PixVerse V5 delivers a major leap forward in AI-powered video creation — now featuring smoother motion, ultra-high resolution, and expanded visual effects.

Text to Video$0.3

wan2.1-text-to-video

Text to Image$0.06

google-imagen4-ultra

Image to Image$0.03

midjourney-v7-omni-reference

Image to Video$0.4

vidu-q1-reference

Text to Audio$0.09

suno-extend-music

Video to Video$0.35

luma-modify-video

Text to Image$0.02

google-imagen4-fast

Video to Video$0.35

luma-flash-reframe

Image to Video$0.1

wan2.1-reference-video

Image to Image$0.04

bytedance-seedream-edit-v4

Text to Video$0.3

minimax-hailuo-02-standard-t2v

Fast and lightweight text-to-video generation. Ideal for quick drafts, previews, or playful content where speed matters more than cinematic quality.

Image to Image$0.03

midjourney-v7-style-reference

Video to Video$0.04

creatify-lipsync

Realistic lipsync video - optimized for speed, quality, and consistency.

Image to Image$0.05

ai-object-eraser

Training$0

sdxl-lora

Image to Image$0.15

ideogram-v3-reframe

Text to Image$0

sdxl-image

SDXL is a high-quality, large Stable Diffusion model for creating photorealistic and stylized images from text. It excels at fine detail, realistic lighting, and complex scenes.

Image to Image$0.04