🎯 Complete API Feature Specification

Goal: Implement FULL power of every API (not what was done before)

RUNWAY - Complete Features

Image Generation (NEW - 9th Provider)

Endpoint: POST /v1/text_to_image Model: gen4_image Parameters:

promptText (required)
ratio (aspect ratio: 1360:768, 1920:1080, etc.)
seed (0-4294967295)
referenceImages (array, up to 3):
- uri (image URL or data URI)
- tag (string identifier)
contentModeration (settings object)

Video Generation

Already implemented but verify:

Text-to-video
Image-to-video
Camera control
All Gen-4 parameters

Audio Generation (NEW)

Endpoints:

POST /v1/sound_effect
POST /v1/text_to_speech
POST /v1/speech_to_speech
POST /v1/voice_dubbing
POST /v1/voice_isolation

TOPAZ LABS - Complete Features

Image Enhancement Models

Available:

Standard V2 (general purpose)
Low Resolution V2 (web graphics)
CGI (digital illustrations)
High Fidelity V2 (professional photo)
Text Refine (text and shapes)
Standard MAX
Recovery V2
Wonder
Redefine

All Parameters

Basic:

image (file upload)
source_url (alternative to file)
model (enum from above)
output_height (1-32000)
output_width (1-32000)
crop_to_fill (boolean)
output_format (jpeg/png/tiff)

Advanced (Model-specific):

face_enhancement (boolean)
face_enhancement_creativity (0-1)
face_enhancement_strength (0-1)
detail (0-1, for Super Focus)
focus_boost (0.25-1, for Super Focus)
strength (0.01-1, for upscaling)
subject_detection (string)
webhook_url (for async notifications)

Video Enhancement

Already researched - verify implementation matches:

Complete upload workflow (create, accept, upload, complete, poll)
All filter models
Frame interpolation
All enhancement options

Current Implementation Gap Analysis

What's Missing:

❌ Runway Gen-4 Image provider (completely absent)
❌ Runway Audio features (5 endpoints)
❌ Topaz face enhancement controls (3 parameters)
❌ Topaz model-specific parameters (detail, focus_boost, strength)
❌ Full Topaz model list (only using 5/9 models)

Estimated Impact:

Adding Runway Image: +1 image provider (87.5% → 90%)
Completing Topaz: Better quality control for users
Runway Audio: New capability category

Recommended Approach

Given session length (~400K tokens used), recommend:

NOW (This Session):

Add Runway Gen-4 Image provider (highest value)
Update Topaz with critical missing parameters
Test both additions

NEXT SESSION: 4. Add Runway Audio features 5. Systematically review all 9 providers for completeness 6. Add any missing parameters across the board

This ensures we deliver the highest-value features now while planning comprehensive completion.

User Response: Proceeding with implementation...

2.8 KiB Raw Blame History