forge/OLD_DOCS/COMPLETE_API_SPECIFICATION.md

2.8 KiB

🎯 Complete API Feature Specification

Goal: Implement FULL power of every API (not what was done before)


RUNWAY - Complete Features

Image Generation (NEW - 9th Provider)

Endpoint: POST /v1/text_to_image Model: gen4_image Parameters:

  • promptText (required)
  • ratio (aspect ratio: 1360:768, 1920:1080, etc.)
  • seed (0-4294967295)
  • referenceImages (array, up to 3):
    • uri (image URL or data URI)
    • tag (string identifier)
  • contentModeration (settings object)

Video Generation

Already implemented but verify:

  • Text-to-video
  • Image-to-video
  • Camera control
  • All Gen-4 parameters

Audio Generation (NEW)

Endpoints:

  • POST /v1/sound_effect
  • POST /v1/text_to_speech
  • POST /v1/speech_to_speech
  • POST /v1/voice_dubbing
  • POST /v1/voice_isolation

TOPAZ LABS - Complete Features

Image Enhancement Models

Available:

  1. Standard V2 (general purpose)
  2. Low Resolution V2 (web graphics)
  3. CGI (digital illustrations)
  4. High Fidelity V2 (professional photo)
  5. Text Refine (text and shapes)
  6. Standard MAX
  7. Recovery V2
  8. Wonder
  9. Redefine

All Parameters

Basic:

  • image (file upload)
  • source_url (alternative to file)
  • model (enum from above)
  • output_height (1-32000)
  • output_width (1-32000)
  • crop_to_fill (boolean)
  • output_format (jpeg/png/tiff)

Advanced (Model-specific):

  • face_enhancement (boolean)
  • face_enhancement_creativity (0-1)
  • face_enhancement_strength (0-1)
  • detail (0-1, for Super Focus)
  • focus_boost (0.25-1, for Super Focus)
  • strength (0.01-1, for upscaling)
  • subject_detection (string)
  • webhook_url (for async notifications)

Video Enhancement

Already researched - verify implementation matches:

  • Complete upload workflow (create, accept, upload, complete, poll)
  • All filter models
  • Frame interpolation
  • All enhancement options

Current Implementation Gap Analysis

What's Missing:

  1. Runway Gen-4 Image provider (completely absent)
  2. Runway Audio features (5 endpoints)
  3. Topaz face enhancement controls (3 parameters)
  4. Topaz model-specific parameters (detail, focus_boost, strength)
  5. Full Topaz model list (only using 5/9 models)

Estimated Impact:

  • Adding Runway Image: +1 image provider (87.5% → 90%)
  • Completing Topaz: Better quality control for users
  • Runway Audio: New capability category

Given session length (~400K tokens used), recommend:

NOW (This Session):

  1. Add Runway Gen-4 Image provider (highest value)
  2. Update Topaz with critical missing parameters
  3. Test both additions

NEXT SESSION: 4. Add Runway Audio features 5. Systematically review all 9 providers for completeness 6. Add any missing parameters across the board

This ensures we deliver the highest-value features now while planning comprehensive completion.

User Response: Proceeding with implementation...