🎯 Complete API Feature Specification
Goal: Implement the full feature set of every API, not just the subset implemented so far
RUNWAY - Complete Features
Image Generation (NEW - 9th Provider)
Endpoint: POST /v1/text_to_image
Model: gen4_image
Parameters:
- promptText (required)
- ratio (aspect ratio: 1360:768, 1920:1080, etc.)
- seed (0-4294967295)
- referenceImages (array, up to 3), each entry with:
  - uri (image URL or data URI)
  - tag (string identifier)
- contentModeration (settings object)
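As a minimal sketch of how the parameters above might be assembled and checked before sending, the helper below builds a request body for POST /v1/text_to_image. The function name, the default ratio, and the placement of the model name under a "model" key are assumptions for illustration; only the field names and limits come from the list above.

```python
from typing import Any, Optional

MAX_SEED = 4294967295        # upper seed bound from the parameter list
MAX_REFERENCE_IMAGES = 3     # "up to 3" from the parameter list


def build_text_to_image_payload(
    prompt_text: str,
    ratio: str = "1920:1080",  # assumed default; any listed ratio works
    seed: Optional[int] = None,
    reference_images: Optional[list[dict[str, str]]] = None,
    content_moderation: Optional[dict[str, Any]] = None,
) -> dict[str, Any]:
    """Assemble a JSON-serialisable body (hypothetical helper)."""
    if not prompt_text:
        raise ValueError("promptText is required")

    # Assumption: the model name travels in the body under "model".
    payload: dict[str, Any] = {
        "model": "gen4_image",
        "promptText": prompt_text,
        "ratio": ratio,
    }

    if seed is not None:
        if not 0 <= seed <= MAX_SEED:
            raise ValueError(f"seed must be between 0 and {MAX_SEED}")
        payload["seed"] = seed

    if reference_images:
        if len(reference_images) > MAX_REFERENCE_IMAGES:
            raise ValueError("at most 3 referenceImages are allowed")
        for ref in reference_images:
            if "uri" not in ref:
                raise ValueError("each reference image needs a uri")
        # Keep only the two documented keys.
        payload["referenceImages"] = [
            {key: ref[key] for key in ("uri", "tag") if key in ref}
            for ref in reference_images
        ]

    if content_moderation is not None:
        payload["contentModeration"] = content_moderation

    return payload
```

Validating locally keeps obviously malformed requests from consuming API quota; the provider would still need to handle server-side errors.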
Video Generation
Already implemented; verify coverage of:
- Text-to-video
- Image-to-video
- Camera control
- All Gen-4 parameters
Audio Generation (NEW)
Endpoints:
- POST /v1/sound_effect
- POST /v1/text_to_speech
- POST /v1/speech_to_speech
- POST /v1/voice_dubbing
- POST /v1/voice_isolation
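One way to keep the five audio endpoints data-driven is a single lookup table, sketched below. The enum name, the helper, and the base URL passed in are hypothetical; only the paths are taken from the list above.

```python
from enum import Enum


class RunwayAudioTask(str, Enum):
    """Audio endpoint paths from the list above (all use POST)."""
    SOUND_EFFECT = "/v1/sound_effect"
    TEXT_TO_SPEECH = "/v1/text_to_speech"
    SPEECH_TO_SPEECH = "/v1/speech_to_speech"
    VOICE_DUBBING = "/v1/voice_dubbing"
    VOICE_ISOLATION = "/v1/voice_isolation"


def audio_endpoint(task: RunwayAudioTask, base_url: str) -> str:
    """Join a caller-supplied base URL with the task's path."""
    return base_url.rstrip("/") + task.value
```

With this shape, adding a sixth audio feature would mean adding one enum member rather than a new code path.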
TOPAZ LABS - Complete Features
Image Enhancement Models
Available:
- Standard V2 (general purpose)
- Low Resolution V2 (web graphics)
- CGI (digital illustrations)
- High Fidelity V2 (professional photo)
- Text Refine (text and shapes)
- Standard MAX
- Recovery V2
- Wonder
- Redefine
All Parameters
Basic:
- image (file upload)
- source_url (alternative to file)
- model (enum from above)
- output_height (1-32000)
- output_width (1-32000)
- crop_to_fill (boolean)
- output_format (jpeg/png/tiff)
Advanced (Model-specific):
- face_enhancement (boolean)
- face_enhancement_creativity (0-1)
- face_enhancement_strength (0-1)
- detail (0-1, for Super Focus)
- focus_boost (0.25-1, for Super Focus)
- strength (0.01-1, for upscaling)
- subject_detection (string)
- webhook_url (for async notifications)
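The numeric ranges above lend themselves to a table-driven check. The following is a sketch only: the function name is hypothetical, and treating image and source_url as mutually exclusive is an assumption drawn from the "alternative to file" wording.

```python
from typing import Any

# (min, max) bounds copied from the parameter list above.
NUMERIC_RANGES = {
    "output_height": (1, 32000),
    "output_width": (1, 32000),
    "face_enhancement_creativity": (0.0, 1.0),
    "face_enhancement_strength": (0.0, 1.0),
    "detail": (0.0, 1.0),
    "focus_boost": (0.25, 1.0),
    "strength": (0.01, 1.0),
}
OUTPUT_FORMATS = {"jpeg", "png", "tiff"}


def validate_topaz_params(params: dict[str, Any]) -> list[str]:
    """Return a list of human-readable problems; empty means valid."""
    errors: list[str] = []

    # Assumption: exactly one input source must be given.
    if ("image" in params) == ("source_url" in params):
        errors.append("provide exactly one of image or source_url")

    for name, (low, high) in NUMERIC_RANGES.items():
        if name in params and not low <= params[name] <= high:
            errors.append(f"{name} must be between {low} and {high}")

    fmt = params.get("output_format")
    if fmt is not None and fmt not in OUTPUT_FORMATS:
        errors.append("output_format must be jpeg, png, or tiff")

    return errors
```

Returning every problem at once, rather than raising on the first, lets a UI show all invalid fields in a single pass.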
Video Enhancement
Already researched; verify that the implementation matches:
- Complete upload workflow (create, accept, upload, complete, poll)
- All filter models
- Frame interpolation
- All enhancement options
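The five-step upload workflow (create, accept, upload, complete, poll) could be sequenced roughly as below. This is a sketch under stated assumptions: the VideoClient interface, its method names, the single upload URL, and the "complete" and "failed" state strings are all placeholders, not the actual Topaz API.

```python
import time
from typing import Callable, Protocol


class VideoClient(Protocol):
    """Hypothetical client interface; not the real Topaz SDK."""
    def create(self, metadata: dict) -> str: ...
    def accept(self, request_id: str) -> str: ...
    def upload(self, url: str, data: bytes) -> None: ...
    def complete(self, request_id: str) -> None: ...
    def status(self, request_id: str) -> str: ...


def run_video_job(
    client: VideoClient,
    metadata: dict,
    data: bytes,
    poll_interval: float = 5.0,
    max_polls: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Run the steps in order and poll until a terminal state."""
    request_id = client.create(metadata)     # 1. create the request
    upload_url = client.accept(request_id)   # 2. accept, get upload URL
    client.upload(upload_url, data)          # 3. upload the source video
    client.complete(request_id)              # 4. signal upload complete

    # 5. poll; the terminal state names are assumed.
    for _ in range(max_polls):
        state = client.status(request_id)
        if state in ("complete", "failed"):
            return state
        sleep(poll_interval)
    raise TimeoutError(f"job {request_id} did not finish in time")
```

Injecting the sleep function keeps the polling loop testable without real delays, and capping the number of polls avoids waiting forever on a stuck job.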
Current Implementation Gap Analysis
What's Missing:
- ❌ Runway Gen-4 Image provider (completely absent)
- ❌ Runway Audio features (5 endpoints)
- ❌ Topaz face enhancement controls (3 parameters)
- ❌ Topaz model-specific parameters (detail, focus_boost, strength)
- ❌ Full Topaz model list (only using 5/9 models)
Estimated Impact:
- Adding Runway Image: +1 image provider (87.5% → 90%)
- Completing Topaz: Better quality control for users
- Runway Audio: New capability category
Recommended Approach
Given the session length (~400K tokens used), the recommendation is:
NOW (This Session):
- Add Runway Gen-4 Image provider (highest value)
- Update Topaz with critical missing parameters
- Test both additions
NEXT SESSION:
- Add Runway Audio features
- Systematically review all 9 providers for completeness
- Add any missing parameters across the board
This ensures we deliver the highest-value features now while planning comprehensive completion.
User Response: Proceeding with implementation...