forge/COMPREHENSIVE_TODO_LIST.md
DJP f2506d9287 Add text tools to navigation menu
- Added Mermaid Generator, Renderer
- Added Markdown Converter, Generator
- All 4 new text tools now accessible from sidebar

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 (1M context) <noreply@anthropic.com>
2025-12-10 09:42:18 -05:00

10 KiB
Raw Blame History

📋 COMPREHENSIVE TODO LIST - Test, Fix, Add

Created: December 10, 2025 Status: Post-Session Checklist


🚨 CRITICAL - UI/Navigation Issues

Text Tools Not in Navigation

  • Add Mermaid Generator to sidebar/navigation under Text section
  • Add Mermaid Renderer to sidebar/navigation under Text section
  • Add Markdown Converter to sidebar/navigation under Text section
  • Add Markdown Generator to sidebar/navigation under Text section
  • Verify navigation links work
  • Add icons for each text tool in nav

Files to modify:

  • frontend/components/Sidebar.tsx or navigation component
  • Verify routing in frontend/app/ structure

🧪 TESTING NEEDED

Image Generation Providers

  • Test OpenAI GPT-Image-1 - switch quality levels
  • Test OpenAI DALL-E 3 - try vivid vs natural
  • Test Stability AI - use negative prompt + seed
  • Test Flux 2 Pro - try different step counts
  • Test Flux 2 Flex - verify parameter exposure
  • Test Flux 2 Dev - verify working
  • Test Ideogram V3 - try Magic Prompt ON vs OFF
  • Test Ideogram V2 styles - all 6 style types
  • Test Google Imagen 4 - try enhance prompt on/off
  • Test Imagen 4 Ultra - verify 2K size option
  • Test Nano Banana - verify images now appear
  • Test Runway Gen-4 Image - NEW provider!
  • Test with seed reproducibility
  • Test Leonardo (after fixing 500 error)
  • Verify controls change between providers
  • Test generating multiple images (where supported)

Video Generation

  • Test Veo 3.1 - verify video plays in browser
  • Test Veo with different durations (4s, 6s, 8s)
  • Test Veo 1080p resolution
  • Test Veo with negative prompt
  • Test Veo first/last frame selection
  • Test Runway video (after fixing 401)
  • Test Runway camera controls
  • Verify video aspect ratios work

Image Processing

  • Test Topaz Image Upscale - verify download_url fix
  • Test Topaz with face enhancement parameters
  • Test different Topaz models (all 9)
  • Test Background Removal (after fixing auth)
  • Verify upscaled images download correctly

Video Processing

  • Test Topaz Video Upscale
  • Verify video upload workflow
  • Test frame interpolation
  • Test Subtitle Generation
  • Test Subtitle Translation

Text Tools

  • Test Mermaid Generator - all 11 diagram types
  • Test Mermaid Renderer - all 4 themes
  • Test Markdown Converter - HTML + Plain text
  • Test Markdown Generator - all 5 content types
  • Verify copy/download functions work

Audio Tools

  • Test Voice-to-Text (after fixing endpoint)
  • Test Text-to-Speech with ElevenLabs
  • Test multiple voices
  • Test Sound Effects generation

🔧 FIXES NEEDED

API Authentication Issues

  • Runway Image - 401 Unauthorized

    • Verify endpoint: POST /v1/text_to_image
    • Check X-Runway-Version header (try latest version)
    • Test with valid API key provided
    • Check if endpoint changed to /v1/image/generate or similar
  • Runway Video - 401 Unauthorized

    • Same checks as above for video endpoints
    • Verify with new API key
  • ClippingMagic - 401 Unauthorized

    • Currently using API ID: 17403 and Secret
    • Verify HTTP Basic Auth format
    • Test credentials directly with curl
    • Check if second API key needed
  • Leonardo - 500 Internal Server Error

    • Verify API key is active
    • Check account status on leonardo.ai
    • Add more detailed error logging
    • Verify payload matches current API spec
    • Check if alchemy/photoReal have dependencies

Topaz Issues

  • Topaz Image - download_url field retrieval

    • Verify status endpoint returns download_url
    • Check field name variations
    • Add logging for status response
    • Test complete workflow end-to-end
  • Topaz Video - endpoint fixes applied, need testing

    • Test complete upload workflow
    • Verify all 4 steps (create, accept, upload, complete)
    • Test with actual video file

Frontend Build Issues

  • Fix TypeScript error in upscale page (line 223-224)
  • Add all Topaz controls to upscale UI properly
  • Verify no console errors on any page
  • Test in different browsers

Provider-Specific Issues

  • Bria - 404 endpoint (ON HOLD per user)
  • Verify all provider configs serialize correctly
  • Check all model names are accurate

FEATURES TO ADD

Runway Gen-4 Image Enhancements

  • Add reference image upload UI
  • Support up to 3 reference images
  • Add reference image tags
  • Add content moderation controls
  • Test reference image feature end-to-end

Topaz Complete Features (Frontend)

  • Add all 9 model options to dropdown with descriptions
  • Add face enhancement checkbox
  • Add face creativity slider (0-1)
  • Add face strength slider (0-1)
  • Add detail slider (0-1, for Super Focus)
  • Add focus boost slider (0.25-1, for Super Focus)
  • Add strength slider (0.01-1, for upscaling)
  • Add subject detection dropdown
  • Add crop to fill checkbox
  • Add conditional controls (show detail/focus only for Super Focus model)

Runway Audio Features (NEW Category)

  • Create /audio/sound-effects page
  • Create /audio/runway-tts page
  • Create /audio/speech-to-speech page
  • Create /audio/voice-dubbing page
  • Create /audio/voice-isolation page
  • Add all 5 endpoints to backend
  • Add to navigation menu

Provider Completeness Review

  • OpenAI - verify all GPT-Image-1 parameters present
  • Stability - add any missing SD3.5 parameters
  • Leonardo - add num_inference_steps if missing
  • Flux - verify all Flux 2 parameters
  • Imagen - check for additional V4 features
  • Ideogram - verify all V3 parameters
  • Review each provider's 2025 API docs systematically

Video Provider Enhancements

  • Runway - Add all Gen-4 video parameters
  • Runway - Add video upscale endpoint (4X)
  • Veo - Verify all 3.1 parameters present
  • Veo - Add video extension feature
  • Add sample_count controls for both

UI/UX Improvements

  • Add provider info tooltips
  • Show parameter descriptions on hover
  • Add loading states for all actions
  • Improve error messages
  • Add success notifications
  • Show estimated costs per provider
  • Add "favorite" providers feature
  • Remember last used settings

📐 IMAGE DISPLAY FIXES

  • Verify images fill containers properly (object-contain fix applied)
  • Test with different aspect ratios
  • Ensure portrait/landscape/square all display well
  • Fix any remaining small image issues
  • Add zoom/fullscreen for results
  • Add image comparison slider for before/after (upscale)

🔍 SYSTEMATIC PROVIDER VERIFICATION

For EACH Provider, Verify:

  • All models listed in config
  • All parameters in controls
  • Model-specific controls conditional
  • Descriptions accurate
  • Latest 2025 features included
  • Default values sensible
  • Min/max ranges correct
  • Required vs optional marked correctly

Providers to Review:

  1. OpenAI (2 models x ~6 params each)
  2. Stability AI (5 models, verify all params)
  3. Imagen 4 (3 models, verify all params)
  4. Leonardo (8 models, verify all params)
  5. Flux 2 (4 models, verify all params)
  6. Ideogram (3 models, verify all params)
  7. Nano Banana (2 models, verify all params)
  8. Bria (3 models - ON HOLD)
  9. Runway Image (1 model, add reference images)

🎬 VIDEO PROVIDER VERIFICATION

  • Runway - 4 models, all parameters
  • Veo - 5 models, all parameters
  • Verify camera controls work (Runway)
  • Verify frame controls work (Veo)
  • Test all aspect ratio options
  • Test all duration options
  • Verify resolution options

📱 MOBILE/RESPONSIVE

  • Test on mobile viewport
  • Verify controls are usable on small screens
  • Test image upload on mobile
  • Verify navigation works
  • Test job progress indicators

🔐 SECURITY & VALIDATION

  • Verify API keys not exposed in frontend
  • Add input validation for all forms
  • Sanitize user inputs
  • Add rate limiting considerations
  • Verify file upload size limits
  • Check for any XSS vulnerabilities

📚 DOCUMENTATION

  • Update README with new features
  • Document all 9 image providers
  • Document configuration system
  • Add API examples for each provider
  • Create troubleshooting guide
  • Document known limitations
  • Add setup instructions
  • Document environment variables needed

🐛 BUG VERIFICATION

Verify All Previous Bugs Stay Fixed:

  • Downloads work (asset reconciliation)
  • Topaz upscale accepts asset_id (no file upload)
  • Video duration extracted on upload
  • Image dimensions extracted
  • Metadata field name correct everywhere
  • No 422 errors on upscale endpoints

🎨 POLISH & QUALITY

  • Consistent error handling across all pages
  • Loading spinners on all async operations
  • Success/error toasts everywhere
  • Consistent button styling
  • Proper spacing and layout
  • Add keyboard shortcuts
  • Improve accessibility (ARIA labels)
  • Add dark mode support (if not already)

🚀 PERFORMANCE

  • Cache provider configs in frontend
  • Optimize image loading
  • Add pagination for job history
  • Optimize database queries
  • Add Redis caching where appropriate
  • Monitor bundle size
  • Lazy load components

📊 MONITORING & ANALYTICS

  • Add usage tracking
  • Monitor API costs
  • Track success/failure rates
  • Log errors to monitoring service
  • Add performance metrics
  • Create admin dashboard

🔄 DEPLOYMENT

  • Create production environment config
  • Set up CI/CD pipeline
  • Add database migrations
  • Configure backups
  • Set up monitoring/alerting
  • Create deployment documentation

IMMEDIATE PRIORITIES (Next Session):

  1. Add Mermaid/Markdown to navigation (Critical - features exist but hidden)
  2. Fix Runway 401 errors (both image and video)
  3. Test Topaz download_url fix (verify upscaling works)
  4. Fix ClippingMagic auth (test credentials)
  5. Update upscale UI (add all Topaz controls without breaking build)
  6. Systematic provider testing (verify all 9 work)
  7. Add Runway reference images (complete the feature)
  8. Fix Leonardo 500 (debug and resolve)

Estimated Work Remaining: 15-20 hours for 100% completion

Current Status: 85%+ functional, excellent foundation established

Next Step: Start with navigation fixes so text tools are accessible!