ferrero-opentext

Author	SHA1	Message	Date
DJP	9d207d0480	chore: Update DAM mTLS base and OAuth URLs in production environment to `/token` endpoint.	2025-11-26 10:18:56 -05:00
DJP	6064b0971e	Fix smoke test to explicitly load .env-prod file	2025-11-26 10:14:26 -05:00
DJP	cabc1d5548	Add production smoke test script for mTLS V2 authentication	2025-11-26 10:08:04 -05:00
DJP	c1f338022c	fix: Ensure type field is added when updating CreativeX URL - Modified _set_field_value to include 'type': 'string' in all code paths - Adds type field when updating existing CreativeX URL field - Ensures consistent structure whether creating or updating field	2025-11-25 09:14:25 -05:00
DJP	80316cad32	fix: Add missing type field to CreativeX URL metadata - Added 'type': 'string' to FERRERO.FIELD.CREATIVEX LINK value structure - Fixes DAM validation error for CreativeX URL field - Structure now matches DAM requirements	2025-11-25 09:11:41 -05:00
DJP	548c30344b	feat: Support multiple CreativeX platforms in metadata - Updated creativex_scoring_storing.py to map multiple placements to platforms - Modified get_mapped_platform to get_mapped_platforms (returns list) - Updated a2_to_a3_upload_polling.py to retrieve platforms list from DB - Enhanced metadata_extractor_mvp.py to build multi-value CreativeX field - Added DAM-CX mappings.csv for channel/placement to platform mapping - Supports single channel with multiple placements generating multiple Platform^Score values	2025-11-24 14:44:11 -05:00
DJP	491fc8e938	feat: Add A1→A3 campaign advance script, introduce systemd service for orchestrator, and ref	2025-11-24 13:50:16 -05:00
DJP	0af15563bc	feat: Implement new Python script locking, relocate PHP workflow, and update Python scripts and documentation.	2025-11-21 17:20:34 -05:00
DJP	22069ed66f	refactor: Relocate test scripts to a dedicated `tests/` directory and introduce `orchestrator.py`.	2025-11-21 17:10:04 -05:00
DJP	2cad2c2955	Fix AttributeError in DAMClient.test_connection	2025-11-21 16:57:43 -05:00
DJP	5aeab8d9a3	Update a1_to_a2_download.py to support Auth V2	2025-11-21 16:53:32 -05:00
DJP	6fe2ba234b	Implement Auth V2 (Hybrid mTLS/OAuth) and update field mappings	2025-11-21 16:46:37 -05:00
DJP	b906434f67	Add A1->A2 and A4 Box CSV uploader scripts	2025-11-20 22:52:26 -05:00
DJP	20a187a61d	feat: Add Python project dependencies.	2025-11-20 22:45:04 -05:00
DJP	0bf3c34cd0	Add asset type mapping from 3-letter codes to DAM codes Maps frontend naming tool 3-letter codes (EHI, IMG, TVC) to DAM's lowercase descriptive codes (heroimage, keyvisual, tvc) for uploads. New File: config/asset_type_mappings.yaml - 45 asset type mappings from DAM lookup domains - Maps 3-letter codes to DAM codes - E-Commerce types: EHI→heroimage, EBS→beautyshot, etc. - Standard types: IMG→keyvisual, TVC→tvc, etc. - Expandable as new asset types added Field Mappings Update (field_mappings.yaml): - Added FERRERO.FIELD.MKTG.ASSET TYPE to filename_updates - Source: asset_type (from parsed filename) - Required: true Metadata Extractor Updates (metadata_extractor_mvp.py): - Added _load_asset_type_mappings() method - Added _map_asset_type() method - Integrated mapping into _update_fields() - Logs mappings: "Asset type mapping: EHI -> heroimage" - Warns if no mapping found (may fail DAM validation) Example Flow: 1. Filename: ROC_TEST-E2E2_EHI_1x1_DE_de.png 2. Parse: asset_type = "EHI" 3. Map: EHI → "heroimage" 4. Update field: FERRERO.FIELD.MKTG.ASSET TYPE = "heroimage" 5. DAM accepts "heroimage" (valid domain value) Without Mapping (before): - Field value: "EHI" - DAM validation: FAILS (EHI not in domain) With Mapping (after): - Field value: "heroimage" - DAM validation: PASSES ✅ Complete Mapping List (45 types): E-Commerce: ECA, ECB, EBS, EBR, EEM, EHI, EIL, EOP, EUG, EWB Standard: 3RT, APC, BBK, BRC, BSG, CKV, CID, DAT, FLA, FNT, GDT, GRG, IMG, FPO, LGL, LOG, MLF, OLV, PAW, PKI, POS, PDM, PRI, QRC, SND, SIP, SGL, TVC, VIE DAM Answer: Uses descriptive lowercase codes, not 3-letter codes. Frontend uses 3-letter for brevity, backend maps to DAM format. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-14 13:08:01 -05:00
DJP	d1ab8551e5	Add asset type filename update and make field updates configurable Enables asset type to be updated from derivative filename and refactors _update_fields() to use filename_updates configuration dynamically. Field Mappings Configuration (field_mappings.yaml): Added to filename_updates: - FERRERO.FIELD.MKTG.ASSET TYPE: source: asset_type required: true Now updates from derivative filename: - ROC_TEST-E2E2_EHI_1x1_DE_de.png → Asset Type = "EHI" Metadata Extractor Refactor (metadata_extractor_mvp.py): Old _update_fields(): - Hardcoded field updates (ASSET NAME, DESCRIPTION, STATE) - Not using filename_updates configuration - Required code changes to add new fields New _update_fields(): - Dynamically processes filename_updates from config - Supports transform: uppercase/lowercase - Supports any source field from parsed_filename - Uses forced_values from config (was hardcoded before) - Add new fields via config, no code changes needed Configuration-Driven Updates: - ARTESIA.FIELD.ASSET NAME ← clean_filename - ARTESIA.FIELD.ASSET DESCRIPTION ← subject_title - FERRERO.FIELD.MKTG.ASSET TYPE ← asset_type (NEW) - MAIN_LANGUAGES ← language_code (uppercase) - FERRERO.FIELD.STATE ← "Local" (forced value) Benefits: - Asset type now correctly populated from filename - Configuration-driven (add fields without code changes) - Cleaner code (uses config instead of hardcoded logic) - Forced values also configurable - Easier to maintain and extend Example: Filename: ROC_TEST-E2E2_EHI_1x1_DE_de.png Parsed asset_type: "EHI" Field FERRERO.FIELD.MKTG.ASSET TYPE updated to: "EHI" Impact: All A2→A3 uploads will now have correct Asset Type from derivative filename instead of inheriting from master (which may be different). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-14 12:58:10 -05:00
DJP	e44f42dad5	Add master CreativeX score extraction and storage in A1→A2 workflow Stores master asset CreativeX scores from DAM metadata during A1→A2 download for reference/reporting purposes (not used in uploads). Database Changes: creativex_scores table: - Added: tracking_id VARCHAR(6) column - Added: idx_creativex_tracking_id index - Updated comment: status can be 'active', 'superseded', or 'master-cx-score' Status Values: - 'active' - Current derivative score (from PDF extraction) - 'superseded' - Old derivative score (version history) - 'master-cx-score' - Master asset score (from A1→A2 DAM metadata) ← NEW Migration SQL (for existing databases): ALTER TABLE creativex_scores ADD COLUMN tracking_id VARCHAR(6); CREATE INDEX idx_creativex_tracking_id ON creativex_scores(tracking_id); Database Method Updates (database.py): store_creativex_score() signature: - Added: tracking_id parameter (optional, default None) - Added: status parameter (optional, default 'active') Logic: - If status='master-cx-score': Simple insert, no versioning - If status='active': Soft delete versioning as before - Always stores tracking_id if provided A1→A2 Script Updates (a1_to_a2_download.py): New Function: extract_creativex_from_dam_metadata() - Searches metadata_element_list for CREATIVEX fields - Extracts FERRERO.TAB.FIELD.CREATIVEX (score) - Extracts FERRERO.FIELD.CREATIVEX LINK (url) - Returns dict with score/url or None if not found - Handles tabular field structure for score - Handles nested value structure for URL Integration: - After successful master asset storage - Extracts CreativeX from asset metadata - If found: Stores in creativex_scores with status='master-cx-score' - Links to master via tracking_id - Logs when score found/stored - Logs "normal" when not found (not all masters are scored) Use Cases: A2→A3 Upload: - Still uses filename-based lookup ONLY ✅ - No changes to A2→A3 logic ✅ - Master scores not used for uploads ✅ Reporting/Analytics Tools: - Can query master score by tracking_id - Compare master vs derivative scores - Track score improvements - Audit trail Query Examples: -- Get master score for tracking ID SELECT * FROM creativex_scores WHERE tracking_id = '7xXgKp' AND status = 'master-cx-score'; -- Get derivative score for filename SELECT * FROM creativex_scores WHERE filename = 'file.mp4' AND status = 'active'; Test Record Created: - Filename: nutella_pbased.jpg - Tracking ID: 7xXgKp - Score: 85 - Status: master-cx-score Benefits: - Historical reference of master scores - Enables score comparison analytics - No impact on A2→A3 upload logic - Automatic extraction during A1→A2 - Optional (works even if masters don't have scores) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 13:56:22 -05:00
DJP	68180b23cf	Update filename parser to V2.1 structure with new field positions Complete rewrite of filename parser to support new field order where Subject/Asset moved up and Country/Language moved down, plus new Social Media field. BREAKING CHANGE: V2.1 Structure (November 2025) Old (V1): [JOB]_[BRAND]_[COUNTRY]_[LANG]_[SUBJECT]_[ASSET]_[SPOT]_[DUR]_[RATIO]_[TRACKING] New (V2.1): [JOB]_[BRAND]_[SUBJECT]_[ASSET]_[DUR]_[RATIO]_[SPOT]_[COUNTRY]_[LANG]_[SOCIAL]_[TRACKING] Field Position Changes: - Subject Title: Position 5 → 3 (MOVED UP) - Asset Type: Position 6 → 4 (MOVED UP) - Duration: Position 8 → 5 (MOVED UP) - Aspect Ratio: Position 9 → 6 (MOVED UP) - Spot Version: 7 → 7 (SAME) - Country Code: Position 3 → 8 (MOVED DOWN) - Language Code: Position 4 → 9 (MOVED DOWN) - Social Media: NEW → Position 10 - Tracking ID: Position 10 → 11 New Social Media Field: - Field: social_media_version - Position: 10 (after language, before tracking) - Format: 3 uppercase letters - Codes: FBP, FBR, IGF, IGR (expandable) - Optional: Only present for social media assets Parse Algorithm Changes: - Positions 1-4 now: Job, Brand, Subject, Asset (fixed) - Positions 5-11: Pattern-based detection (flexible) - Duration detected by \d+S pattern - Aspect ratio detected by \d+x\d+ or contains 'x' - Spot detected by MST/REF - Country detected by 2 upper alpha (after ratio) - Language detected by 2-3 lower alpha (after country) - Social detected by known codes (after language) - Tracking ID detected by 6 alphanumeric + optional -N strip_upload_components() Updated: Now outputs: [BRAND]_[SUBJECT]_[ASSET]_[DUR]_[RATIO]_[SPOT]_[COUNTRY]_[LANG]_[SOCIAL] - Includes social media version if present - Still strips job number and tracking ID Testing: All 7 test cases from specification passed: ✅ All fields present ✅ Minimal (no duration/social/tracking) ✅ No duration ✅ No spot version ✅ With -N tracking (folder-only mode) ✅ No social media (most common) ✅ No tracking ID Example: Input: 1234567_RAF_TEST_OLV_6S_1x1_REF_GL_it_IGF_abc123.mp4 Parsed: brand=RAF, subject=TEST, country=GL, lang=it, social=IGF Clean: RAF_TEST_OLV_6S_1x1_REF_GL_it_IGF.mp4 Backward Compatibility: None - system not live yet, clean cutover to V2.1 format only. Backup: filename_parser_v1_backup.py contains old version for reference. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 13:41:19 -05:00
DJP	15eb47fc43	Add CreativeX fields to asset representation if missing from master metadata Fixes issue where CreativeX score field was not appearing in final upload because it didn't exist in the master metadata from DAM. Problem: - Master metadata from A1→A2 doesn't include CREATIVEX fields (new fields) - _update_creativex_fields() only UPDATED existing fields - If field not present, it logged error but didn't add the field - Result: CREATIVEX score missing from upload, only URL appeared Solution: - Check if CREATIVEX Score field exists in mvp_fields - If NOT found: Create and append field with proper structure - If found: Update value as before - Same logic for CREATIVEX URL field Field Structures Created: CREATIVEX Score (FERRERO.TAB.FIELD.CREATIVEX): - Type: MetadataTableField (tabular field) - Parent: FERRERO.TABULAR.FIELD.PLATFORMRATING - Data type: INTEGER - Value structure: {'value': {'value': score}} CREATIVEX URL (FERRERO.FIELD.CREATIVEX LINK): - Type: MetadataField (regular field) - Data type: CHAR - Value structure: {'value': {'value': url}} Logging: - Changed from ERROR to WARNING when field not found - Logs "adding it now" instead of just error - Confirms field added with value Impact: Both CreativeX fields will now appear in uploads even if master metadata doesn't have them (common for older campaigns downloaded before CreativeX integration). Testing: Run with --dryrun to verify both CREATIVEX fields in JSON output. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 12:44:17 -05:00
DJP	39a41df21d	Fix CreativeX lookup to use original Box filename not stripped version Changes database lookup strategy to match on full filename as it appears in Box and in the CreativeX PDF report filename field. Critical Design Change: Old (incorrect): - Strip job number and tracking ID from Box filename - Lookup: NUT_PL_pl_TEST-E2E_EHI_1x1.png - Database has: 6487512_NUT_PL_pl_TEST-E2E_EHI_1x1_7xXgKp.png - RESULT: No match found, uses defaults New (correct): - Use original Box filename for lookup - Lookup: 6487512_NUT_PL_pl_TEST-E2E_EHI_1x1_7xXgKp.png - Database has: 6487512_NUT_PL_pl_TEST-E2E_EHI_1x1_7xXgKp.png - RESULT: Match found, uses actual score Rationale: The CreativeX PDF report contains a "filename" field that stores the actual asset filename including job number and tracking ID. This is the name that gets extracted by LlamaExtract and stored in database. The A2→A3 workflow receives files from Box with the SAME filename structure (job_brand_country_lang_subject_trackingID.ext). Therefore, we match on the complete original filename, not the stripped version. Database Storage Pattern: - CreativeX PDF named: anything.pdf (name doesn't matter) - PDF contains field: filename = "6487512_NUT_PL_pl_TEST-E2E_EHI_1x1_7xXgKp.png" - Database stores: filename = "6487512_NUT_PL_pl_TEST-E2E_EHI_1x1_7xXgKp.png" - A2→A3 receives: 6487512_NUT_PL_pl_TEST-E2E_EHI_1x1_7xXgKp.png from Box - Lookup matches exactly Clean filename still used for DAM upload, only the lookup is on original. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 12:29:55 -05:00
DJP	2fef8878cd	Add verbose debugging to _set_field_value for CreativeX troubleshooting Adds detailed logging to trace exactly how field values are being set and diagnose why CreativeX score/URL aren't appearing in final JSON. Changes to _set_field_value(): - Logs field ID being updated - Logs current field['value'] structure BEFORE setting - Logs which code path is taken (nested vs created) - Logs field['value'] structure AFTER setting - Shows full JSON structure at each step Output Example: _set_field_value called for: FERRERO.TAB.FIELD.CREATIVEX with value: 85 Current field['value']: { "is_locked": false, "domain_value": false, ... } Created field['value'] = {'value': {'value': 85}} After setting, field['value']: { "value": { "value": 85 } } Purpose: Diagnose why CreativeX fields show empty value dicts in asset representation even though logs say "Set CREATIVEX Score to: 0". This verbose logging will show: 1. What the field structure looks like before we set it 2. Which code path is executed 3. What the field structure looks like after we set it 4. Whether the value is actually being placed in the right location Run with --dryrun to see full debug output without uploading. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 12:16:53 -05:00
DJP	d3722d1bb2	Fix CreativeX URL logging to say 'from database' not 'from Box' Corrects misleading log messages and adds debugging for URL field. Changes: - Changed: "Updating CreativeX URL from Box" - To: "Updating CreativeX URL from database" - Added url_field_found flag - Added URL field structure logging - Added error handling with traceback - Logs error if URL field not found in mvp_fields Now both CreativeX fields log correctly: - "Updating CreativeX Score from database: 0" - "Updating CreativeX URL from database: https://..." Accurate logging shows data source is PostgreSQL creativex_scores table, not Box metadata templates (which are no longer used). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 12:11:58 -05:00
DJP	899d15322b	Improve CreativeX field value setting with better error handling Enhances _set_field_value() to handle empty value structures and adds detailed logging for debugging CreativeX field issues. Changes to metadata_extractor_mvp.py: _set_field_value() Enhancement: - Handles empty nested dicts by creating value structure - If field['value']['value'] is empty dict, creates {'value': value} - If field['value'] is empty dict, creates {'value': {'value': value}} - Preserves existing behavior for populated structures CreativeX Score Field Debugging: - Added score_field_found flag to detect if field exists in mvp_fields - Logs field structure before attempting to set value - Shows dict keys to understand nesting - Catches and logs full traceback on errors - Errors if CREATIVEX Score field not found in mvp_fields - Changed log: "from Box" → "from database" (accurate) CreativeX URL Field: - Existing logic preserved - Uses same enhanced _set_field_value() Purpose: Diagnose why CreativeX score not appearing in asset representation even though logs show "Set CREATIVEX Score to: 0" Next Steps: Run with --dryrun to see field structure logging and verify values are being set correctly in the JSON output. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 12:11:22 -05:00
DJP	cdb839e3a8	Add --dryrun mode to A2→A3 and fix Agency name to Oliver Adds debugging mode to A2→A3 workflow that builds full asset metadata but doesn't upload to DAM, displaying complete JSON for field validation. Changes to A2→A3 Script (a2_to_a3_upload_polling.py): --dryrun Flag: - New argument: --dryrun (build metadata but don't upload) - Displays full asset representation as formatted JSON - Shows field count - Shows CreativeX lookup status - Keeps file in Box (no deletion) - Logs "DRYRUN MODE" clearly - Returns success with 'DRYRUN_NO_UPLOAD' as asset_id Dryrun Output Includes: - Complete asset_representation JSON (all MVP fields) - Field count (should be 27 fields) - CreativeX status (found/missing) - CreativeX score and URL values - Clean separation with === lines Usage: python scripts/a2_to_a3_upload_polling.py --dryrun Benefits: - Debug metadata issues without DAM uploads - Verify all fields present before going live - Check CreativeX integration working - Validate field values and formatting - Safe testing with production data Changes to Field Mappings (config/field_mappings.yaml): Agency Name Fixed: - Changed: FERRERO.MARKETING.FIELD.AGENCY NAME: "-" - To: FERRERO.MARKETING.FIELD.AGENCY NAME: "Oliver" - Exact case as required by DAM - Comment updated to reflect this is final value Impact: - All A2→A3 uploads now have Agency Name = "Oliver" - Not "Oliver Agency" (wrong) - Not "-" placeholder (old) - Exact case: "Oliver" (capital O, lowercase liver) Use Case: Run with --dryrun to see full JSON metadata, verify Agency name is "Oliver", check all 27 MVP fields are present, then remove --dryrun flag to perform actual uploads. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 11:59:55 -05:00
DJP	ca43aaf1bc	Remove email notification when no CreativeX files found Changes behavior to only log when Box folder is empty, not send emails. Rationale: - Empty folder is normal operation (not an error condition) - Reduces email noise when script runs on cron - Still logs the event for monitoring - Similar to other workflows (a1_to_a2, etc.) that don't email when no work to do Changes: - Removed notifier.send_email() call for 'creativex_no_files' template - Enhanced log message: "No PDF files found - this is normal when folder is empty" - Added: "Script completed successfully with no files to process" - Still returns success=True (not an error) Template 'creativex_no_files' retained for potential future use but not called. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 20:58:09 -05:00
DJP	946c580024	Integrate CreativeX database lookup into A2→A3 upload workflow Replaces Box metadata template with database lookup for CreativeX scores, adding automatic fallback to default values when scores are missing. Changes to A2→A3 Script (a2_to_a3_upload_polling.py): - Removed: box.get_file_metadata() for CreativeX data - Added: db.get_creativex_score_by_filename() database lookup - Uses clean filename (stripped of OMG Job + Tracking ID) for lookup - Implements fallback when score not found: - Default score: 0 - Default URL: https://app.creativex.com/preflight/pretests - Tracks creativex_found flag for email notifications - Logs warnings when defaults are used Email Template Updates (notifier.py): - Shows "✅ CreativeX Score Added: 85 (from database)" when found - Shows "⚠️ CreativeX Score: Not found - used default (0)" when missing - Adds orange warning box when defaults used: - Lists default values (Score: 0, placeholder URL) - Provides instructions to add score - References Box folder 350605024645 and scoring script Benefits: - Automatic CreativeX lookup (no manual Box metadata entry) - Graceful degradation (uploads succeed even without scores) - Clear notification when scores are missing - Preserves history (uses latest active version) - No breaking changes (existing workflow continues to work) Default Value Strategy: - Score 0 indicates "not scored" but doesn't block upload - Placeholder URL is valid CreativeX domain - Email clearly shows when defaults are used - Provides actionable instructions for adding scores Workflow Integration: 1. CreativeX PDFs uploaded to folder 350605024645 2. creativex_scoring_storing.py extracts and stores scores 3. A2→A3 automatically looks up scores by filename 4. Uploads proceed with actual scores OR defaults 5. Email indicates which path was taken Documentation: A2_A3_CREATIVEX_INTEGRATION.md 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 17:02:25 -05:00
DJP	6fee0cc725	Add version tracking and remove .0 decimals from CreativeX scores Implements version counter for re-scored files and cleans up numeric formatting. Decimal Removal: - Strip .0 suffix from creativex_id (6864255.0 → 6864255) - Strip .0 suffix from quality_score (80.0 → 80) - Converts float → int → string before storing - Cleaner data for display and DAM integration Version Tracking: - Counts total versions per filename (active + superseded) - Returns version_number in database result - Logs show version: "Score 80 extracted (Version 3)" - Email templates display version badges for updates Email Template Updates: - Complete template: Shows "Version 3 (Updated)" badge in header - Includes note: "This is version 3 of this file" - Partial template: Shows "(Version 3)" inline - Only displays version info if > 1 Database Changes: - Query counts ALL versions before insert - Returns version_number in result dict - Logs include version in success/update messages Benefits: - Clean numeric values without unnecessary decimals - Users can see if file was re-scored - Version history visible in emails - Still preserves all history in database - A2→A3 integration unaffected (always gets latest active) Example progression: Upload 1: Score 80 (no version shown - it's the first) Upload 2: Score 85 (Version 2 badge shown) Upload 3: Score 90 (Version 3 badge shown) Documentation: CREATIVEX_VERSION_UPDATES.md 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 16:55:07 -05:00
DJP	1264eaf9bc	Implement soft delete for CreativeX scores with history preservation Adds upsert logic that marks old records as 'superseded' while creating new 'active' records, preserving full history for audit/analysis. Changes: - Updated store_creativex_score() to check for existing filename - Old records marked status='superseded' before inserting new 'active' record - Returns is_update flag to indicate if this was an update vs new insert - Logs score changes (e.g., "Score: 80.0 -> 85.0") Documentation updates: - Added "Understanding Status Field" section with soft delete explanation - Separated queries into "Latest Scores" vs "History/Audit" sections - Added A2→A3 integration guide with example code - Documented query logic and behavior table for future integration - Added migration notes for existing data Query patterns for A2→A3: - status='active' → Latest/current score (use this in workflows) - status='superseded' → Previous scores (history/audit trail) - get_creativex_score_by_filename() automatically filters for active Benefits: - Easy lookup of latest scores (just filter status='active') - Full history preserved for tracking score changes over time - No data loss when files are re-scored - Clear audit trail of when scores changed Tested and verified: - Existing record (80.0) marked as superseded - New record (85.0) created as active - Queries correctly return only active record 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 16:40:46 -05:00
DJP	b6b9d7337a	Add CreativeX score extraction and storage system Implements new workflow to extract CreativeX quality scores from PDFs using LlamaExtract AI and store results in PostgreSQL database. Components added: - creativex_scoring_storing.py: Main script to process PDFs from Box - creativex_scores table: Database table with JSONB for full JSON storage - Database methods: store_creativex_score() and get_creativex_score_by_filename() - Email templates: creativex_complete, creativex_partial, creativex_no_files - Configuration: creativex section in config.yaml - CREATIVEX_DEPLOYMENT.md: Complete deployment and usage guide Features: - Monitors Box folder 350605024645 for PDFs - Extracts scores using LlamaExtract agent "Creativex-Extract" - Stores 4 key fields (filename, ID, URL, score) + full JSON - Deletes processed PDFs from Box after successful extraction - Sends email notifications for success/partial/no-files scenarios - Manual execution (python scripts/creativex_scoring_storing.py) Database schema: - Table: creativex_scores with 10 columns - Indexes on filename, box_file_id, status for fast lookups - JSONB column stores complete extraction for future flexibility Future integration ready: db.get_creativex_score_by_filename() available for DAM upload workflows to attach CreativeX metadata during asset processing. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 16:15:45 -05:00
DJP	60a2f09e42	Add CreativeX fields to MVP and fix score update logic Added CreativeX fields to MVP fields list: - FERRERO.TAB.FIELD.CREATIVEX (CreativeX Score from Box) - FERRERO.FIELD.CREATIVEX LINK (CreativeX URL from Box) Fixed _update_creativex_fields method: - Now actually SETS CreativeX Score value (was only logging before) - Uses _set_field_value() to update the field - Added try/except for tabular field structure handling - Both score and URL now properly set from Box metadata Flow: 1. Box metadata retrieved from Ferrero-DAM-Metadata template 2. creativexScore and creativexUrl extracted 3. Passed to metadata extractor 4. Applied to MVP fields during upload 5. CreativeX data preserved from Box → DAM 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 13:54:39 -05:00
DJP	0c2edc560f	Add targeted campaign status update script Created update_campaign_status.py - targeted campaign status control: - Update specific campaign by number (--camp C000000078) - Or search by partial campaign name (--camp "CONTENT SCALING") - Set to any status (--status A1, A2, A3, etc.) - Searches all statuses to find the campaign - If multiple matches, lets user choose - Interactive yes/no confirmation - Shows current and target status - Supports --auth-pfx for mTLS authentication Usage examples: python scripts/update_campaign_status.py --camp C000000078 --status A2 python scripts/update_campaign_status.py --camp "KINDER" --status A4 python scripts/update_campaign_status.py --camp C000000551 --status A1 --auth-pfx Features: - Required flags: --camp and --status - Searches across all statuses (A1-A6, B1-B2) - Partial name matching (case insensitive) - Multiple match handling with user selection - Clear success/failure feedback Updated README with complete documentation and examples 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 12:49:46 -05:00
DJP	a2e1058d20	Add interactive campaign status management scripts Created 2 new interactive scripts for manual campaign status control: 1. reset_campaign_to_a1.py - Resets any campaign from any A# status back to A1 - Allows filtering by specific status (--status A2, A3, A4, etc.) - Interactive yes/no confirmation for each campaign - Shows campaign name, number, and current status - Processes one at a time, waits for user input - Summary statistics at the end 2. advance_a1_to_a3.py - Advances A1 campaigns directly to A3 (skips A2) - Useful for testing A2→A3 upload workflow - Interactive yes/no confirmation for each campaign - Shows campaign details before advancing - Summary statistics at the end Both scripts: - Support --auth-pfx flag for mTLS authentication - Process campaigns one at a time (not batch) - Wait for user input before proceeding - Provide clear feedback on success/failure - Useful for testing and manual workflow control Updated README.md with complete documentation and examples 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-07 15:23:26 -05:00
DJP	15aa68da92	Add --auth-pfx flag to A2→A3 upload script for mTLS support - Added --auth-pfx argument to support mTLS certificate authentication - DAMClient now initialized with use_mtls parameter - Logs authentication method on startup - Matches authentication options in other scripts (A1→A2, A4, A5→A6, B1→B2) Usage: python scripts/a2_to_a3_upload_polling.py # OAuth2 (default) python scripts/a2_to_a3_upload_polling.py --auth-pfx # mTLS certificate 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 19:44:06 -05:00
DJP	d5c71b4c45	Add country code mapping system (ISO -> DAM codes) - Created config/country_code_mappings.yaml with 165 country mappings - ISO codes (used in filenames) now map to DAM-specific codes - Codes under review set to XX as placeholder - Added load_country_code_mappings() to config_loader.py - Updated MetadataExtractorMVP to load and apply country mappings - Added _map_country_code() and _get_field_value() helper methods - Country mapping applies in both full and folder-only modes Key mappings: - BD (Bangladesh) -> BG (DAM code, though appears incorrect) - DE (Germany) -> DE (same) - IT (Italy) -> IT (same) - Most codes under review -> XX (placeholder) Mapping file can be edited without code changes - updates apply automatically 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 19:15:26 -05:00
DJP	914a178dc5	Implement V2 naming convention updates and folder structure support Major changes: 1. Updated filename_parser.py for new V2 naming convention: - Spot version now accepts only MST or REF (optional) - Duration field is now optional - Tracking ID supports -N suffix for folder-only mode - Reduced minimum required parts from 9 to 7 - Improved asset type detection logic 2. Added recursive folder scanning to box_client.py: - New list_folder_files_recursive() method - Skips first-level job/batch folders - Preserves folder structure from 2nd level onwards - Skips hidden folders (starting with . or _) 3. Updated A2→A3 upload workflow: - Uses recursive folder scanning - Extracts and logs tracking mode (full vs folder_only) - Handles subfolder paths for DAM uploads - Shows folder distribution in logs 4. Added folder-only mode to metadata_extractor_mvp.py: - New tracking_mode parameter (full/folder_only) - folder_only mode builds metadata entirely from filename - New _build_fields_from_filename() method 5. Added DAM subfolder creation to dam_client.py: - New get_or_create_subfolder_path() method - Creates matching folder structure in DAM - Helper methods _find_subfolder_by_name() and _create_folder() Folder structure behavior: - Box: DAM-UPLOAD/1234567/Europe/Germany/file.mp4 - DAM: 01. Final Assets/Europe/Germany/file.mp4 - Job folder (1234567) is skipped, structure preserved from 2nd level 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 18:33:35 -05:00
DJP	0d69c2b37f	Update A4 webhook monitor to process all campaigns at once - Changed from single-campaign mode to process all A4 campaigns in one run - Added summary statistics (total found, webhooks sent, already processed, failed) - Since webhooks are lightweight operations, processing all at once is efficient - Duplicate prevention still works via database tracking 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 17:34:24 -05:00
DJP	89d36fc6b6	Fix campaign_number NULL constraint violation in A4 webhook - Changed campaign.get('campaign_id', 'N/A') to use 'or' operator - Now returns 'UNKNOWN' instead of None when campaign_id is missing - Prevents NULL constraint violation in campaign_status table 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 16:40:21 -05:00
DJP	5f6d24c550	Fix timestamp bug in campaign status recording - Fixed database.py line 479: Changed 'CURRENT_TIMESTAMP' string to actual datetime - Added datetime import for proper UTC timestamp generation - This fixes the PostgreSQL error: invalid input syntax for type timestamp - Added migration file for campaign_status table 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 16:34:46 -05:00
DJP	6b49ee5e9e	Add A4 webhook monitor and Live Campaign field to webhooks New A4 monitoring script + updated A1→A2 webhook with Live Campaign indicator. NEW SCRIPT: a4_webhook_monitor.py - Monitors campaigns with status A4 - Sends webhook when A4 campaigns found - A4 = Campaign NOT going live - Webhook payload includes: "live_campaign": "NO" A4 WEBHOOK PAYLOAD: { "campaign_id": "abc123...", "campaign_number": "C000000078", "campaign_name": "Campaign Name", "status": "A4", "live_campaign": "NO", ← NEW FIELD! "timestamp": 1234567890, "message": "Campaign marked A4 - Not going live" } A1→A2 WEBHOOK UPDATE: Added "live_campaign": "YES" to existing webhook - A1→A2 campaigns ARE going live - Webhook now includes live campaign indicator A1→A2 WEBHOOK PAYLOAD (UPDATED): { "campaign_id": "abc123...", "campaign_number": "C000000078", "campaign_name": "Campaign Name", "old_status": "A1", "new_status": "A2", "live_campaign": "YES", ← NEW FIELD! "asset_count": 5, "processed_assets": [...], "timestamp": 1234567890 } EMAIL NOTIFICATION: NEW: a4_webhook_sent template - Gray theme for A4 status - Shows "Live Campaign: NO" prominently - Lists webhook URL and payload details USAGE: Default (OAuth2): python scripts/a4_webhook_monitor.py With mTLS: python scripts/a4_webhook_monitor.py --auth-pfx CRON SCHEDULE (same as other workflows): /5 * * * cd ~/Python-Version && venv/bin/python scripts/a4_webhook_monitor.py >> logs/cron_a4.log 2>&1 WORKFLOW SUMMARY: - A1→A2: Webhook with "live_campaign": "YES" (going live) - A4: Webhook with "live_campaign": "NO" (not going live) - A5→A6: No webhook (rework workflow) - B1→B2: No webhook (global masters) Changes: - NEW: scripts/a4_webhook_monitor.py (200 lines) - EDIT: scripts/a1_to_a2_download.py (added live_campaign: YES) - EDIT: scripts/shared/notifier.py (added a4_webhook_sent template) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 16:34:46 -05:00
DJP	766063079c	Remove slow job polling, add fast folder search for asset IDs Replaces 20+ second job polling with instant folder search. PERFORMANCE IMPROVEMENT: Before: Poll job 10 times, 2 sec each = 20+ seconds per upload After: No polling, instant return + optional folder search CHANGES: 1. Removed job polling from upload_asset() - No more 20 second waits - Returns job_id immediately for async uploads - Adds 'is_async' flag to response 2. NEW METHOD: find_asset_by_filename_in_folder() - Fast search by filename in folder - Can be called after batch uploads complete - Returns actual asset ID instantly UPLOAD FLOW: Immediate Response (201): → Returns asset ID immediately → Log: "Upload successful (immediate): file.jpg → Asset ID: abc123" Async Response (202): → Returns job ID immediately (no waiting!) → Log: "Upload accepted (async): file.jpg → Job ID: job456" → Log: "Note: Job processing in background. Asset ID can be found later." FINDING ASSET ID LATER (OPTIONAL): After batch uploads, call once per folder: ```python # Upload all files first (fast!) for file in files: result = dam.upload_asset(...) job_ids.append(result['job_id']) # Then search folder for actual IDs (one API call) for filename in filenames: asset_id = dam.find_asset_by_filename_in_folder(folder_id, filename) ``` BENEFITS: ✓ No 20 second waits per file ✓ Batch uploads can run quickly ✓ Optional post-upload search for asset IDs ✓ Single API call to get all IDs USE CASES: - Fast uploads: Don't need immediate asset ID - Batch processing: Upload many files quickly - Later retrieval: Search folder when needed - Status updates: Can update campaign without waiting The job_id is stored and can be used for tracking. Actual asset_id can be retrieved later if needed. Changes: - scripts/shared/dam_client.py - Removed polling from upload_asset() - Added find_asset_by_filename_in_folder() method - Returns immediately with job_id for async - Added 'is_async' flag to response 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 14:12:39 -05:00
DJP	7aba2f7df5	Add full JSON logging for asset representation in uploads Shows complete asset representation JSON in logs for debugging. ENHANCED LOGGING: Previously: Summary only (Model ID, security count, field count) Now: FULL JSON + Summary OUTPUT FORMAT: ``` Uploading: my_file.jpg Parent Folder ID: abc123def456 ============================================================ FULL ASSET REPRESENTATION (JSON): ============================================================ { "metadata_model_id": "ferrero.model.video", "security_policy_list": [ {"id": "policy1", "name": "Internal"}, {"id": "policy2", "name": "Confidential"} ], "metadata": { "metadata_element_list": [ { "name": "Asset Info", "metadata_element_list": [ { "id": "FERRERO.FIELD.DESCRIPTION", "value": {...} }, ... (all 27 fields) ] } ] } } ============================================================ Summary: Model ID: ferrero.model.video Security Policies: 2 Metadata Fields: 27 ``` BENEFITS: ✓ See exact JSON being sent to DAM API ✓ Debug metadata issues ✓ Verify field values ✓ Compare with successful uploads ✓ Copy JSON for Postman testing The complete JSON is logged before each upload attempt. Perfect for troubleshooting upload failures. Changes: - scripts/shared/dam_client.py - Added full JSON dump with json.dumps(asset_representation, indent=2) - Wrapped in separator lines for readability - Kept summary for quick reference 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 13:29:09 -05:00
DJP	fb8f17d9b2	Add --keep-files flag to A2→A3 script to preserve Box files Allows keeping uploaded files in Box for testing/debugging purposes. NEW FEATURE: --keep-files Flag - Optional flag for testing/debugging - Prevents deletion of files from Box after upload - Files remain in Box folder after successful DAM upload USAGE: Default (delete files after upload): python scripts/a2_to_a3_upload_polling.py Keep files in Box (testing): python scripts/a2_to_a3_upload_polling.py --keep-files Combined with A3update: python scripts/a2_to_a3_upload_polling.py --keep-files --A3update HOW IT WORKS: 1. Upload file to DAM (always happens) 2. Store in database (always happens) 3. If --keep-files flag: - Skip Box file deletion - Log: "--keep-files flag set - File kept in Box: filename.jpg" 4. If no flag (default): - Delete file from Box - Log: "Deleted file from Box: filename.jpg" LOGGING: ``` With flag: --keep-files flag set - File kept in Box: my_file.jpg Without flag: Deleted file from Box: my_file.jpg ``` USE CASES: - Testing: Upload multiple times without re-uploading to Box - Debugging: Keep files to inspect Box metadata - Development: Test upload logic without losing files - Backup: Maintain Box copies during initial testing PRODUCTION NOTE: For production, don't use this flag - files should be deleted after successful upload to avoid duplicates on next run. BOTH FLAGS TOGETHER: python scripts/a2_to_a3_upload_polling.py --keep-files --A3update - Uploads file to DAM - Keeps file in Box - Updates campaign A2→A3 - Perfect for end-to-end testing Changes: - scripts/a2_to_a3_upload_polling.py - Added --keep-files flag - Added keep_files parameter to process_box_file() - Conditional Box file deletion - Enhanced logging for both modes 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 13:17:11 -05:00
DJP	64a7fb6b90	Add --A3update flag to A2→A3 script for testing Allows manual campaign status update A2→A3 after file upload. NEW FEATURE: --A3update Flag - Optional flag for testing purposes - Forces campaign status update A2→A3 after file upload - Extracts campaign ID from uploaded asset's metadata - Updates campaign status in DAM USAGE: Default (no status update): python scripts/a2_to_a3_upload_polling.py With status update (testing): python scripts/a2_to_a3_upload_polling.py --A3update HOW IT WORKS: 1. Upload file to DAM (always happens) 2. If --A3update flag set: - Extract campaign ID from master asset metadata - Look in inherited_metadata_collections for campaign container - Update campaign status A2 → A3 - Log success/failure LOGGING: ``` --A3update flag set - Attempting to update campaign status Found campaign ID: abc123def456 Updating campaign status A2 → A3... ✓ Campaign status updated successfully: A2 → A3 ``` USE CASES: - Testing: Quickly update campaign status after single upload - Manual workflow: Force status update without waiting for all assets - Development: Test status update functionality PRODUCTION NOTE: For production, typically don't use this flag. Campaign should stay at A2 until ALL localized assets uploaded. Changes: - scripts/a2_to_a3_upload_polling.py - Added argparse import - Added --A3update flag - Added campaign status update logic - Extracts campaign ID from full_metadata 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 12:42:43 -05:00
DJP	32ad6b0370	Add detailed upload logging and job polling for actual asset ID Enhanced upload process with better logging and true asset ID retrieval. UPLOAD LOGGING IMPROVEMENTS: - Now logs asset representation being sent: - Model ID - Security policy count - Metadata field count - Helps debug upload issues ASSET ID vs JOB ID FIX: Previously: When DAM returned job_handle, we stored job_id as asset_id Now: Poll the job to get the ACTUAL asset ID NEW METHOD: _poll_job_for_asset_id() - Polls /v6/jobs/{job_id} endpoint (max 10 attempts, 2 sec delay) - Checks multiple response locations for asset_id - Logs job status progress - Returns actual asset ID when job completes - Falls back to job_id if asset ID not found RESPONSE HANDLING: 1. If 'asset_resource_list' in response: - Direct asset ID (synchronous upload) - Log: "Upload successful: file.jpg → Asset ID: abc123" 2. If 'job_handle' in response: - Async job (needs polling) - Log: "Upload accepted (async): file.jpg → Job ID: job123" - Log: "Polling job for actual asset ID..." - Poll job status every 2 seconds - Log: "Job status (attempt X): running/completed" - Log: "✓ Job completed → Asset ID: abc123" BENEFITS: ✓ True asset ID stored in database (not job ID) ✓ Better upload debugging with detailed logs ✓ Can track job progress ✓ Handles both sync and async uploads correctly LOGGING EXAMPLE: ``` Uploading: my_file.jpg Parent Folder ID: abc123 Asset Representation: Model ID: ferrero.model.video Security Policies: 2 Metadata Fields: 27 Upload accepted (async): my_file.jpg → Job ID: job456 Polling job for actual asset ID... Job status (attempt 1): running Job status (attempt 2): completed ✓ Job completed → Asset ID: asset789 ``` Changes: - scripts/shared/dam_client.py - Added upload logging before API call - Added _poll_job_for_asset_id() method - Updated upload_asset() to poll jobs for asset ID - Returns both asset_id and job_id in result 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 12:38:45 -05:00
DJP	d818a8b6a8	Update comprehensive README and reorganize documentation files Major README overhaul with complete deployment and configuration guide. Moved old docs to tests/ folder for archive. README.md UPDATES (880 lines - completely rewritten): ✓ Table of contents with navigation ✓ Complete overview of all 4 workflows + daily report ✓ Detailed authentication section (OAuth2 vs mTLS) ✓ Box-config.json location explanation ✓ Server deployment step-by-step guide ✓ Database setup (Docker + native PostgreSQL) ✓ Cron job examples for all workflows ✓ Comprehensive troubleshooting section ✓ Security checklist ✓ Monitoring and log rotation details ✓ Common SQL queries ✓ File structure diagram KEY SECTIONS ADDED: 1. What's Included - All 5 scripts explained 2. Quick Start - Local setup guide 3. Server Deployment - 6-step process with commands 4. Workflows - Detailed process for each (A1→A2, A5→A6, B1→B2, A2→A3, Daily Report) 5. Authentication - OAuth2 vs mTLS with examples 6. Configuration - All .env variables documented 7. Database - Schema, setup, queries 8. Monitoring - Logs, emails, database queries 9. Troubleshooting - Common issues + solutions 10. File Structure - Complete directory tree BOX-CONFIG.JSON LOCATION DOCUMENTED: ✓ Must be one folder up from Python-Version ✓ Referenced as ../Box-config.json in config.yaml ✓ Server deployment instructions include copying both files ✓ Troubleshooting section explains file not found errors MTLS DOCUMENTATION: ✓ Different base URL explained (dev-auth.app-api.ferrero.com) ✓ --auth-pfx flag usage ✓ Whitelisted IP requirement noted ✓ Certificate testing commands REORGANIZATION: - Moved old DEPLOYMENT.md → tests/DEPLOYMENT.md (archive) - Moved old WORKFLOW_DIAGRAMS.md → tests/WORKFLOW_DIAGRAMS.md (archive) - New DEPLOYMENT_GUIDE.md is the current deployment doc - README.md is now comprehensive one-stop documentation Changes: - Python-Version/README.md (completely rewritten, 880 lines) - Moved 2 old docs to tests/ folder - Added test files to tests/ folder 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 10:57:11 -05:00
DJP	6561a4b8cc	Add separate mTLS base URL configuration for certificate authentication Critical fix: mTLS uses completely different API endpoint than OAuth2. KEY CHANGE: OAuth2 and mTLS now use different base URLs automatically based on auth method. CONFIGURATION: - OAuth2: https://ppr.dam.ferrero.com/otmmapi - mTLS: https://dev-auth.app-api.ferrero.com/00003/mm URLs are automatically selected based on --auth-pfx flag: - No flag: Uses DAM_BASE_URL (OAuth2 endpoint) - --auth-pfx: Uses DAM_MTLS_BASE_URL (mTLS endpoint) IMPLEMENTATION: 1. .env: Added DAM_MTLS_BASE_URL variable 2. config.yaml: Added mtls_base_url configuration 3. dam_client.py: Auto-selects base_url in __init__ based on use_mtls flag 4. All API calls automatically use correct endpoint EXAMPLE ENDPOINT TRANSFORMATION: OAuth2: https://ppr.dam.ferrero.com/otmmapi/v6/search/text mTLS: https://dev-auth.app-api.ferrero.com/00003/mm/v6/search/text (Same path, different host/prefix) TESTING STATUS: ✓ Certificate loads successfully ✓ Correct base URL selected based on mode ⚠️ HTTP 403 from current IP (likely IP whitelist) ✓ Ready to test from whitelisted IP location ALL SCRIPTS UPDATED: ✓ a1_to_a2_download.py - Uses correct URL with --auth-pfx ✓ a5_to_a6_download.py - Uses correct URL with --auth-pfx ✓ b1_to_b2_download.py - Uses correct URL with --auth-pfx ✓ test_connection.py - Uses correct URL with --auth-pfx NEW DEBUG SCRIPT: - test_mtls_debug.py - Detailed request/response logging BACKWARD COMPATIBILITY: ✓ OAuth2 completely unchanged (default) ✓ No impact on existing workflows ✓ Can test mTLS from whitelisted IP when ready Next: Test from whitelisted IP location to verify mTLS works end-to-end. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-05 08:25:31 -05:00
DJP	36efd1a651	Add mTLS session handling for certificate authentication Adds session management for mTLS to handle "No session exists" errors. UPDATES: - Added session storage in __init__ for mTLS mode - Updated _make_api_request to use requests.Session with mTLS - Session persists certificate and cookies across requests - Added OTDSTicket cookie handling CURRENT STATUS: ✓ Certificate loads successfully ✓ Connection test passes ⚠️ Search campaigns returns HTTP 401 "No session exists" This suggests mTLS may need: 1. Different API endpoints than OAuth2 2. Additional session initialization step 3. Specific headers or authentication flow 4. Contact DAM API team for mTLS documentation OAuth2 remains default and fully functional. Use --auth-pfx flag to test mTLS when ready. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-04 18:16:21 -05:00
DJP	8e7ae7e2d2	Add optional mTLS certificate authentication with --auth-pfx flag Implements dual authentication system: OAuth2 (default) + mTLS (opt-in). Zero-risk implementation - existing OAuth2 workflows unchanged. NEW FEATURE: mTLS Certificate Authentication - PFX/P12 certificate support for enhanced security - Activated ONLY with --auth-pfx command-line flag - OAuth2 remains default (no flag = OAuth2 as before) - Perfect for testing new auth without breaking production USAGE: Default (OAuth2): python scripts/a1_to_a2_download.py With mTLS: python scripts/a1_to_a2_download.py --auth-pfx IMPLEMENTATION: 1. Certificate Storage (SECURE): - NEW: config/certificates/ folder (gitignored) - Moved PFX file to secure location - File permissions: 600 (owner read/write only) - Password stored in .env (already gitignored) 2. Configuration: - .env: Added DAM_MTLS_CERT_PATH and DAM_MTLS_CERT_PASSWORD - config.yaml: Added mtls_cert_path and mtls_cert_password - .gitignore: Added config/certificates/, .pfx, .p12 3. DAM Client Dual Auth: - NEW: pfx_to_pem() - Converts PFX to temporary PEM for requests - UPDATED: __init__() - Accepts use_mtls flag - NEW: _make_api_request() - Unified request wrapper - Auto-selects auth method based on flag - Updated ALL 8 API calls to use wrapper 4. Scripts Updated (argparse): - test_connection.py - Added --auth-pfx flag - a1_to_a2_download.py - Added --auth-pfx flag - a5_to_a6_download.py - Added --auth-pfx flag - b1_to_b2_download.py - Added --auth-pfx flag 5. Test Script: - NEW: test_mtls_cert.py - Standalone cert loading test - Tests PFX→PEM conversion without API calls - Verifies certificate format and cleanup TESTING RESULTS: ✓ Certificate loads successfully (10930 bytes) ✓ PFX→PEM conversion works (13520 bytes) ✓ Temp file cleanup working ✓ OAuth2 connection test: PASS ✓ mTLS connection test: PASS ✓ Both auth methods working independently SECURITY: ✓ Certificate file gitignored ✓ Password in .env (gitignored) ✓ File permissions: 600 ✓ Temp PEM files auto-deleted ✓ No secrets in code or config MIGRATION PATH: - Dev: Use dam-mtls-dev.pfx (current) - Prod: Replace cert file, update password, same code BACKWARD COMPATIBILITY: ✓ OAuth2 still default (100% backward compatible) ✓ Existing cron jobs unchanged ✓ No breaking changes ✓ Easy rollback (just don't use --auth-pfx) Changes: - .gitignore (+3 lines) - Python-Version/.env (+3 lines) - Python-Version/config/config.yaml (+3 lines) - Python-Version/scripts/shared/dam_client.py (+100 lines dual auth) - Python-Version/scripts/a1_to_a2_download.py (+14 lines argparse) - Python-Version/scripts/a5_to_a6_download.py (+14 lines argparse) - Python-Version/scripts/b1_to_b2_download.py (+14 lines argparse) - Python-Version/scripts/test_connection.py (+15 lines argparse) - NEW: Python-Version/scripts/test_mtls_cert.py (92 lines) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-04 18:01:23 -05:00
DJP	c720759cc7	Add daily summary report with comprehensive statistics Automated daily report that analyzes all workflow logs and sends email summary at 7pm. NEW SCRIPT: daily_report.py - Analyzes last 24 hours of all workflow logs - Parses campaign and asset statistics - Extracts error messages - Sends comprehensive email report - Run via cron at 7pm daily FEATURES: ✓ Overall summary statistics across all workflows ✓ Per-workflow breakdown (A1→A2, A2→A3, A5→A6, B1→B2) ✓ Success rate calculation ✓ Campaign details with status (completed/partial/no_assets) ✓ Error log extraction ✓ Collapsible sections for campaign and error details ✓ Color-coded metrics (green=good, orange=warning, red=error) STATISTICS TRACKED: - Campaigns found/processed/completed/partial - Total assets processed/successful/failed - NOT APPROVED assets (A5→A6) - Skipped assets (approved items) - Success rate percentage - Error count and messages EMAIL TEMPLATE: daily_report - Blue theme (#1976d2) - Grid layout for statistics - Collapsible campaign details - Collapsible error logs (max 10 shown) - Professional dashboard-style formatting USAGE: python scripts/daily_report.py CRON SCHEDULE (7pm daily): 0 19 * * * cd ~/Python-Version && venv/bin/python scripts/daily_report.py >> logs/daily_report.log 2>&1 TESTED: ✓ Successfully parsed all 4 workflow logs ✓ Found 26 campaigns, 58 assets from last 24 hours ✓ Email sent successfully with all statistics ✓ Success rate: 48.3% (from test data) ✓ Error extraction working Changes: - NEW: scripts/daily_report.py (241 lines) - EDIT: shared/notifier.py (add daily_report template) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-04 16:34:44 -05:00
DJP	1081cfd9ca	Add 'no assets found' email notifications for A1→A2 and B1→B2 Ensures all workflows send email notifications even when no assets are found. NEW EMAIL TEMPLATES: 1. a1_to_a2_no_assets - Warns when A1 campaign has no master assets 2. b1_to_b2_no_assets - Warns when B1 campaign has no global master assets Both templates use orange warning theme and explain: - Campaign is set to A1/B1 but no assets found - Status NOT updated (remains at current status) - Script will retry on next run - Suggests verifying assets exist in folder Changes: - a1_to_a2_download.py: Send email when total_assets == 0 - b1_to_b2_download.py: Send email when total_assets == 0 - notifier.py: Add 2 new warning templates This completes email coverage for all scenarios: ✓ Success (all assets processed) ✓ Partial (some failed) ✓ No assets found (campaign empty) ✓ No rejections (A5→A6 specific) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-04 15:33:25 -05:00

1 2

88 commits