Commit dc14eed
Add comprehensive test monitoring and timing analysis tools
- Create monitor-test-progress.sh for real-time test status updates
  - Shows process status, elapsed time, active tests
  - Auto-refreshing display with detailed breakdowns
  - Tracks Playwright, Vitest, npm test processes
- Create analyze-test-timing.sh for post-run analysis
  - Parses log files for duration metrics
  - Estimates step durations from file timestamps
  - Provides intelligent optimization recommendations
- Add timing infrastructure to test-vm-e2e.sh
  - start_step() and end_step() helper functions
  - Timing tracking with associative arrays
  - Console output showing step completion times
- Add detailed VISUAL_REGRESSION_README.md
  - Complete documentation of the visual regression suite
  - Device coverage list (21 configurations)
  - Integration guide for GraphDone-DevOps
  - Performance characteristics and best practices

These tools address the need for better visibility into long-running E2E test execution and provide data for performance optimization.
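The commit message mentions start_step()/end_step() helpers backed by associative arrays in test-vm-e2e.sh, but that file's changes are not shown in this diff. A hypothetical sketch of how such timing helpers are commonly written:

```shell
#!/bin/bash
# Hypothetical sketch of timing helpers like those the commit describes for
# test-vm-e2e.sh (the real implementation is not shown in this diff).
declare -A STEP_START STEP_DURATION   # associative arrays keyed by step name

start_step() {
  # Record the wall-clock start time (epoch seconds) for a named step.
  STEP_START[$1]=$(date +%s)
}

end_step() {
  # Compute and report the elapsed time for a named step.
  local name=$1
  STEP_DURATION[$name]=$(( $(date +%s) - STEP_START[$name] ))
  echo "Step '$name' completed in ${STEP_DURATION[$name]}s"
}

start_step "build"
sleep 1
end_step "build"
```

Storing durations in an associative array keyed by step name makes it easy to print a summary table at the end of the run.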
1 parent 9c063fa commit dc14eed

5 files changed: 800 additions & 0 deletions

VISUAL_REGRESSION_README.md (216 additions & 0 deletions)
# Visual Regression Testing Suite

## Overview

Comprehensive screenshot collection system for GraphDone UI monitoring and visual regression testing. The suite captures screenshots across 21 device configurations and 10+ core application screens, generating 250-300 screenshots per test run.

## Purpose

- **Visual Regression Testing**: Compare UI changes across releases
- **DevOps Monitoring**: Automated visual change detection in the CI/CD pipeline
- **Cross-Device Compatibility**: Verify the UI renders correctly on all supported devices
- **UI/UX Documentation**: Maintain visual records of application state
- **Design Review**: Provide stakeholders with visual references
## Device Coverage

### Mobile Phones (Portrait)
- iPhone SE (375×667, 2x)
- iPhone 12/13/14 (390×844, 3x)
- iPhone 14 Pro Max (430×932, 3x)
- Samsung Galaxy S21 (360×800, 3x)
- Google Pixel 7 (412×915, 2.625x)

### Mobile Phones (Landscape)
- iPhone 14 Landscape (844×390, 3x)
- Samsung Galaxy Landscape (800×360, 3x)

### Tablets (Portrait)
- iPad Mini (768×1024, 2x)
- iPad Air (820×1180, 2x)
- iPad Pro 11" (834×1194, 2x)
- iPad Pro 12.9" (1024×1366, 2x)
- Samsung Galaxy Tab (800×1280, 2x)

### Tablets (Landscape)
- iPad Pro 11" Landscape (1194×834, 2x)
- iPad Pro 12.9" Landscape (1366×1024, 2x)

### Desktop
- HD (1366×768, 1x)
- Full HD (1920×1080, 1x)
- QHD (2560×1440, 1x)
- 4K (3840×2160, 1x)

### Ultrawide
- QHD Ultrawide (3440×1440, 1x)
- 4K Ultrawide (5120×2160, 1x)
## Screens Captured

- **Landing Page** (`/`)
- **Login** (`/login`)
- **Workspace** (`/workspace`)
- **Graph View** (`/graph`) - Core visualization
- **Projects** (`/projects`)
- **Settings** (`/settings`)
- **Profile** (`/profile`)
- **Admin Panel** (`/admin`)
- **Admin Users** (`/admin/users`)
- **Admin System** (`/admin/system`)

Plus interactive states:
- Button hover states (up to 5 buttons)
- Modal/dialog states (up to 3 modals)
## Running the Suite

### Standalone Execution
```bash
npm run test:e2e:visual
```

### As Part of the Full E2E Test Suite
```bash
npm run test:e2e
# Or in the VM:
./tools/test-vm-e2e.sh
```

### Disable Visual Regression in the E2E Suite
```bash
RUN_VISUAL_REGRESSION=false ./tools/test-vm-e2e.sh
```
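The `RUN_VISUAL_REGRESSION` toggle is read by `tools/test-vm-e2e.sh`, which is not reproduced here. A minimal sketch of how a runner might honor such an environment toggle, assuming the variable defaults to enabled:

```shell
#!/bin/bash
# Hypothetical sketch of how a runner like tools/test-vm-e2e.sh might honor
# the RUN_VISUAL_REGRESSION toggle (the real script is not shown in this diff).
run_visual_regression() {
  # Treat an unset variable as "true" so the suite runs by default.
  if [ "${RUN_VISUAL_REGRESSION:-true}" = "true" ]; then
    echo "Running visual regression suite"
    # A real runner would invoke something like:
    # npx playwright test tests/e2e/visual-regression-suite.spec.ts
  else
    echo "Skipping visual regression suite"
  fi
}
```

Defaulting an unset variable to `"true"` via `${VAR:-true}` keeps the suite opt-out rather than opt-in, matching the usage shown above.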
## Output Structure

```
test-artifacts/visual-regression/{timestamp}/
├── iPhone-SE/
│   ├── landing-page.png
│   ├── login.png
│   ├── workspace.png
│   └── ...
├── iPad-Pro-11/
│   ├── landing-page.png
│   └── ...
├── Desktop-Full-HD/
│   ├── landing-page.png
│   └── ...
├── SUMMARY.md
└── ...
```

Each test run creates a timestamped directory with:
- Device-specific subdirectories
- PNG screenshots for each screen
- `SUMMARY.md` with test metadata and configuration
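A shell sketch of how a run directory with this layout can be produced (the actual suite does this from its TypeScript spec; the device and screen names below are illustrative):

```shell
#!/bin/bash
# Sketch: create a timestamped run directory matching the layout above.
# "iPhone-SE", "Desktop-Full-HD", and "landing-page" are illustrative names.
base=$(mktemp -d)   # stand-in for the repo root in this demo

timestamp=$(date +%Y-%m-%dT%H-%M-%S)   # ISO-style and filesystem-safe
run_dir="$base/test-artifacts/visual-regression/$timestamp"

mkdir -p "$run_dir/iPhone-SE" "$run_dir/Desktop-Full-HD"
touch "$run_dir/iPhone-SE/landing-page.png"   # placeholder for a real screenshot
printf '# Visual Regression Run\n\nTimestamp: %s\n' "$timestamp" > "$run_dir/SUMMARY.md"

ls "$run_dir"
```

Using `-` instead of `:` in the timestamp keeps directory names portable across filesystems that reject colons.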
## Integration with GraphDone-DevOps

The visual regression suite is designed to provide comprehensive data for the GraphDone-DevOps repository to consume and analyze. It deliberately does NOT include a complex results viewer; that responsibility belongs to GraphDone-DevOps.

### Expected DevOps Integration

1. **Automated Comparison**: Use tools like Pixelmatch or Percy for visual diff analysis
2. **Artifact Storage**: Upload screenshots to S3/artifact storage for historical tracking
3. **CI/CD Alerts**: Trigger notifications when visual changes exceed thresholds
4. **Baseline Management**: Store approved screenshots as baselines for comparison
5. **Reporting Dashboard**: Build viewing and organization tools in GraphDone-DevOps

### Data Format

Screenshots are organized by:
- **Timestamp**: ISO format (YYYY-MM-DDTHH-mm-ss)
- **Device**: Descriptive name (e.g., "iPhone-14-Pro-Max", "Desktop-Full-HD")
- **Screen**: Sanitized route name (e.g., "landing-page", "admin-users")

All filenames are consistent and parseable for automated processing.
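Because the layout is fixed, a consumer can recover the timestamp, device, and screen from any screenshot path with plain parameter expansion. A sketch (the example path is fabricated for illustration):

```shell
#!/bin/bash
# Sketch: split a screenshot path into its timestamp/device/screen parts.
# The concrete path is an illustrative example, not real output.
path="test-artifacts/visual-regression/2024-01-15T09-30-00/iPhone-14-Pro-Max/admin-users.png"

rel=${path#test-artifacts/visual-regression/}   # strip the fixed prefix
timestamp=${rel%%/*}                            # first remaining path component
device=$(basename "$(dirname "$path")")         # directory holding the screenshot
screen=$(basename "$path" .png)                 # filename without the .png extension

echo "$timestamp $device $screen"
```

This is the kind of parsing a GraphDone-DevOps comparison job could use to pair each new screenshot with its baseline.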
## Configuration

### Adjusting the Device List

Edit `tests/e2e/visual-regression-suite.spec.ts`:

```typescript
const DEVICES = [
  { name: 'Custom-Device', width: 1024, height: 768, deviceScaleFactor: 1 },
  // ... add more devices
];
```

### Adjusting Screens

Edit the `SCREENS` array:

```typescript
const SCREENS = [
  { route: '/custom-route', name: 'custom-screen' },
  // ... add more screens
];
```

### Adjusting Timeouts

- **Page Load**: Line 141 - `timeout: 30000` (30 seconds)
- **Content Wait**: Line 148 - `waitForTimeout(2000)` (2 seconds)
- **Screenshot Retry**: Line 81 - `maxRetries = 3`
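The spec's actual retry logic lives in TypeScript and is not shown here; as a language-neutral illustration of the bounded-retry pattern behind `maxRetries = 3`, a shell equivalent might look like:

```shell
#!/bin/bash
# Sketch of a bounded retry loop like the spec's maxRetries = 3 behaviour.
# The command passed in is a stand-in for the real screenshot capture step.
max_retries=3

attempt_with_retries() {
  local cmd=$1 attempt
  for attempt in $(seq 1 "$max_retries"); do
    if "$cmd"; then
      echo "succeeded on attempt $attempt"
      return 0
    fi
    echo "attempt $attempt failed, retrying" >&2
  done
  echo "gave up after $max_retries attempts" >&2
  return 1
}
```

Bounding the retries keeps a single flaky screen from stalling the whole 21-device run.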
## Performance Considerations

### Test Duration
- ~5-10 seconds per device configuration
- Total runtime: ~3-5 minutes for all 21 devices

### Disk Usage
- ~50-200KB per screenshot (depending on content)
- ~250-300 screenshots per run
- Total: ~15-60MB per test run

### Resource Requirements
- Memory: ~2GB RAM for Playwright plus browsers
- CPU: Moderate (screenshot capture is CPU-intensive)
- Disk I/O: Moderate (writing many PNG files)
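The total disk figure follows from the per-screenshot numbers; a quick arithmetic sanity check of the quoted range:

```shell
#!/bin/bash
# Sanity check of the disk-usage range quoted above:
# low end  = 250 screenshots x 50 KB, high end = 300 screenshots x 200 KB.
low_kb=$((250 * 50))
high_kb=$((300 * 200))
echo "$((low_kb / 1024))MB - $((high_kb / 1024))MB"
```

This lands at roughly 12-58 MB, consistent with the ~15-60MB estimate above.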
## Best Practices

1. **Run on a Stable State**: Execute after UI changes are complete
2. **Consistent Environment**: Use the same browser versions for comparisons
3. **Network Independence**: Tests should not depend on external services
4. **Baseline Updates**: Update baselines when intentional UI changes occur
5. **Artifact Cleanup**: Regularly archive or delete old screenshot sets

## Troubleshooting

### Screenshots Failing
- Check that the application is running (`npm run dev`)
- Verify the routes exist in the application
- Increase timeouts if content loads slowly

### Missing Browsers
- Run `npx playwright install --with-deps`
- Verify in the VM: `ls -la ~/.cache/ms-playwright/`

### Incomplete Screenshot Sets
- Check disk space
- Review Playwright logs for specific errors
- Verify network connectivity to localhost:3127
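A small pre-flight reachability check can catch the "application not running" case before wasting a full run. A sketch, assuming the port 3127 mentioned in the troubleshooting note (adjust if your dev server listens elsewhere):

```shell
#!/bin/bash
# Sketch: verify the app answers over HTTP before starting a screenshot run.
# The default URL assumes the localhost:3127 port from the note above.
check_app() {
  local url=${1:-http://localhost:3127}
  if curl -sf -o /dev/null --max-time 5 "$url"; then
    echo "app reachable at $url"
  else
    echo "app NOT reachable at $url" >&2
    return 1
  fi
}
```

Wiring `check_app || exit 1` at the top of a runner fails fast with a clear message instead of 21 devices' worth of timeout errors.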
## Future Enhancements

- [ ] Add visual diff comparison tool integration
- [ ] Implement baseline screenshot management
- [ ] Add screenshot annotations (highlights, labels)
- [ ] Support for authenticated routes
- [ ] Dark mode screenshot variants
- [ ] Accessibility contrast analysis
- [ ] Mobile gesture simulation capture
- [ ] Video recording for interactions

## Related Documentation

- E2E Test Suite: `tests/e2e/`
- Test VM Setup: `tools/test-vm-e2e.sh`
- Playwright Config: `playwright.config.ts`
- DevOps Integration: see the GraphDone-DevOps repository

tools/analyze-test-timing.sh (128 additions & 0 deletions)
1+
#!/bin/bash
2+
3+
# GraphDone Test Timing Analyzer
4+
# Analyzes test report logs and generates detailed timing breakdowns
5+
6+
set -e
7+
8+
REPORT_FILE="$1"
9+
10+
if [ -z "$REPORT_FILE" ] || [ ! -f "$REPORT_FILE" ]; then
11+
echo "Usage: $0 <test-report-file>"
12+
exit 1
13+
fi
14+
15+
echo "# Test Timing Analysis"
16+
echo ""
17+
echo "Analyzing: $REPORT_FILE"
18+
echo ""
19+
20+
# Extract timestamps from log files
21+
REPORT_DIR=$(dirname "$REPORT_FILE")
22+
23+
declare -A STEP_TIMES
24+
25+
# Parse build log for duration
26+
if [ -f "$REPORT_DIR/build.log" ]; then
27+
BUILD_TIME=$(grep -oP 'Done in \K[\d.]+s' "$REPORT_DIR/build.log" | tail -1 || echo "N/A")
28+
echo "Build Duration: $BUILD_TIME"
29+
fi
30+
31+
# Parse unit test log for duration
32+
if [ -f "$REPORT_DIR/unit-tests.log" ]; then
33+
TEST_TIME=$(grep -oP 'Test Files.*\(\K[\d.]+s' "$REPORT_DIR/unit-tests.log" | tail -1 || echo "N/A")
34+
echo "Unit Tests Duration: $TEST_TIME"
35+
fi
36+
37+
# Parse E2E test log for duration
38+
if [ -f "$REPORT_DIR/e2e-tests.log" ]; then
39+
E2E_TIME=$(grep -oP '\d+ passed.*\(\K[\d.]+s' "$REPORT_DIR/e2e-tests.log" | tail -1 || echo "N/A")
40+
echo "E2E Tests Duration: $E2E_TIME"
41+
fi
42+
43+
# Parse visual regression log for duration
44+
if [ -f "$REPORT_DIR/visual-regression.log" ]; then
45+
VR_TIME=$(grep -oP '\d+ passed.*\(\K[\d.]+s' "$REPORT_DIR/visual-regression.log" | tail -1 || echo "N/A")
46+
echo "Visual Regression Duration: $VR_TIME"
47+
fi
48+
49+
echo ""
50+
echo "## Detailed Breakdown"
51+
echo ""
52+
53+
# Analyze file modification times to estimate step durations
54+
cd "$REPORT_DIR"
55+
56+
if [ -f "vm-launch.log" ]; then
57+
VM_LAUNCH_START=$(stat -c %Y vm-launch.log 2>/dev/null || echo "0")
58+
fi
59+
60+
if [ -f "cloud-init.log" ]; then
61+
CLOUD_INIT_START=$(stat -c %Y cloud-init.log 2>/dev/null || echo "0")
62+
fi
63+
64+
if [ -f "lint.log" ]; then
65+
LINT_START=$(stat -c %Y lint.log 2>/dev/null || echo "0")
66+
fi
67+
68+
if [ -f "typecheck.log" ]; then
69+
TYPECHECK_START=$(stat -c %Y typecheck.log 2>/dev/null || echo "0")
70+
fi
71+
72+
if [ -f "build.log" ]; then
73+
BUILD_START=$(stat -c %Y build.log 2>/dev/null || echo "0")
74+
fi
75+
76+
if [ -f "unit-tests.log" ]; then
77+
UNIT_START=$(stat -c %Y unit-tests.log 2>/dev/null || echo "0")
78+
fi
79+
80+
if [ -f "e2e-tests.log" ]; then
81+
E2E_START=$(stat -c %Y e2e-tests.log 2>/dev/null || echo "0")
82+
fi
83+
84+
# Calculate durations from file timestamps
85+
if [ "$VM_LAUNCH_START" != "0" ] && [ "$CLOUD_INIT_START" != "0" ]; then
86+
LAUNCH_DURATION=$((CLOUD_INIT_START - VM_LAUNCH_START))
87+
echo "VM Launch: ${LAUNCH_DURATION}s"
88+
fi
89+
90+
if [ "$LINT_START" != "0" ] && [ "$TYPECHECK_START" != "0" ]; then
91+
LINT_DURATION=$((TYPECHECK_START - LINT_START))
92+
echo "Linting: ${LINT_DURATION}s"
93+
fi
94+
95+
if [ "$TYPECHECK_START" != "0" ] && [ "$BUILD_START" != "0" ]; then
96+
TYPECHECK_DURATION=$((BUILD_START - TYPECHECK_START))
97+
echo "Type Checking: ${TYPECHECK_DURATION}s"
98+
fi
99+
100+
if [ "$BUILD_START" != "0" ] && [ "$UNIT_START" != "0" ]; then
101+
BUILD_DURATION=$((UNIT_START - BUILD_START))
102+
echo "Build: ${BUILD_DURATION}s"
103+
fi
104+
105+
if [ "$UNIT_START" != "0" ] && [ "$E2E_START" != "0" ]; then
106+
UNIT_DURATION=$((E2E_START - UNIT_START))
107+
echo "Unit Tests: ${UNIT_DURATION}s"
108+
fi
109+
110+
echo ""
111+
echo "## Recommendations"
112+
echo ""
113+
114+
# Add intelligent recommendations based on timing
115+
if [ "$BUILD_DURATION" -gt 180 ] 2>/dev/null; then
116+
echo "- Build took >${BUILD_DURATION}s: Consider build caching or Turbo optimization"
117+
fi
118+
119+
if [ "$UNIT_DURATION" -gt 300 ] 2>/dev/null; then
120+
echo "- Unit tests took >${UNIT_DURATION}s: Consider test parallelization or filtering"
121+
fi
122+
123+
if [ -n "$E2E_TIME" ] && [ "$E2E_TIME" != "N/A" ]; then
124+
E2E_SECONDS=$(echo "$E2E_TIME" | sed 's/s//')
125+
if [ "$(echo "$E2E_SECONDS > 900" | bc)" -eq 1 ] 2>/dev/null; then
126+
echo "- E2E tests took >${E2E_SECONDS}s (15+ min): Consider parallelization or reducing scope"
127+
fi
128+
fi
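The script's breakdown rests on one technique: the mtime of each step's log approximates when that step started, so the gap to the next log's mtime estimates the step's duration. A self-contained demo with synthetic logs 90 seconds apart (GNU `touch -d`/`stat -c` assumed):

```shell
#!/bin/bash
# Demo of the mtime-delta estimation used by analyze-test-timing.sh,
# fabricating two step logs whose timestamps are 90 seconds apart.
dir=$(mktemp -d)
touch -d '2024-01-01 10:00:00' "$dir/lint.log"
touch -d '2024-01-01 10:01:30' "$dir/typecheck.log"

LINT_START=$(stat -c %Y "$dir/lint.log")           # epoch seconds of lint start
TYPECHECK_START=$(stat -c %Y "$dir/typecheck.log") # epoch seconds of next step

# Lint is assumed to run from its log's mtime until the next log appears.
LINT_DURATION=$((TYPECHECK_START - LINT_START))
echo "Linting: ${LINT_DURATION}s"   # 90s
```

The estimate is only as accurate as the assumption that each step writes its log at the moment it starts; steps that buffer output or create their log late will skew it.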
