Test 2: Verifying action items count for Michal... Found 5 ground truth file(s) ❌ 1.json: Insufficient action items Expected at least 1, got 0 ✓ 2.json: Action items count OK (ground truth: 0, output: 3) ✓ 3.json: Action items count OK (ground truth: 0, output: 0) ✓ 4.json: Action items count OK (ground truth: 1, output: 1) ❌ 5.json: Failed to load JSON: Extra data: line 1 column 133 (char 132) ============================================================ Results: 3/5 files pass ============================================================ FAILED: Some files have insufficient action items