Test 2: Verifying action items count for Michal... Found 5 ground truth file(s) ❌ 1.json: Insufficient action items Expected at least 1, got 0 ✓ 2.json: Action items count OK (ground truth: 0, output: 0) ✓ 3.json: Action items count OK (ground truth: 0, output: 0) ❌ 4.json: Insufficient action items Expected at least 1, got 0 ✓ 5.json: Action items count OK (ground truth: 0, output: 0) ============================================================ Results: 3/5 files pass ============================================================ FAILED: Some files have insufficient action items