Run of 2025-12-17 15:00:16 / task13_meeting_action_items

Task `task13_meeting_action_items`

# Meeting Action Items Extraction for Michal

You are analyzing meeting transcripts to extract action items specifically for a person named **Michal**.

## Input Files

The `input/` directory contains JSON files with meeting transcripts. Each file has the following structure:

- `title` - Meeting title
- `timestamp` - Meeting date in ISO format
- `raw_data.room_name` - Room/channel name
- `raw_data.short_summary` - Brief summary
- `raw_data.long_summary` - Detailed summary
- `raw_data.transcript` - Full conversation transcript
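
As an illustration of the structure listed above, here is a minimal sketch of reading one input file in Python. The function name `load_meeting` and the flattened return shape are assumptions made for this example; only the field names come from the task description.

```python
import json
from pathlib import Path


def load_meeting(path):
    """Read one meeting file and return the documented fields as a flat dict.

    Hypothetical helper: missing fields come back as None instead of raising.
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    raw = data.get("raw_data") or {}
    return {
        "title": data.get("title"),
        "timestamp": data.get("timestamp"),
        "room_name": raw.get("room_name"),
        "short_summary": raw.get("short_summary"),
        "long_summary": raw.get("long_summary"),
        "transcript": raw.get("transcript"),
    }
```

Using `.get` keeps the sketch tolerant of files that omit a summary or room name, which the task description does not rule out.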

## Your Task

For each JSON file in the `input/` directory, extract **all action items assigned to or involving Michal**.

Check for variations like "Michal", "michal", "Michael", "Michał".
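
One possible way to catch these spelling variants is to fold case and diacritics before comparing whole words. The sketch below is an assumption about how matching could be done, not part of the task definition; `NAME_VARIANTS`, `fold`, and `mentions_michal` are hypothetical names.

```python
import re
import unicodedata

# Folded forms of the variants named in the task text (assumed to be exhaustive here).
NAME_VARIANTS = {"michal", "michael"}


def fold(text):
    """Lowercase the text and strip diacritics so that 'Michał' becomes 'michal'."""
    # 'ł' has no Unicode decomposition, so it is mapped explicitly.
    text = text.replace("ł", "l").replace("Ł", "L")
    decomposed = unicodedata.normalize("NFKD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def mentions_michal(text):
    """Return True if any whole word in the text is one of the name variants."""
    words = re.findall(r"[^\W\d_]+", fold(text))
    return any(word in NAME_VARIANTS for word in words)
```

Matching on whole words avoids false positives on longer names such as "Michalina". A name match only flags a candidate line; deciding whether it is an actual assignment still requires reading the context.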

## Output Format

Create corresponding JSON files in the current directory with the same names as the input files (e.g., `1.json`, `2.json`, etc.).

Each output file must follow this exact schema:

```json
{
  "action_items": [
    {"action_item": "Description of task for Michal", "deadline": null},
    {"action_item": "Another task with date", "deadline": "2025-12-20"},
    {"action_item": "Task with datetime", "deadline": "2025-12-20T10:00:00"}
  ]
}
```
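
A structural check along the following lines could be run on each output before it is written. This is a sketch under assumptions: the checks performed by `test_1_json_schema.sh` are not shown in this document, and treating "exact schema" as "no extra keys" is an interpretation. The name `is_valid_output` is hypothetical.

```python
def is_valid_output(doc):
    """Return True if doc has the shape shown in the schema example above."""
    # Assumption: "exact schema" means no keys beyond the documented ones.
    if not isinstance(doc, dict) or set(doc) != {"action_items"}:
        return False
    items = doc["action_items"]
    if not isinstance(items, list):
        return False
    for item in items:
        if not isinstance(item, dict) or set(item) != {"action_item", "deadline"}:
            return False
        if not isinstance(item["action_item"], str) or not item["action_item"].strip():
            return False
        # A deadline is either null (None) or a string; its format is not checked here.
        if item["deadline"] is not None and not isinstance(item["deadline"], str):
            return False
    return True
```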

## Action Item Fields

- **action_item**: A clear description of what Michal needs to do
- **deadline**: Any mentioned deadline in ISO 8601 format (e.g., "2025-12-20" or "2025-12-20T10:00:00"), or null if not specified
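
The deadline rule can be checked with the standard library. The sketch below accepts a value only if `datetime.fromisoformat` can parse it and otherwise falls back to `None`. That fallback is an assumption chosen to keep outputs valid, and the function name is hypothetical.

```python
from datetime import datetime


def valid_deadline_or_none(value):
    """Return value unchanged if it parses as an ISO 8601 date or datetime, else None."""
    if not isinstance(value, str):
        return None
    try:
        # Accepts both "2025-12-20" and "2025-12-20T10:00:00".
        datetime.fromisoformat(value)
    except ValueError:
        return None
    return value
```

Relative phrases such as "next Friday" are rejected by this check. Resolving them against the meeting `timestamp` would be a separate step that this sketch does not attempt.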

## Important Guidelines

- **Be conservative**. Only create an action item if you're sure it stems from the conversation and is clearly assigned to or involves Michal.
- Include all commitments, tasks, and follow-ups for Michal that are explicitly mentioned in the meeting.
- Deadlines must be in valid ISO 8601 format: date only (e.g., "2025-12-20") or full datetime (e.g., "2025-12-20T10:00:00").
- If Michal has no action items in a meeting, return an empty action_items array: `{"action_items": []}`
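
Writing the result, including the empty case from the last guideline, might look like the sketch below. `write_output` and its `out_dir` parameter are hypothetical; the only facts taken from the task are that the output file reuses the input file name and that an empty list is a valid result.

```python
import json
from pathlib import Path


def write_output(input_path, action_items, out_dir="."):
    """Write the action items to out_dir under the same file name as the input."""
    out_path = Path(out_dir) / Path(input_path).name
    payload = {"action_items": list(action_items)}
    out_path.write_text(
        json.dumps(payload, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return out_path
```

For example, `write_output("input/3.json", [])` would create `3.json` in the current directory with an empty `action_items` array.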

PS: You are currently working in an automated system and cannot ask any questions or have back and forth with a user.

Results

- Models Tested: 24
- Success Rate: 8.3%
- Avg Duration: 3m 12s
- Duration Range: 9s - 10m 0s

Details

Score	Model	Duration	Session (KB)	test_0_create_ground_truth.sh	test_1_json_schema.sh	test_2_action_items_count.sh	test_3_empty_cases.sh
100.0%	openrouter/google/gemini-3-pro-preview	5m 59s	322.9	✅	✅	✅	✅
100.0%	openrouter/openai/gpt-5.2	2m 16s	172.3	✅	✅	✅	✅
75.0%	openrouter/openai/gpt-5	4m 20s	384.2	✅	✅	❌	✅
75.0%	openrouter/anthropic/claude-opus-4.5	1m 28s	201.7	✅	✅	❌	✅
75.0%	openrouter/qwen/qwen3-coder	1m 24s	142.0	✅	✅	❌	✅
75.0%	openrouter/x-ai/grok-3-mini	1m 15s	532.7	✅	✅	❌	✅
75.0%	openrouter/google/gemini-2.5-pro	53s	54.7	✅	✅	❌	✅
75.0%	openrouter/openai/gpt-oss-20b	1m 15s	1094.8	✅	✅	❌	✅
75.0%	openrouter/deepseek/deepseek-v3.1-terminus	1m 24s	88.7	✅	✅	❌	✅
75.0%	litellm/GLM-4.5-Air-FP8-dev	43s	73.1	✅	✅	❌	✅
75.0%	openrouter/anthropic/claude-sonnet-4.5	1m 2s	96.1	✅	✅	❌	✅
75.0%	openrouter/x-ai/grok-code-fast-1	46s	73.6	✅	✅	❌	✅
75.0%	openrouter/openai/gpt-4.1-mini	31s	95.2	✅	✅	❌	✅
50.0%	openrouter/google/gemini-2.5-flash-preview-09-2025	37s	53.8	✅	❌	❌	✅
50.0%	openrouter/openai/gpt-5-nano	1m 39s	161.8	✅	✅	❌	❌
50.0%	openrouter/openai/gpt-oss-120b	20s	52.9	✅	❌	❌	✅
50.0%	openrouter/openai/gpt-4o-mini	44s	105.1	✅	✅	❌	❌
50.0%	openrouter/google/gemini-2.5-flash-lite-preview-09-2025	9s	19.3	✅	❌	❌	✅
50.0%	openrouter/deepseek/deepseek-chat-v3-0324	22s	45.6	✅	❌	❌	✅
0.0%	litellm/DeepSeek-V3.2-sandbox	10m 0s	0.0	—	—	—	—
0.0%	openrouter/anthropic/claude-haiku-4.5	10m 0s	0.0	—	—	—	—
0.0%	litellm/GLM-4.6-trtllm-sandbox	10m 0s	0.0	—	—	—	—
0.0%	openrouter/openai/gpt-4.1-nano	10m 0s	0.0	—	—	—	—
0.0%	openrouter/openai/gpt-5-mini	10m 0s	0.0	—	—	—	—
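
The reported 8.3% success rate is consistent with counting only the runs that scored 100%, which is 2 of the 24 rows above. The sketch below reproduces that arithmetic. The definition of success as a perfect score is an assumption inferred from the numbers, not something the page states.

```python
# Scores copied from the table above: 2 runs at 100%, 11 at 75%, 6 at 50%, 5 at 0%.
scores = [100.0] * 2 + [75.0] * 11 + [50.0] * 6 + [0.0] * 5


def success_rate(scores, passing=100.0):
    """Percentage of runs whose score reaches the passing threshold (assumed 100%)."""
    return 100.0 * sum(score >= passing for score in scores) / len(scores)


print(f"{success_rate(scores):.1f}%")  # prints "8.3%"
```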
