Summary Reports¶
This guide explains how to generate summary reports that analyze performance across one or more benchmarks.
Basic Usage¶
Single Benchmark Summary¶
To analyze one benchmark:
masbench summary benchmark-name
This generates a report showing:
Total levels solved
Total time taken
Per-level breakdown of performance
Multiple Benchmark Summary¶
To analyze multiple benchmarks together:
masbench summary benchmark1 benchmark2 benchmark3
For example:
masbench summary astar-v1 bfs-v1 dijkstra-v1
This creates a report showing:
Which benchmark solved the most levels
Which benchmark finished fastest (timeout applies to unsolved levels)
For each level: which benchmark had the fastest time and fewest actions
Generated Output¶
When you run a summary, masbench creates an HTML report:
benchmark-results/
└── summaries/
├── benchmark1_summary.html (single benchmark)
└── multi_benchmark_summary.html (multiple benchmarks)
Opening the Report¶
Open the HTML file in your browser:
# Linux
xdg-open benchmark-results/summaries/benchmark1_summary.html
# macOS
open benchmark-results/summaries/benchmark1_summary.html
# Windows
start benchmark-results/summaries/benchmark1_summary.html
Report Features¶
Overall Performance¶
The report header shows:
Levels Solved: Total count for each benchmark
Total Time: Cumulative time (unsolved levels count as timeout)
Winner: Benchmark with best overall performance
Level-by-Level Analysis¶
Each level shows:
Fastest Time: Which benchmark solved it quickest
Fewest Actions: Which benchmark used the shortest solution
Example Workflow¶
Run multiple benchmarks:
masbench run astar-heuristic-1 masbench run astar-heuristic-2 masbench run astar-heuristic-3
Generate a summary:
masbench summary astar-heuristic-1 astar-heuristic-2 astar-heuristic-3
Open the HTML report
Analyze results: - Check which heuristic solved the most levels - Identify levels where each heuristic excels - Look for patterns in performance differences - Use the data to choose the best approach
For detailed comparison between two specific benchmarks:
masbench compare astar-heuristic-1 astar-heuristic-2
Use Cases¶
Heuristic Tuning¶
Compare variations of the same algorithm:
masbench summary baseline heuristic-v1 heuristic-v2 heuristic-v3
Find which parameter values work best.
Single Benchmark Review¶
Analyze one benchmark’s performance:
masbench summary my-algorithm
Review which levels were solved, which failed, and where time was spent.
See also
For comparing two benchmarks, see the Comparison guide
For running benchmarks, see the Running Benchmarks guide
For initial setup, see the Getting Started guide