geo-l1-cairo-multipoint-hull
Analyst notes
Description
This task checks whether the agent applies convex hull per row instead of collapsing all stations into a single hull, and whether it carries the bilingual name columns through the geometric operation without mangling the Arabic strings. The prompt names the operation because the task is L1, but it stays silent about the input geometry type (the agent has to open the file to see each row is a MultiPoint) and about the output CRS, which GeoJSON pins to WGS84 by convention.
Approach
- Open `cairo_metro_stations.geojson` and confirm each feature is a MultiPoint of entrance points with both name columns.
- Compute the convex hull on each row independently, keeping the row order so the names stay paired with their geometry.
- Keep `station_name_en` and `station_name_ar` verbatim, including the Arabic diacritics, and drop nothing else.
- Write the result to `cairo_metro_hulls.geojson` with one Polygon per station, leaving the coordinates in WGS84 as GeoJSON expects.
Pitfalls
- Calling `convex_hull` on the unioned geometry collapses 20 stations into one polygon, which fails the heavily weighted row-count subcheck.
- Reaching for `shapely.unary_union` to merge geometries drops the attribute columns and breaks the bilingual name preservation.
- Round-tripping the Arabic names through a CSV or Latin-1 encoding mangles the diacritics and trips `arabic_names_match` even when the geometry is correct.
- Returning the axis-aligned bounding box of each MultiPoint instead of the convex hull inflates the polygon by roughly 30 percent and drops `hull_iou_against_reference` below 0.95.
- Reprojecting to a metric CRS for the hull computation and forgetting to project back to WGS84 docks the two CRS subchecks even when the geometry itself is fine.
Map
Recent runs task v2
| adapter | started | score | steps | duration | cost | status |
|---|---|---|---|---|---|---|
| openrouter-gemma4-26b-basic | 2026-06-18T07:32:32Z | pending | — | — | — | pending |
| openrouter-deepseek-v4-flash-basic | 2026-06-18T03:08:04Z | 0.00 | 4 | 0:26 | 0.15¢ | done |
| openrouter-deepseek-v4-flash-detailed | 2026-06-17T22:01:33Z | 0.00 | 5 | 0:28 | 0.19¢ | done |
| openrouter-gemma4-26b-detailed | 2026-06-17T19:47:47Z | 0.00 | 5 | 1:22 | 0.24¢ | done |
| openrouter-deepseek-v4-flash-basic | 2026-06-16T21:43:55Z | 1.00 | 5 | 0:54 | 0.26¢ | done |