dd-l1-london-parks-count
Analyst notes
Description
An L1 data-discovery probe testing whether the agent can read a bundled FlatGeobuf in a projected national CRS (EPSG:27700), filter by planar area in the right units, and report a bounding box back in WGS84. The hidden gotcha is that area must be computed in the projected CRS and converted from square metres to hectares, while the bbox must be reprojected the other way to lon/lat. The instruction deliberately omits the input CRS and the input format, so the agent has to discover both from the file itself; the bbox shape and component order are pinned in the prompt since version 2.
Approach
- Open london_parks.fgb and inspect its CRS, columns, and geometry type before doing anything else.
- Filter the features by area in the projected CRS, taking care that one hectare means ten thousand square metres.
- Sum the filtered areas and convert to hectares for total_area_ha.
- Reproject the filtered subset (or its bounding box corners) to EPSG:4326 and read the bbox in lon/lat as [xmin, ymin, xmax, ymax].
- Write parks_summary.json with count, total_area_ha, and bbox_wgs84 as a four-number list.
Pitfalls
- Computing area on the geometries while still in EPSG:4326 gives degree-squared values, so the one-hectare filter rejects almost everything and the totals come out astronomically wrong.
- Reporting total_area_ha as raw square metres (skipping the divide-by-ten-thousand step) puts the number off by a factor of ten thousand and fails the area tolerance.
- Returning the bounding box in EPSG:27700 metres instead of WGS84 lon/lat puts the values around 5x10^5 and 1.8x10^5, far outside any valid latitude or longitude range.
- Swapping lon and lat (writing [ymin, xmin, ymax, xmax]) keeps the numbers inside WGS84 range but flips every bbox componentwise check.
- Reprojecting only the four EPSG:27700 envelope corners to WGS84 instead of reprojecting the filtered features dilates the box by roughly 50 to 200 metres on every side, so all four componentwise checks fail even though count and area are perfect.
- Shaping bbox_wgs84 as a dict instead of the requested four-number array is only sometimes recoverable by the grader's coercion, and an unrecognised key spelling forfeits all four componentwise checks even when the values themselves are correct.
- Computing the bbox over the full unfiltered set rather than the >= 1 ha subset shifts the bbox edges enough to fail several of the componentwise checks.
Map
Recent runs task v2
| adapter | started | score | steps | duration | cost | status |
|---|---|---|---|---|---|---|
| openrouter-gemma4-26b-basic | 2026-06-18T07:32:32Z | pending | — | — | — | pending |
| openrouter-deepseek-v4-flash-basic | 2026-06-18T03:08:04Z | 1.00 | 6 | 0:30 | 0.09¢ | done |
| openrouter-deepseek-v4-flash-detailed | 2026-06-17T22:01:33Z | 1.00 | 5 | 0:40 | 0.09¢ | done |
| openrouter-gemma4-26b-detailed | 2026-06-17T19:47:47Z | 1.00 | 10 | 2:00 | 0.29¢ | done |
| openrouter-deepseek-v4-flash-basic | 2026-06-16T21:43:55Z | 1.00 | 6 | 0:40 | 0.11¢ | done |