dd-l2-bangkok-multicrs-audit

Analyst notes

Description

An L2 data-discovery probe that tests whether the agent can enumerate every layer in a multi-layer GPKG, read each layer's declared CRS without silently reprojecting, sample a coordinate in the layer's own units, and spot Latin-1 mojibake in the Thai-script labels. The prompt names the two allowed encoding labels and the output schema but deliberately leaves the detection recipe and the multi-layer structure implicit. The agent therefore has to infer from the phrase "encoding anomalies" what to look for, and has to remember that a GeoPackage is a container that may hold many layers.
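Because a GeoPackage is an SQLite database, its layer registry can be read directly without any reprojection taking place. The following is a minimal sketch, assuming the file follows the standard GeoPackage schema (`gpkg_contents`, `gpkg_geometry_columns`, `gpkg_spatial_ref_sys`); the function name `list_gpkg_layers` and the returned dictionary keys are hypothetical and are not taken from the task prompt.

```python
import sqlite3


def list_gpkg_layers(path):
    """Return one dict per feature layer registered in a GeoPackage.

    Reads only the declared metadata; no geometry is decoded or reprojected.
    """
    query = """
        SELECT c.table_name,
               g.geometry_type_name,
               s.organization,
               s.organization_coordsys_id
        FROM gpkg_contents AS c
        JOIN gpkg_geometry_columns AS g ON g.table_name = c.table_name
        JOIN gpkg_spatial_ref_sys AS s ON s.srs_id = g.srs_id
        WHERE c.data_type = 'features'
        ORDER BY c.table_name
    """
    con = sqlite3.connect(path)
    try:
        layers = []
        for name, geom_type, org, code in con.execute(query).fetchall():
            # Layer names come from the file itself, so quote them defensively.
            quoted = '"' + name.replace('"', '""') + '"'
            count = con.execute(f"SELECT COUNT(*) FROM {quoted}").fetchone()[0]
            layers.append(
                {
                    "layer": name,  # hypothetical key names
                    "crs": f"{org.upper()}:{code}",
                    "geometry": geom_type,
                    "features": count,
                }
            )
        return layers
    finally:
        con.close()
```

Higher-level libraries expose the same information, but going through the registry tables makes it hard to miss a layer by opening the file as if it were a single dataset.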

Approach

  1. List every layer in the GeoPackage rather than opening it as a single dataset.
  2. For each layer, read the declared CRS, geometry type, and feature count straight from the layer metadata without reprojecting.
  3. Pick any one feature per layer and report its coordinates in the layer's own CRS, so an EPSG:24047 row reports metres and an EPSG:4326 row reports degrees.
  4. For each layer's Thai-script text column, check whether encoding the stored string as Latin-1 and decoding the resulting bytes as UTF-8 succeeds and yields a different string, which is the signature of the classic double-encoding bug in which UTF-8 bytes are misread as Latin-1 and then written back out as UTF-8.
  5. Tag each layer as `latin1-mojibake` if that round trip changes the text, otherwise `utf-8`.
  6. Assemble one row per layer into `crs_audit.csv` with the column names listed in the prompt.
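Steps 4 to 6 can be sketched with the standard library alone. This is one plausible reading of the round-trip check, not the task's reference implementation: a label counts as mojibake when it can be encoded as Latin-1 and the resulting bytes decode as valid UTF-8 into a different string. The names `encoding_tag` and `write_audit` are hypothetical, and the column names are passed in by the caller because the prompt's schema is not reproduced here.

```python
import csv


def encoding_tag(text):
    """Tag a label as 'latin1-mojibake' or 'utf-8' via a reverse round trip."""
    try:
        # Undo a suspected "UTF-8 bytes read as Latin-1" step.
        repaired = text.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # Genuine Thai characters cannot be encoded as Latin-1, and bytes
        # that are not valid UTF-8 were never double-encoded in this way.
        return "utf-8"
    return "latin1-mojibake" if repaired != text else "utf-8"


def write_audit(rows, path, fieldnames):
    """Write one row per layer; fieldnames must match the prompt's schema."""
    with open(path, "w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
```

Pure-ASCII labels survive the round trip unchanged and are tagged `utf-8`, so the check only fires on text that actually carries the double-encoding signature.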

Pitfalls

Inputs

name                         format  crs         geometry  features
bangkok_contractor_delivery  gpkg    EPSG:24047  Polygon   4,000

Expected outputs

name           format  crs  geometry  features
crs_audit.csv  csv     n/a  n/a

Recent runs (task v2)

adapter                                started               score    steps  duration  cost   status
openrouter-gemma4-26b-basic            2026-06-18T07:32:32Z  pending                          pending
openrouter-deepseek-v4-flash-basic     2026-06-18T03:08:04Z  1.00     5      0:40      0.16¢  done
openrouter-deepseek-v4-flash-detailed  2026-06-17T22:01:33Z  1.00     6      1:24      0.23¢  done
openrouter-gemma4-26b-detailed         2026-06-17T19:47:47Z  0.88     9      1:08      0.32¢  done
openrouter-deepseek-v4-flash-basic     2026-06-16T21:43:55Z  1.00     5      0:54      0.12¢  done