dd-l2-tokyo-overture-schools

Analyst notes

Description

Tests whether the agent can map the persona's age-8–14 framing onto Overture's place taxonomy. The hidden judgement is that Japan's compulsory-education range (小学校 + 中学校) lines up with the school-category family in `places.place` (school, elementary_school, middle_school, plus the ownership-tagged private_school and public_school), and that the bare `school` catch-all carries most actual schools in the bundled slice and so must be kept. Underneath that, the task probes reading a Hive-partitioned GeoParquet, filtering on a nested struct field, spatial-joining points against a polygon, and emitting GeoJSON with CJK names round-tripped intact.

Approach

  1. Open the bundled places file and inspect the schema: confirm the categories struct, the names struct, the address list, and the Point geometry in WGS84.
  2. Pick the primary categories that fit the 8–14 age range. Include the generic `school` because it is the catch-all that holds most of the data, plus the labeled school subtypes that map onto Japanese compulsory education. Exclude preschool, high_school, and the specialty schools like driving or language schools.
  3. Read the 23-wards bbox polygon and crop the filtered places to points that sit inside it.
  4. Project the output schema to exactly the six required keys, pulling the first address record's freeform, locality, and postcode fields and keeping the place name verbatim with its CJK characters intact.
  5. Write the result as a GeoJSON FeatureCollection of Points and double-check that the CJK names survive the JSON encoding.

Pitfalls

Inputs

nameformatcrsgeometryfeatures
tokyo_places geoparquet {"$schema": "https://proj.org/schemas/v0.7/projjson.schema.json", "type": "GeographicCRS", "name": "WGS 84 (CRS84)", "datum_ensemble": {"name": "World Geodetic System 1984 ensemble", "members": [{"name": "World Geodetic System 1984 (Transit)"}, {"name": "World Geodetic System 1984 (G730)"}, {"name": "World Geodetic System 1984 (G873)"}, {"name": "World Geodetic System 1984 (G1150)"}, {"name": "World Geodetic System 1984 (G1674)"}, {"name": "World Geodetic System 1984 (G1762)"}, {"name": "World Geodetic System 1984 (G2139)"}, {"name": "World Geodetic System 1984 (G2296)"}], "ellipsoid": {"name": "WGS 84", "semi_major_axis": 6378137, "inverse_flattening": 298.257223563}, "accuracy": "2.0", "id": {"authority": "EPSG", "code": 6326}}, "coordinate_system": {"subtype": "ellipsoidal", "axis": [{"name": "Geodetic longitude", "abbreviation": "Lon", "direction": "east", "unit": "degree"}, {"name": "Geodetic latitude", "abbreviation": "Lat", "direction": "north", "unit": "degree"}]}, "scope": "Not known.", "area": "World.", "bbox": {"south_latitude": -90, "west_longitude": -180, "north_latitude": 90, "east_longitude": 180}, "id": {"authority": "OGC", "code": "CRS84"}} Point 13,402
tokyo_23wards_bbox geojson EPSG:4326 Polygon 1

Expected outputs

nameformatcrsgeometryfeatures
tokyo_schools.geojson geojson EPSG:4326 Point 1,506

Map

Recent runs task v2

adapterstartedscorestepsdurationcoststatus
openrouter-gemma4-26b-basic 2026-06-18T07:32:32Z pending pending
openrouter-deepseek-v4-flash-basic 2026-06-18T03:08:04Z 0.00 33 2:53 0.94¢ done
openrouter-deepseek-v4-flash-detailed 2026-06-17T22:01:33Z done 16 3:11 0.76¢ done
openrouter-gemma4-26b-detailed 2026-06-17T19:47:47Z done 17 3:43 1.04¢ done
openrouter-deepseek-v4-flash-basic 2026-06-16T21:43:55Z 0.83 16 2:07 0.95¢ done