Commit
Use OSM data for geocoding in all boroughs (#179)
* output direct to intersections.csv
* get_intersection_center
* rename intersection -> grid
* generate all-intersections file
* geocode broadway & 59
* extract avenue parsing code
* exact geocoding above 125
* checkpoint; this abstraction is breaking
* update test logs
* pare back logging, track parse vs. grid
* factor out a single grid.geocode_intersection function
* Sutton Place
* Riverside Drive
* update test logs
* clean up logging
* debug logging
* interpolate between streets
* track interpolations
* loosen up street matching
* parse ordinals; +26
* logging
* generate all intersections
* 49532 NYC intersections
* de-dupe on name; 46420 intersections
* move Grid into a class
* normalize Fifth Avenue -> 5th Avenue
* exact intersection geocoding for OSM (all boroughs)
* normalize second
* expand_abbrevs
* pare back normalization a bit
* require full match for ordinal rewrite
* fix St. Nicholas bug
* try stripping dirs
* bug fix
* try double-strip; probably overkill
* copy-paste mode and filters for geogpt batch
* prompt variation asking to avoid ave/ave intersection
* ask for an array response
* refactor coders
* update some tests
* fix special cases coder; able to repro current results
* rv irrelevant change
* pare back to direction stripping
* TODO
* default to images.ndjson
* Be more careful about matching "Park Ave" not just "Park"; handle Riverside Park
* fix odd 144 bug
* Exclude Central Park South/West/East/North
* pare back logging, update tests
* update data, stats
* stats, sizes for static site
* attempt to restore generate_intersections.csv
* write both
* I am confused
* never match St/Dr at start of street name
* so many intersection files
* generate all three
* St at start is always Saint, not Street
* update test stats
* pare back logging; geocode.py runs in ~5s
* update site stats
* ruff check
* no status bar
* one more place
* spell it right
* consistent rounding
* update data for rounding
* keep the old name