Clean up Santa Clara County street import #51
Comments
By the way, #47 has an idea for dating streets with more certainty back to 1992. But if we want to stick with this outdated county dataset, we should replace the |
@jeffreyameyer do you remember how the start dates came about? Were the years for real but with placeholder months and days?
Ok - clearly, I've left some incomplete work - my apologies! But I do think things can be cleaned up quickly. Please see notes / comments below. The years were largely set by choosing an arbitrary (sorry!) old year, then slowly comparing against old maps and adjusting backward as the maps got older. Roads that stopped showing up as you went back in time didn't get older years; those that did show up continued to get older years. This is not a foolproof method, but it is directionally useful, and having EDTF tags is indeed a better solution than the "arbitrary" markings.
Yes, both the |
In 2020, Stanford Libraries republished a public domain dataset of streets throughout Santa Clara County that the Santa Clara County Planning Office used to publish on its open data portal.1 Last August, @jeffreyameyer imported an extract of this dataset into OHM, 1,886 features in all, ahead of a presentation at Stanford. The import covers the Stanford campus, downtown Mountain View, and some major streets in that part of the San Francisco Peninsula. This issue tracks cleaning up the import to follow OHM norms.
The dataset has a `date_creat` field, but this only indicates when the feature was added to the database in ArcGIS, generally between 2004 and 2008. By contrast, the import tagged every street as if it started on March 1 in various years in the 19th and 20th centuries.2 These seem to be estimates based on some old maps, but the placeholder month and day leave me a bit uncertain about that.

Aside from dates, most of the other attributes need to be cleaned up. For example, on this stretch of San Antonio Road:
- Remove `group=t`. I have no idea what it means,3 but it doesn’t appear in the dataset and isn’t an established OHM tag.
- […] (`fcc` field) to `highway=*` tags, for the benefit of future imports.
- […] `name=*` based on the `streetpref`, `streetsuff`, and `streettype` fields.
- Replace `oneway=ft` and `oneway=tf` with `oneway=yes` and `oneway=-1`, respectively. (Better yet, delete `oneway=tf` and reverse those ways.)
- Remove `roadlabel=*`, which is redundant to `name=*` but less polished.
- Remove `streetname=*`, `streetpref=*`, `streetsuff=*`, and `streettype=*`. (Alternatively, propose a more structured tagging scheme for street names that isn’t specific to this dataset.)
- Replace `surface=asphalt` with `surface=paved`. The `surface` field’s `PAV` value doesn’t specify the kind of pavement, and I don’t think we’d be able to track minute changes in pavement material over time without massive effort.

Footnotes
1. This dataset has been superseded by a continuously updated Road Centerlines dataset, also in the public domain.
2. 1924 was not a leap year, so every software package in our stack interprets `start_date=1924-02-29` as March 1, 1924.
3. Unfortunately, the original dataset is no longer available online, and although it came with an FGDC metadata file, this file says nothing about each attribute.
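
The purely mechanical tag changes in the cleanup list above (dropping the dataset-specific keys, normalizing `oneway`, and generalizing `surface`) could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `clean_tags`, the constants, and the sample tag values are hypothetical and not part of any OHM tooling. It leaves `name=*` and `highway=*` untouched and does not reverse any ways.

```python
# Hypothetical sketch of the mechanical tag cleanup; not an existing OHM tool.

# Keys the issue proposes to drop (assumption: dropped regardless of value).
KEYS_TO_REMOVE = (
    "group",
    "roadlabel",
    "streetname",
    "streetpref",
    "streetsuff",
    "streettype",
)

# Dataset-style oneway values mapped to the values named in the issue.
ONEWAY_REPLACEMENTS = {"ft": "yes", "tf": "-1"}


def clean_tags(tags):
    """Return a cleaned copy of one feature's tags; the input is not modified."""
    cleaned = dict(tags)

    # oneway=ft -> oneway=yes, oneway=tf -> oneway=-1
    oneway = cleaned.get("oneway")
    if oneway in ONEWAY_REPLACEMENTS:
        cleaned["oneway"] = ONEWAY_REPLACEMENTS[oneway]

    # The source data only says "paved", so avoid claiming a specific material.
    if cleaned.get("surface") == "asphalt":
        cleaned["surface"] = "paved"

    # Drop keys that are redundant or specific to this dataset.
    for key in KEYS_TO_REMOVE:
        cleaned.pop(key, None)

    return cleaned


# Usage with made-up tag values for a single way:
example = {
    "name": "San Antonio Road",
    "group": "t",
    "oneway": "ft",
    "surface": "asphalt",
    "roadlabel": "SAN ANTONIO RD",
}
print(clean_tags(example))
```

Ways that end up with `oneway=-1` would still need to be reviewed by hand if the preferred fix is to reverse them and delete the tag instead.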