-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Endpoint to match agencies for submission #550
Labels
Comments
6 tasks
josh-chamberlain
transferred this issue from Police-Data-Accessibility-Project/scrapers
Nov 13, 2024
josh-chamberlain
changed the title
Utility for getting PDAP agencies for data source collectors
Endpoint to match agencies for submission
Dec 4, 2024
josh-chamberlain
transferred this issue from Police-Data-Accessibility-Project/data-source-identification
Dec 4, 2024
Looking at this now! @josh-chamberlain When we say it should accept a batch, are we thinking of providing that batch via csv, json, or both? |
In terms of implementation, here's my initial thinking: Steps
Possible OutcomesA. Exact match Other Considerations
|
maxachis
added
the
fixed_in_dev
This is merged into the dev environment and waiting to be merged into main
label
Dec 11, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
Context
We have tools which generate batches of Data Sources to submit to our db. The biggest hurdle is typically that agencies out in the world aren't named the exact same thing as they are in our database; we need an
agency_described
which matches our database. There are 3 potential things that can happen:* happy
: the name is exactly the same in our database, and we get a single match. hooray!~ different
: the agency exists in our database by a different name.+ new
: the agency does not exist in our database and we need to add it.x unknown
: we don't even have anexternal name
for the agency; we have no idea what to guess. that's out of scope but begun here: Identify agencies data-source-identification#15Requirements
agency-match
endpoint which we can use to find proper names for our agenciesexternal name
county
state
locality
agency name
: great for CSV submissionsagency id
: good for hitting the APIexact match
, if one confident match is detected; this covers thehappy
anddifferent
pathspossible match
, an exact match is not found, but there are one or more agencies within the same location which might matchno match
, No agencies found for that location and/or meet the threshold for being a possible match.Open questions
How would we do this internally?
We have several possible ways of approaching this, depending on our preferences and how thorough we want to be. This includes:
The text was updated successfully, but these errors were encountered: