Different names for the same name ID: need to use name IDs only when grouping #11

Open
lmullen opened this issue Aug 18, 2016 · 3 comments

Comments

@lmullen
Member

lmullen commented Aug 18, 2016

In elections, NNV sometimes reports votes separately for people with similar names WHO HAVE THE SAME NAME ID. In effect, the data treats them as different people, but they are likely the same person. For instance, Lucas Elmendorf and Lucas Elmendorf Jr. both get columns in this election, even though they share a single name ID.

So when we group, we have to rely on the name ID only and disregard the names altogether. But first we will have to check that everyone has a name ID. For anyone without one (e.g., null), we will either have to use their name as a stand-in name ID, or we will have to add them.
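A rough sketch of that grouping rule, in Python for illustration only. The row layout, the `EL0001` ID, and the vote counts are made up, and the `id:`/`name:` prefixes are just one assumed way to keep fallback keys from colliding with real name IDs.

```python
from collections import defaultdict

# Hypothetical rows: (election_id, candidate_name, name_id, votes).
# The IDs and vote counts are invented, not taken from NNV.
rows = [
    ("e1", "Lucas Elmendorf", "EL0001", 120),
    ("e1", "Lucas Elmendorf Jr.", "EL0001", 30),
    ("e1", "Smith", None, 12),
]

def grouping_key(name, name_id):
    """Use the name ID when present; otherwise fall back to the name."""
    if name_id is None or name_id.strip() == "":
        return "name:" + name
    return "id:" + name_id

def total_votes(rows):
    """Sum votes per (election, grouping key), ignoring name spelling
    whenever a name ID is available."""
    totals = defaultdict(int)
    for election_id, name, name_id, votes in rows:
        totals[(election_id, grouping_key(name, name_id))] += votes
    return dict(totals)

totals = total_votes(rows)
```

With these sample rows, the two Elmendorf spellings collapse into a single `id:EL0001` group, while the ID-less `Smith` stays in its own `name:Smith` group.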

@lmullen
Member Author

lmullen commented Aug 18, 2016

@lmullen
Member Author

lmullen commented Aug 18, 2016

There are more than 3,300 distinct names without name IDs. Many of them are just last names. But some, like Lucas E. Elmendorf and Lucas Elmendorf, are undoubtedly the same person as the Lucas Elmendorf and the Lucas Elmendorf Jr. who do have name IDs.
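One possible way to surface those likely matches for manual review, sketched in Python. The normalization here (lowercasing, dropping periods and commas, single-letter initials, and Jr/Sr suffixes) is an assumed heuristic, not something the data guarantees, and the records and IDs are invented.

```python
import re

SUFFIXES = {"jr", "sr"}

def normalize(name):
    """Crude heuristic: lowercase, strip periods and commas, then drop
    single-letter initials and Jr/Sr suffixes."""
    tokens = re.sub(r"[.,]", "", name.lower()).split()
    return " ".join(t for t in tokens if len(t) > 1 and t not in SUFFIXES)

def possible_matches(records):
    """Map each ID-less name to the name IDs whose names normalize
    to the same string. Results are suggestions only."""
    known = {}
    for name, name_id in records:
        if name_id:
            known.setdefault(normalize(name), set()).add(name_id)
    matches = {}
    for name, name_id in records:
        if not name_id and normalize(name) in known:
            matches[name] = sorted(known[normalize(name)])
    return matches

# Hypothetical (name, name_id) pairs; the ID is invented.
records = [
    ("Lucas Elmendorf", "EL0001"),
    ("Lucas Elmendorf Jr.", "EL0001"),
    ("Lucas E. Elmendorf", None),
    ("Lucas Elmendorf", None),
    ("Smith", None),
]
matches = possible_matches(records)
```

A heuristic like this would merge a father and son who differ only by suffix, and bare last names will rarely match anything, so the output is better treated as a review list than as an automatic assignment.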

@lmullen
Member Author

lmullen commented Aug 18, 2016

There are 3,765 name IDs that have more than one name attached, some of them with as many as 10 variant names.
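A count like this can be reproduced by collecting the distinct names seen under each name ID. The sketch below is in Python with made-up records and IDs; the real table and column names are not shown in this thread.

```python
from collections import defaultdict

def variant_names_by_id(records):
    """Return {name_id: sorted distinct names} for every name ID
    that appears with more than one distinct name."""
    variants = defaultdict(set)
    for name, name_id in records:
        if name_id:  # skip rows with no name ID
            variants[name_id].add(name)
    return {nid: sorted(names) for nid, names in variants.items()
            if len(names) > 1}

# Hypothetical (name, name_id) pairs; the IDs are invented.
records = [
    ("Lucas Elmendorf", "EL0001"),
    ("Lucas Elmendorf Jr.", "EL0001"),
    ("Lucas Elmendorf", "EL0001"),
    ("John Smith", "SJ0002"),
    ("Smith", None),
]
duplicates = variant_names_by_id(records)
most_variants = max((len(v) for v in duplicates.values()), default=0)
```

Here `len(duplicates)` corresponds to the number of name IDs with variant names, and `most_variants` to the largest number of variants under a single ID.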
