This repository contains data and code analyzing NYPD Hate Crime data.
This analysis uses the NYPD Hate Crimes public dataset (produced by requirement under New York City Administrative Code § 14-161.
- Name of source:
NYPD_Hate_Crimes_20241206.csv
: Raw data- Live and updated at
NYC OpenData
on a quarterly basis
The sheet contains the following columns (as defined by the OpenData dataset info):
Full Complaint ID
— Identifier for each hate crimes incidentComplaint Year Number
— Year in which incident occurredMonth Number
— Month in which incident occurredRecord Create Date
— Date report was filedComplaint Precinct Code
— NYPD Precinct in which incident occurredPatrol Borough Name
— NYPD Patrol Borough in which incident occurredCounty
— County in which incident occurredLaw Code Category Description
— Category of offenseOffense Description
— A description of the offensePD Code Description
— The NYPD description of the offenseBias Motive Description
— NYPD category of hate crime or bias typeOffense Category
— General categorization of hate crime typeArrest Date
— Date arrest was made (if arrest happened)Arrest Id
— Identifier for arrest (if made)
We'll also be using a key for further contextualizing NYPD Hate Crime data in the context of UCR/NIBRS Federal reporting and NY State DCJS reporting.
- Name of source:
offense_categorization_key.xlsx
This sheet contains the following columns (further explained in the 'Data Dictionary' tab of the file):
Offense
— Offense sourced from aggregate 'PD Code Descriptions' from NYPD Hate Crimes datasetLegal_Level
— Law Code Category Description keypairs sourced from NYPD Hate Crimes datasetNYS_Specified
— "Specified Offense" status defined in NYS Penal Code § 485.05(3) Hate CrimesNIBRS_Code
— Matching NIBRS Offense Code(s) from the federal NIBRS User ManualNIBRS_Group
— NIBRS Offense group classification. Group A offenses are considered more serious and are reported federally even without an arrest. Group B offenses are only reported federally if there has been an arrest.Crime_Against
— UCR/NIBRS Crime Classification by motivation: crimes against persons, crimes against society, crimes against property. Defined in UCR portal methodology.DCJS_Hate_Crimes
— Whether the offense is listed in the DCJS 'Hate Crime Penal Law Reference Table'Violent
— Whether the offense code indicates direct physical harm in the complaint
The notebook 2019-24-nypd-hate-crimes-analysis.ipynb
performs the following analyses on NYPD hate crime data:
- Examines count of incidents per:
- NIBRS Crime category (persons/property/society)
- Borough
- Precinct
- Offense category
- Bias motive
- Analyzes three groupings:
- Full list of Bias Motive Description categories from NYPD
- Aggregated Offense Categories grouping bias motives from NYPD
- Custom aggregate comparison of Anti-LGBTQ, Anti-Religious and Anti-Race/Ethnicity bias motives
- Creates dataframes with calculated statistics about hate crime data per grouping
- Creates plotly graph visualizations of:
- Monthly incident counts with overlaid bar charts showing total vs violent incidents
- Quarterly percentage trends of violent incidents on the opposite axis
- Iteratively creates and displays comparative graphs at three levels of aggregation:
- Full dataset by NYPD Offense Category
- Full dataset by specific Bias Motive Description
- Individual category deep-dives (e.g., "Sexual Orientation" or "Anti-Jewish")
For this section we move to a second notebook precinct_hate_crime_map.ipynb
to map our data across the NYPD precinct does in our dataset using the datawrapper library.
- Creates interactive choropleth maps showing hate crime distribution across NYPD precincts
- Includes detailed tooltips showing breakdowns by category, violence rates, and arrest rates
- Creates filtered datasets of reported hate crimes with arrests
- Outputs arrest data that can be used to request NYPD Arrest Reports or identify specific cases in ECourts.
The notebooks output these spreadsheets:
output/hate_crimes_w_codecats.csv
which contains the raw data with the added column futher categorizing PD Code Descriptions from 'Part 1.'output/anti_lgbt_hate_crime_arrests.csv
which contains the arrest data from 'Part 3.'
It also generates an interactive datawrapper map on the user's datawrapper account.
You can run the analysis yourself. To do so, you'll need the following installed on your computer:
- Python 3
- pandas for data analysis
- plotly for visualizations
- datawrapper for mapping (requires API token)
- The rest of the Python libraries specified in
requirements.txt
All code in this repository is available under the MIT License. The data file in the output/ directory is available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. All files in the data/ directory are released into the public domain.
Contact Audrey Nielsen at nielsenau@gmail.com