How We Assessed the Accuracy of Predictive Policing Software

In the first installment of our series Prediction: Bias, we investigated the use of Geolitica, software that claims to take historical crime data and predict where and when future crime is most likely to occur. We found the software disproportionately directed officers to patrol neighborhoods with relatively higher percentages of low-income, Black, and Latino residents compared with those cities or counties as a whole. Our analysis was based on data produced by Geolitica and provided directly to police departments in 38 jurisdictions across the U.S. Geolitica was known as PredPol until 2021.

To assess what impact the software had on policing, we also compared the predictions to arrests that occurred during the same time period for 11 departments that shared arrest data with us. We found that rates of arrest in predicted areas remained the same, regardless of whether Geolitica predicted a crime that day. In other words, we did not find a strong correlation between predictions and arrests.

At the time of our initial investigation, we could not definitively say how police acted on any individual prediction, because only one jurisdiction shared enough of the software’s “dosage” data, which indicates when an officer went to the location of a predicted crime and how long they stayed in the area. Another jurisdiction shared only two days’ worth of dosage data. The rest claimed either that they didn’t have the data or that it wasn’t public information.

This information is necessary to properly investigate the software’s accuracy because Geolitica asserts that patrols in response to its predictions reduce the likelihood of crimes occurring there. If a police officer visits a prediction location and a crime doesn’t occur, it would be impossible for us to determine whether the prediction itself was inaccurate or whether the officer’s presence at the location deterred someone from committing a crime in the first place. To analyze the accuracy of Geolitica’s software in predicting crimes, we needed to exclude all the prediction locations that were visited by a police officer.
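The exclusion step described above can be illustrated with a minimal sketch. It assumes each prediction and each dosage record can be keyed by a prediction box, a date, and a shift; the field names (`box_id`, `date`, `shift`, `minutes`) and the sample values are hypothetical and are not taken from Geolitica's data format.

```python
from typing import NamedTuple


class Prediction(NamedTuple):
    # Hypothetical fields: one predicted box for one date and shift.
    box_id: str
    date: str
    shift: str


class DosageRecord(NamedTuple):
    # Hypothetical fields: time an officer spent in a predicted box.
    box_id: str
    date: str
    shift: str
    minutes: float


def unvisited_predictions(predictions, dosage):
    """Return only the predictions with no recorded officer visit."""
    # Any record with positive time in the box counts as a visit (an assumption).
    visited = {(d.box_id, d.date, d.shift) for d in dosage if d.minutes > 0}
    return [p for p in predictions if (p.box_id, p.date, p.shift) not in visited]


# Made-up example: box "A" was visited, so only box "B" remains for analysis.
preds = [Prediction("A", "day-1", "day"), Prediction("B", "day-1", "day")]
visits = [DosageRecord("A", "day-1", "day", 12.0)]
remaining = unvisited_predictions(preds, visits)
```

Only the remaining predictions would then be compared against reported crimes, since a visited location cannot distinguish a wrong prediction from a deterred crime.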

Only the city of Plainfield, New Jersey, provided us with dosage data over a long enough time period to conduct our analysis. However, the dosage rate, which indicates how often an officer visited a Geolitica prediction box, was so low that we assumed the dataset was somehow incomplete and therefore not suitable for analysis. At the very end of the reporting process for that story, we learned from Plainfield law enforcement officials that, while the agency had purchased the system and fed data into it, it was rarely, if ever, used to direct patrols. Officials insisted that any time the data showed an officer visiting a prediction location, it was a coincidence, meaning the low dosage rate accurately reflected the reality on the ground rather than an error in the data.
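As a rough illustration of what a dosage rate measures, the sketch below computes the share of predictions that have at least one recorded officer visit. This formula is an assumption made for illustration; the exact definition used in the dosage data is not specified here, and the keys and values are made up.

```python
def dosage_rate(prediction_keys, visited_keys):
    """Share of distinct predictions with at least one recorded visit.

    Both arguments are iterables of hashable keys, for example
    (box_id, date) tuples. The key structure is hypothetical.
    """
    predictions = set(prediction_keys)
    if not predictions:
        return 0.0  # No predictions means there is nothing to visit.
    return len(predictions & set(visited_keys)) / len(predictions)


# Made-up example: one of four predicted boxes was visited.
example_rate = dosage_rate(
    [("A", "day-1"), ("B", "day-1"), ("C", "day-1"), ("D", "day-1")],
    [("A", "day-1")],
)
```

A very low value from a calculation like this is consistent with either missing records or patrols that were not directed by the predictions, which is why confirmation from the department mattered.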

This follow-up analysis uses the crime reports and dosage data received from the Plainfield Police Department (PD) to determine Geolitica’s accuracy in predicting crimes.



The Data

Crime Types

Analysis

Matching Predictions to Reported Crimes

Measuring the Software’s Prediction Success Rate

Robberies and Aggravated Assaults

Burglaries

Alternative Analysis Including “Dosage”

How Many Reported Crimes Did Geolitica’s Software Predict?

Limitations

Company Responses

Conclusion
