My research question is: Do racial disparities in fatal police shootings persist after controlling for whether the individual was armed?
In class we discussed how there is a 7-year age gap between White and Black victims in fatal police shootings. We also noted that the data cannot fully determine whether bias exists, because it does not directly measure bias. However, I think one way to approach the issue is to look for patterns that might suggest bias, even if they cannot prove it directly. That is why I am focusing on my question about racial disparities and armed status.
The outcome variable is already fixed because the dataset only contains fatal police shootings. My main variable of interest is race, and my key control variable is whether the individual was armed or unarmed. I started by looking at the number of armed versus unarmed individuals in the dataset. From there, I planned to calculate percentages of armed versus unarmed across racial groups, which helps show whether disparities appear different when weapon status is considered.
To go deeper, I will use logistic regression in Python. The dependent variable will be whether someone was unarmed (1 = unarmed, 0 = armed), with race as the main predictor and armed status as a control. This will allow me to see if racial disparities remain significant after controlling for whether the person was armed. In addition, I plan to explore age patterns, since earlier discussion in class suggested there may be meaningful differences between White and Black victims in terms of age.
While setting this up, I installed Python and VS Code to run my analysis. I ran into technical challenges when trying to install pandas, the main library I plan on using. Pip was not being recognized correctly, which prevented me from completing the setup. I used ChatGPT to help me format my Python code and troubleshoot the installation error. Once I finish fixing this issue, I will be able to implement my analysis.
At this stage, I am working backwards to get my environment set up properly so that I can install pandas and finish preparing the dataset. Once that is fixed, I will move forward with cleaning the armed variable into a simple “Armed vs. Unarmed” grouping, run descriptive statistics, and then build the logistic regression to directly test my research question.