Tag: Data Analysis

  • Are US Police Trigger Happy?

    Recently, I stumbled upon a Washington Post article discussing the statistics of police-involved shootings and fatalities over recent years. The article referenced a comprehensive dataset, which I managed to download before encountering the paywall. This dataset documented all fatal police shootings spanning roughly a decade. While the data extended into 2024, I’ve excluded those entries from my analysis since the year is still ongoing.

    The dataset contained several key parameters:

    1. Date
    2. Name
    3. Age
    4. Gender
    5. Armed
    6. Race
    7. City
    8. State
    9. Flee
    10. Body Camera
    11. Signs of Mental Illness
    12. Police Departments Involved

    The dataset revealed a staggering 9,893 cases, an alarmingly high number of individuals who lost their lives without due process, regardless of their alleged criminal activities. Each number represents a person denied their constitutional right to a fair trial.

    After examining the data quality and addressing missing values across various columns, I had to exclude approximately 400 entries, leaving me with 9,509 cases for analysis. Since this retains about 96% of the original records, the sample remains large enough to draw meaningful conclusions about the patterns present in the overall dataset.
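    The cleaning step described above can be sketched roughly as follows. This is only an illustration using Python's standard library: the field names (`date`, `age`, `gender`), the choice of which fields count as required, and the sample rows are all assumptions, not the actual schema or records of the Washington Post dataset.

```python
from datetime import date

# Hypothetical sample rows; field names and values are assumed for illustration.
rows = [
    {"date": "2015-01-02", "age": "53", "gender": "M"},
    {"date": "2019-07-14", "age": "", "gender": "M"},    # missing age
    {"date": "2021-03-30", "age": "27", "gender": ""},   # missing gender
    {"date": "2024-02-11", "age": "34", "gender": "F"},  # incomplete year
    {"date": "2023-11-05", "age": "41", "gender": "F"},
]

REQUIRED = ("date", "age", "gender")  # assumed set of required columns
CUTOFF_YEAR = 2024                    # entries from this year onward are dropped


def clean(records):
    """Keep rows that have every required field and fall before the cutoff year."""
    kept = []
    for r in records:
        # Skip rows where any required field is missing or blank.
        if any(not r.get(col, "").strip() for col in REQUIRED):
            continue
        # Skip rows from the still-incomplete year.
        if date.fromisoformat(r["date"]).year >= CUTOFF_YEAR:
            continue
        kept.append(r)
    return kept


cleaned = clean(rows)
print(f"kept {len(cleaned)} of {len(rows)} rows")
```

    With real data, the rows would come from the downloaded file (for example via `csv.DictReader`) rather than an inline list.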

    Demographics of the Victims

    My initial analysis focused on examining the age distribution of police shooting victims. The data showed a concentration in the 25-60 age range, which aligns with general crime statistics. This age group typically shows higher involvement in criminal activities or presence in high-crime areas.

    Figure: Age distribution

    Further investigation revealed interesting patterns when analyzing racial demographics.

    The data initially appears to reflect expected proportions, given that White Americans comprise roughly 65-70% of the total population, explaining their higher representation among police shooting victims. However, a deeper analysis reveals concerning trends: Black and Hispanic victims show a notably skewed age distribution toward younger ages, with victims predominantly in their late teens and early twenties. In contrast, White victims follow a more normal distribution pattern, typically falling in their late twenties or thirties. This raises questions about whether social changes over the past few decades have led to increased police interactions with younger people of color. While this observation warrants further investigation, additional data would be needed to draw definitive conclusions about these demographic disparities.

    Age Distribution Based On Race
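    One simple way to compare age distributions across groups is to look at the median age per group. The sketch below assumes each record reduces to a (race code, age) pair; the codes and ages are invented toy values, not figures from the dataset.

```python
from collections import defaultdict
from statistics import median

# Invented (race_code, age) pairs for illustration only.
victims = [
    ("W", 35), ("W", 41), ("W", 29),
    ("B", 22), ("B", 19), ("B", 31),
    ("H", 24), ("H", 27),
]


def median_age_by_group(pairs):
    """Group ages by race code and return the median age of each group."""
    ages = defaultdict(list)
    for group, age in pairs:
        ages[group].append(age)
    return {group: median(values) for group, values in ages.items()}


medians = median_age_by_group(victims)
for group, value in sorted(medians.items()):
    print(group, value)
```

    A median is less sensitive to a few very old or very young victims than a mean, which is why it is a reasonable first summary for skewed distributions.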

    While raw numbers provide one perspective, examining the percentage of population affected by police interactions offers deeper insights. I analyzed the Washington Post dataset in conjunction with US demographic data (sourced from here) to calculate these proportional impacts.

    Figure: Percent of victims by race

    In my analysis, I excluded the “Unknown” race category due to the discrepancy between census data accuracy and the Washington Post dataset’s limitations, likely stemming from incomplete police documentation. It’s worth noting that approximately 10% of victims in the original dataset had unspecified racial classifications.

    The proportional analysis reveals striking disparities: Native American and Black populations face double the likelihood of fatal police encounters compared to white populations. Hispanic individuals experience fatal police interactions at rates similar to those of white populations, while Asian Americans show half the likelihood of such encounters. The “Multiple Races” category contained insufficient data points for meaningful analysis, possibly due to inconsistent reporting in police records.
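    The proportional comparison above amounts to dividing victim counts by population and then expressing each rate relative to a baseline group. A minimal sketch follows; the group names and all numbers are placeholders chosen to make the arithmetic easy to follow, not census or dataset figures.

```python
def rate_per_million(victim_counts, populations):
    """Victims per one million residents, for groups present in both inputs."""
    return {
        group: victim_counts[group] * 1_000_000 / populations[group]
        for group in victim_counts
        if group in populations
    }


def relative_to(rates, baseline):
    """Express every rate as a multiple of the baseline group's rate."""
    base = rates[baseline]
    return {group: rate / base for group, rate in rates.items()}


# Placeholder inputs (NOT real data).
victim_counts = {"group_a": 50, "group_b": 20}
populations = {"group_a": 1_000_000, "group_b": 200_000}

rates = rate_per_million(victim_counts, populations)
multiples = relative_to(rates, baseline="group_a")
print(rates)
print(multiples)
```

    In this toy example group_b has fewer victims in absolute terms but twice the per-capita rate, which is the kind of effect raw counts hide.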

    One potential explanation for Asian Americans’ lower representation in police shooting statistics could be their generally reduced frequency of police interactions. While Asian Americans are widely recognized as one of America’s most successful immigrant groups, their family structure, as documented in Pew Research findings, might be a contributing factor. However, this remains a preliminary hypothesis requiring further investigation for definitive conclusions.

    Mental Health of the Victims

    My analysis then shifted to examining the mental health status of victims.

    The findings are concerning: approximately 2,000 victims over the past decade exhibited signs of mental illness. This suggests that redirecting resources toward mental health professionals and social workers might be more effective than relying solely on law enforcement.

    Figure: Mental health

    Breaking down mental health data by race reveals another pattern: among white victims of police shootings, those classified as not mentally ill outnumber those showing signs of mental illness by roughly three to one, while among Black, Hispanic, and Native American victims the ratio is closer to five to one.

    Figure: Mental illness by race

    It’s crucial to note that these mental health classifications are based on behavioral signs observed during police encounters, rather than professional diagnoses or established medical histories.
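    The per-race ratios discussed above can be computed by counting records in each (race, mental illness flag) combination. The following is a sketch under assumptions: the record layout and race codes are hypothetical, and the toy values are invented rather than drawn from the dataset.

```python
from collections import Counter

# Invented (race_code, shows_signs_of_mental_illness) pairs; not real records.
records = [
    ("W", False), ("W", False), ("W", True),
    ("B", False), ("B", False), ("B", False), ("B", False), ("B", True),
]


def not_ill_to_ill_ratio(pairs):
    """Per group, ratio of victims without signs of illness to those with signs."""
    counts = Counter(pairs)
    groups = {group for group, _ in counts}
    ratios = {}
    for group in sorted(groups):
        ill = counts[(group, True)]
        # Avoid dividing by zero when a group has no flagged cases.
        ratios[group] = counts[(group, False)] / ill if ill else None
    return ratios


ratios = not_ill_to_ill_ratio(records)
print(ratios)
```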

    Circumstantial Trends

    I focused on two key situational factors:

    1. Whether the victims were trying to flee
    2. Whether the victims were armed

    Were the Victims trying to Flee?

    Fleeing Behavior

    Analysis across racial demographics indicates that approximately half of the victims were not attempting to escape during their encounters with law enforcement. This suggests that many victims were likely complying with police directives, though definitive conclusions cannot be drawn solely from this dataset.

    Figure: Fleeing mode

    When examining the intersection of mental health status and escape attempts, a notable pattern emerges: the majority of mentally ill victims were not attempting to flee. This observation raises significant concerns about the necessity of lethal force in situations where alternative intervention methods might have been viable.
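    The intersection of mental health status and fleeing behavior can be summarized as the share of victims in each mental health category who were not fleeing. This sketch assumes a hypothetical record layout and flee labels (`"not"`, `"car"`, `"foot"`); the values are toy data.

```python
# Invented (shows_signs_of_mental_illness, flee_mode) pairs; labels are assumed.
records = [
    (True, "not"), (True, "not"), (True, "foot"),
    (False, "car"), (False, "not"), (False, "foot"), (False, "car"),
]


def share_not_fleeing(pairs, mentally_ill):
    """Fraction of victims in the given mental health category who did not flee."""
    subset = [flee for ill, flee in pairs if ill == mentally_ill]
    if not subset:
        return None  # no records in this category
    return sum(1 for flee in subset if flee == "not") / len(subset)


share_ill = share_not_fleeing(records, mentally_ill=True)
share_not_ill = share_not_fleeing(records, mentally_ill=False)
print(f"not fleeing, signs of illness: {share_ill:.0%}")
print(f"not fleeing, no signs of illness: {share_not_ill:.0%}")
```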

    The behavioral patterns of victims, when analyzed across different racial groups, demonstrate remarkable consistency. Where sufficient data exists, the distribution of victim responses appears uniform across racial categories, suggesting that behavioral responses to police encounters transcend racial boundaries.

    Figure: Fleeing behavior by race

    This consistency prompts a critical inquiry: In cases where victims showed no intention to escape, what circumstances prevented successful arrests without resorting to lethal force?

    Were the Victims armed?

    Analysis of weapon possession among victims reveals firearms as the predominant type of armament. However, a distinct pattern emerges among mentally ill victims within Native American and Hispanic communities, where knife possession was notably more prevalent.

    Figure: Were victims armed

    This finding underscores the potential benefits of enhanced firearm regulation in protecting law enforcement officers – a measure that has faced consistent opposition from the National Rifle Association.

    How have Police Shootings trended over time?

    The past decade has witnessed a concerning upward trajectory in police-involved shootings and resultant fatalities. While a ten-year span might seem relatively brief in historical context, the data reveals a disturbing average of approximately 1,000 victims annually.

    Figure: Trend over time
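    A yearly trend and the annual average can be derived by counting incidents per calendar year. The sketch below assumes ISO-formatted date strings; the dates are invented for illustration.

```python
from collections import Counter
from statistics import mean

# Invented ISO dates; not real incident dates.
dates = [
    "2015-03-01", "2015-09-12",
    "2016-01-20", "2016-05-05", "2016-11-30",
    "2017-07-04",
]


def yearly_counts(iso_dates):
    """Count incidents per calendar year from YYYY-MM-DD strings."""
    return Counter(int(d[:4]) for d in iso_dates)


counts = yearly_counts(dates)
average_per_year = mean(list(counts.values()))

for year in sorted(counts):
    print(year, counts[year])
print("average per year:", average_per_year)
```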

    The implementation of body-worn cameras appears to have limited impact, though it’s important to acknowledge potential delays between policy implementation and observable outcomes.

    Figure: Body camera

    Particularly concerning is the fact that body cameras were present in only one-third of documented cases.
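    The share of cases with a body camera, overall or per year, is a simple proportion. This sketch assumes each record reduces to a (year, body camera flag) pair; the values are toy data, not the dataset's.

```python
from collections import defaultdict

# Invented (year, body_camera_present) pairs for illustration only.
records = [
    (2015, False), (2015, False), (2015, True),
    (2016, True), (2016, False),
]


def body_camera_share_by_year(pairs):
    """Fraction of incidents per year in which a body camera was present."""
    totals = defaultdict(int)
    recorded = defaultdict(int)
    for year, has_camera in pairs:
        totals[year] += 1
        if has_camera:
            recorded[year] += 1
    return {year: recorded[year] / totals[year] for year in sorted(totals)}


shares = body_camera_share_by_year(records)
overall = sum(1 for _, has_camera in records if has_camera) / len(records)
print(shares)
print(f"overall share: {overall:.0%}")
```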

    Figure: Body camera usage by race

    Across racial categories, overall body camera usage shows an increasing trend. However, the data suggests a concerning pattern: incidents recorded by body cameras are less likely to have the victim’s race identified in the documentation.

    Conclusion

    While numerous aspects of this issue warrant further investigation, certain data points remain unavailable – notably, comprehensive information about all police interactions, as this dataset exclusively covers fatal encounters.

    Nevertheless, the loss of nearly 10,000 lives to police shootings over a decade represents an alarming figure, particularly considering that about 20% of victims displayed signs of mental illness and roughly half were not attempting to flee.

    This analysis, while revealing, highlights the need for more comprehensive research and complete datasets to fully understand and address these critical issues.

  • Spreadsheets: Common man’s programming tool

    #include <stdio.h>
    
    int main() {
        printf("Hello, World!\n");
        return 0;
    }

    I remember sitting in my computer science class about two decades ago and my teacher teaching us how to print “Hello World”. I never became a computer scientist – nor did I become a professional programmer. But I did come to appreciate how useful programming is for most professions.

    As an experimental materials scientist, I use programming constantly: to manipulate data, to analyze data, and to predict the best possible set of experiments to run. All the while, I wonder why the typical student is taught the dry “Hello World” programming that comes with C, C++, Python, or any other programming language, and is not introduced to the power of spreadsheets. Don’t get me wrong, I don’t underestimate the value of true programming languages, but in my mind =SUM(A1:A45) has more value than printf("Hello, World!\n"); because it offers a more practical entry point. Spreadsheets may not be sexy, but for most people they’re the perfect tool, since they can reduce errors and increase automation, thereby saving time.

    Here are a few good reasons why I feel spreadsheets are quite important:

    1. Low barrier to Entry
    2. Democratization of data
    3. WYSIWYG
    4. Teaching the fundamentals of programming

    And once someone graduates past basic use of a spreadsheet like Microsoft Excel, they can move on to VBA (a built-in programming language within Excel) or Google Apps Script (a built-in programming language within Google Sheets) to enable more complex functionality.

    Spreadsheets offer a powerful and versatile toolset for anyone who works with data. Their low barrier to entry makes them accessible, while features like formulas and conditional formatting automate tasks, saving time and reducing errors. But spreadsheets hold a hidden gem: VBA in Excel and Apps Script in Google Sheets. These built-in programming languages unlock a whole new level of automation and functionality. Imagine automating complex data analysis, generating reports with a single click, or creating custom functions tailored to your specific needs.

    The next time you find yourself drowning in data, don’t underestimate the power of your spreadsheet. With a little exploration and the help of readily available online resources, you can unlock the hidden potential of VBA or Apps Script and transform your workflow. So, ditch the “Hello World” and dive into the world of spreadsheet programming – the possibilities are endless!