Measuring mainstream media coverage on hate crimes against Asian Americans via text analysis and ML using Twitter data

gui33627/AAPI_hate_crimes

Does Mainstream Media Recognize the Surge in Hate Crimes Against Asian Americans?

— Text Analysis using Twitter Data

Description

This study analyzes how the six top U.S. news media outlets covered anti-Asian hate crimes, and what part they played in combating the rising tide of coronavirus-related discrimination and violence against Asian Americans. We performed exploratory analysis on Twitter data collected between March 2020 and March 2021 and trained two supervised machine learning models to predict whether a tweet is related to AAPI hate crimes. According to the model results, fewer than 1% of the tweets these outlets posted between March 2020 and March 2021 potentially discussed anti-Asian hate crimes. Prior to the Atlanta shooting in particular, the news media were silent on hate crimes against Asian Americans despite the alarming rate of incidents. This underscores the minimal role news media have played in combating the surge in discrimination and violence against Asian Americans.
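To make the classification step concrete, here is a minimal sketch of a supervised tweet classifier and of the "share of tweets predicted as related" figure. The README does not name the two models used, so the multinomial Naive Bayes below is a stand-in chosen for illustration, not the project's method. The project itself appears to be written in R; Python is used here only for the sketch. All function names and the toy tweets are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a tweet and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train_naive_bayes(texts, labels):
    """Fit a multinomial Naive Bayes model (labels: 1 = related, 0 = not).

    Assumes both labels occur at least once in the training data.
    """
    word_counts = {0: Counter(), 1: Counter()}
    doc_counts = Counter(labels)
    for text, label in zip(texts, labels):
        word_counts[label].update(tokenize(text))
    vocab = set(word_counts[0]) | set(word_counts[1])
    return {"word_counts": word_counts, "doc_counts": doc_counts, "vocab": vocab}


def predict(model, text):
    """Return 1 if the tweet scores higher as 'related', else 0."""
    total_docs = sum(model["doc_counts"].values())
    vocab_size = len(model["vocab"])
    scores = {}
    for label in (0, 1):
        counts = model["word_counts"][label]
        total_words = sum(counts.values())
        # Log prior plus add-one smoothed log likelihood of each known token.
        score = math.log(model["doc_counts"][label] / total_docs)
        for token in tokenize(text):
            if token in model["vocab"]:
                score += math.log((counts[token] + 1) / (total_words + vocab_size))
        scores[label] = score
    return 1 if scores[1] > scores[0] else 0


def share_related(model, texts):
    """Fraction of tweets predicted as related to AAPI hate crimes."""
    if not texts:
        return 0.0
    return sum(predict(model, t) for t in texts) / len(texts)


# Hypothetical toy examples, not taken from the project's data.
train_texts = [
    "Report anti-Asian hate incidents in your community",
    "Rally against hate crimes targeting Asian Americans",
    "New restaurant opens downtown this weekend",
    "Stock markets close higher after earnings reports",
]
train_labels = [1, 1, 0, 0]
model = train_naive_bayes(train_texts, train_labels)
```

Calling `share_related(model, outlet_tweets)` on one outlet's tweets would give the kind of proportion summarized above, with the caveat that a real model needs far more labeled data than this toy set.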

(Final project for the Messy Data and Machine Learning class at NYU.)

Figure 1: The Number of Times the Tweets Containing “Asian(s)” Appeared over Time

Figure 2: Top 200 Most Frequently Used Words in Tweets predicted as related to reporting of AAPI hate crimes

Data

For the training set, we pulled the tweets of six local Asian news media accounts (NextShark, AAPI Data, Stop AAPI Hate, CAA San Francisco, Asian American Legal, and WashTheHate) and manually labeled each tweet as related to AAPI hate crime issues or not. The test set consists of the tweets of the six most-followed major U.S. news media accounts on Twitter: CNN, the New York Times, Fox News, the Wall Street Journal, TIME, and the Washington Post. Using the ‘academictwitteR’ package, we pulled the tweets that these twelve accounts posted between March 17, 2020 and March 30, 2021. The raw data can be accessed here.
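The split described above can be sketched as follows. This is an illustration only: the project pulled its data with the R package academictwitteR, and the record fields (`account`, `created_at`, `text`) and the function name are assumptions. The account labels are the names given in this README, not verified Twitter handles.

```python
from datetime import date

# Account names as listed in the README; actual handles are not given there.
TRAIN_ACCOUNTS = {
    "NextShark", "AAPI Data", "Stop AAPI Hate",
    "CAA San Francisco", "Asian American Legal", "WashTheHate",
}
TEST_ACCOUNTS = {
    "CNN", "New York Times", "Fox News",
    "Wall Street Journal", "TIME", "Washington Post",
}

# Collection window stated in the README (inclusive on both ends here).
WINDOW_START = date(2020, 3, 17)
WINDOW_END = date(2021, 3, 30)


def split_tweets(tweets):
    """Keep tweets inside the window and split them by account group.

    Each tweet is assumed to be a dict with an ISO 8601 `created_at`
    string (e.g. "2020-05-01T12:00:00Z") and an `account` name.
    """
    train, test = [], []
    for tweet in tweets:
        posted = date.fromisoformat(tweet["created_at"][:10])
        if not (WINDOW_START <= posted <= WINDOW_END):
            continue
        if tweet["account"] in TRAIN_ACCOUNTS:
            train.append(tweet)
        elif tweet["account"] in TEST_ACCOUNTS:
            test.append(tweet)
    return train, test
```

Splitting by account rather than at random mirrors the design in the text: the model is trained on hand-labeled tweets from the Asian news accounts and then applied to the mainstream outlets' tweets.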

Folder structure

| File/Folder Name | Description |
| --- | --- |
| img | Images in the README.rmd |
| twitter_analysis | The script for the analysis |
| MDML_Report | The report for the analysis |
