TY - GEN AB - These data explore social media platforms’ shortcomings when it comes to white supremacist speech and how it differs from general or nonextremist speech, and recommends ways to improve automated hate speech identification methods. <br> Data include 274,668 posts scraped from Stormfront and 509,982 comments collected from the Reddit API. The following files are included: <ul> <li>stormfront_posts.txt: one post per line, no post metadata</li> <li>reddit_posts.txt: one comment per line, no comment metadata</li> <li>stormfront_post_data_processed.json.gz: preprocessed posts from Stormfront, includes post metadata</li> <li>reddit_sample.csv.gz: preprocessed comments from Reddit, includes comment metadata</li> </ul> Twitter data used in the report is not available for public reuse because of Twitter's terms of service and our data use agreement with VOX-Pol. DA - 2023-01-26 ED - Hemphill, Libby ED - Principal Investigator ID - 1 KW - racism KW - hate speech KW - social media L1 - https://socialmediaarchive.org/record/1/files/VeryFinePeople-Report.pdf L2 - https://socialmediaarchive.org/record/1/files/VeryFinePeople-Report.pdf L4 - https://socialmediaarchive.org/record/1/files/VeryFinePeople-Report.pdf LK - https://socialmediaarchive.org/record/1/files/VeryFinePeople-Report.pdf N2 - These data explore social media platforms’ shortcomings when it comes to white supremacist speech and how it differs from general or nonextremist speech, and recommends ways to improve automated hate speech identification methods. <br> Data include 274,668 posts scraped from Stormfront and 509,982 comments collected from the Reddit API. The following files are included: <ul> <li>stormfront_posts.txt: one post per line, no post metadata</li> <li>reddit_posts.txt: one comment per line, no comment metadata</li> <li>stormfront_post_data_processed.json.gz: preprocessed posts from Stormfront, includes post metadata</li> <li>reddit_sample.csv.gz: preprocessed comments from Reddit, includes comment metadata</li> </ul> Twitter data used in the report is not available for public reuse because of Twitter's terms of service and our data use agreement with VOX-Pol. PY - 2023-01-26 T1 - What Social Media Platforms Miss About White Supremacist Speech TI - What Social Media Platforms Miss About White Supremacist Speech UR - https://socialmediaarchive.org/record/1/files/VeryFinePeople-Report.pdf Y1 - 2023-01-26 ER -