TY - GEN AB - Casual Conversations v2 is composed of over 5,567 participants (26,467 videos) and intended mainly to be used for assessing the performance of already trained models in computer vision and audio applications for the purposes permitted in our data license agreement. The videos feature paid individuals who agreed to participate in the project and explicitly provided Age, Gender, Language/Dialect, Geo-location, Disability, Physical adornments, Physical attributes labels themselves. The videos were recorded in Brazil, India, Indonesia, Mexico, Philippines, United States, and Vietnam with a diverse set of adults in various categories. A group of trained annotators labeled the participants’ apparent skin tone using the Fitzpatrick scale and Monk Scale, in addition to annotations of Voice timbre, Activity and Recording setups. Spoken words in all videos are either scripted (a sample paragraph from The Idiot by Fyodor Dostoevsky provided with the dataset) or nonscripted (answering one of five predetermined questions). DA - 2023-04-07 ED - Porgali, Bilal ED - Albiero, Vıtor ED - Ryda, Jordan ED - Ferrer, Cristian Canton ED - Hazirbas, Caner ED - Data Collector ED - Data Collector ED - Data Collector ED - Data Collector ED - Data Collector ID - 18 KW - artificial intelligence KW - machine learning L1 - https://socialmediaarchive.org/record/18/files/The%20Casual%20Conversations%20v2%20Dataset.pdf L1 - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20v2-%20Designing%20a%20large%20consent-driven%20dataset%20to%20measure%20algorithmic%20bias%20and%20robustness.pdf L1 - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20V2%20Dataset%20License%20Agreement.rtf L2 - https://socialmediaarchive.org/record/18/files/The%20Casual%20Conversations%20v2%20Dataset.pdf L2 - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20v2-%20Designing%20a%20large%20consent-driven%20dataset%20to%20measure%20algorithmic%20bias%20and%20robustness.pdf L2 - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20V2%20Dataset%20License%20Agreement.rtf L4 - https://socialmediaarchive.org/record/18/files/The%20Casual%20Conversations%20v2%20Dataset.pdf L4 - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20v2-%20Designing%20a%20large%20consent-driven%20dataset%20to%20measure%20algorithmic%20bias%20and%20robustness.pdf L4 - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20V2%20Dataset%20License%20Agreement.rtf LK - https://socialmediaarchive.org/record/18/files/The%20Casual%20Conversations%20v2%20Dataset.pdf LK - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20v2-%20Designing%20a%20large%20consent-driven%20dataset%20to%20measure%20algorithmic%20bias%20and%20robustness.pdf LK - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20V2%20Dataset%20License%20Agreement.rtf N1 - To download the dataset, please visit the following webpage: <a href="https://ai.facebook.com/datasets/casual-conversations-v2-downloads/">https://ai.facebook.com/datasets/casual-conversations-v2-downloads/</a> N2 - Casual Conversations v2 is composed of over 5,567 participants (26,467 videos) and intended mainly to be used for assessing the performance of already trained models in computer vision and audio applications for the purposes permitted in our data license agreement. The videos feature paid individuals who agreed to participate in the project and explicitly provided Age, Gender, Language/Dialect, Geo-location, Disability, Physical adornments, Physical attributes labels themselves. The videos were recorded in Brazil, India, Indonesia, Mexico, Philippines, United States, and Vietnam with a diverse set of adults in various categories. A group of trained annotators labeled the participants’ apparent skin tone using the Fitzpatrick scale and Monk Scale, in addition to annotations of Voice timbre, Activity and Recording setups. Spoken words in all videos are either scripted (a sample paragraph from The Idiot by Fyodor Dostoevsky provided with the dataset) or nonscripted (answering one of five predetermined questions). PY - 2023-04-07 T1 - Casual Conversations v2 Dataset TI - Casual Conversations v2 Dataset UR - https://socialmediaarchive.org/record/18/files/The%20Casual%20Conversations%20v2%20Dataset.pdf UR - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20v2-%20Designing%20a%20large%20consent-driven%20dataset%20to%20measure%20algorithmic%20bias%20and%20robustness.pdf UR - https://socialmediaarchive.org/record/18/files/Casual%20Conversations%20V2%20Dataset%20License%20Agreement.rtf Y1 - 2023-04-07 ER -