000000018 001__ 18
000000018 005__ 20250502150805.0
000000018 02470 $$ahttps://ai.facebook.com/datasets/casual-conversations-v2-dataset/$$2URL
000000018 037__ $$aADMIN
000000018 245__ $$aCasual Conversations v2 Dataset
000000018 246__ $$aCCv2
000000018 251__ $$av1
000000018 269__ $$a2023-04-07
000000018 336__ $$aDataset
000000018 500__ $$aTo download the dataset, please visit the following webpage: &lt;a href="https://ai.facebook.com/datasets/casual-conversations-v2-downloads/"&gt;https://ai.facebook.com/datasets/casual-conversations-v2-downloads/&lt;/a&gt;
000000018 510__ $$aPorgali, Bilal, Albiero, Vitor, Ryda, Jordan, Ferrer, Cristian Canton, and Hazirbas, Caner. Casual Conversations v2 Dataset. Inter-university Consortium for Political and Social Research [distributor], 2023-04-07. https://socialmediaarchive.org/record/18
000000018 520__ $$aCasual Conversations v2 is composed of over 5,567 participants (26,467 videos) and intended mainly to be used for assessing the performance of already trained models in computer vision and audio applications for the purposes permitted in our data license agreement. The videos feature paid individuals who agreed to participate in the project and explicitly provided Age, Gender, Language/Dialect, Geo-location, Disability, Physical adornments, Physical attributes labels themselves. The videos were recorded in Brazil, India, Indonesia, Mexico, Philippines, United States, and Vietnam with a diverse set of adults in various categories. A group of trained annotators labeled the participants’ apparent skin tone using the Fitzpatrick scale and Monk Scale, in addition to annotations of Voice timbre, Activity and Recording setups. Spoken words in all videos are either scripted (a sample paragraph from The Idiot by Fyodor Dostoevsky provided with the dataset) or nonscripted (answering one of five predetermined questions).
000000018 540__ $$aData are available through Meta AI at &lt;a href="https://ai.facebook.com/datasets/casual-conversations-v2-downloads/"&gt;https://ai.facebook.com/datasets/casual-conversations-v2-downloads/&lt;/a&gt;.
&lt;br&gt;
Download the file "Casual Conversations V2 Dataset License Agreement" for full dataset terms and conditions.
000000018 650__ $$aartificial intelligence
000000018 650__ $$amachine learning
000000018 651__ $$zBrazil
000000018 651__ $$zIndia
000000018 651__ $$zIndonesia
000000018 651__ $$zMexico
000000018 651__ $$zPhilippines
000000018 651__ $$zUnited States
000000018 651__ $$zVietnam
000000018 655__ $$avideo: film, animation, etc.
000000018 720__ $$aPorgali, Bilal$$eData Collector$$uMeta AI$$7Personal
000000018 720__ $$aAlbiero, Vıtor$$eData Collector$$uMeta AI$$7Personal
000000018 720__ $$aRyda, Jordan$$eData Collector$$uMeta AI$$7Personal
000000018 720__ $$aFerrer, Cristian Canton$$eData Collector$$uMeta AI$$7Personal
000000018 720__ $$aHazirbas, Caner$$eData Collector$$uMeta AI$$7Personal
000000018 791__ $$tThe Casual Conversations v2 Dataset$$aDocument$$eIs Source Of$$whttps://doi.org/10.48550/arXiv.2303.04838$$2DOI
000000018 791__ $$tCasual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness$$aConference Presentation$$eIs Source Of$$whttps://doi.org/10.48550/arXiv.2211.05809$$2DOI
000000018 8564_ $$yDescription of and introduction to the dataset, including the link to download the data.$$9cdf2d65d-7797-4da5-8723-dfb34c90b673$$s4007810$$uhttps://socialmediaarchive.org/record/18/files/The%20Casual%20Conversations%20v2%20Dataset.pdf
000000018 8564_ $$yThis document motivates and describes the design of the dataset.$$96bff5111-7932-4bb2-b5ef-616af6cbce09$$s722048$$uhttps://socialmediaarchive.org/record/18/files/Casual%20Conversations%20v2-%20Designing%20a%20large%20consent-driven%20dataset%20to%20measure%20algorithmic%20bias%20and%20robustness.pdf
000000018 8564_ $$yDataset license terms and conditions$$93b0d0f65-3a35-4a19-ab14-d569c916ef5a$$s15798$$uhttps://socialmediaarchive.org/record/18/files/Casual%20Conversations%20V2%20Dataset%20License%20Agreement.rtf
000000018 908__ $$aFacebook
000000018 910__ $$acoded video observation
000000018 911__ $$aAssist in measuring algorithmic fairness and robustness in terms of age, gender, apparent skin tone, language/dialect, geo-location, disability, physical adornment, physical attributes, voice timbre, activity/recording setup conditions.
000000018 912__ $$aVideo recordings of individuals, who are asked predetermined questions from a pre-approved list, to provide their nonscripted answer as well as video recordings of their reading from a scripted text
000000018 913__ $$aTotal number of subjects/actors: 5,567 &lt;br&gt;   
 Total number of video recordings: 26,467 &lt;br&gt;  
 Average per video length: ~1 Minute &lt;br&gt;
000000018 914__ $$aAge (self-provided)  &lt;br&gt; 
Gender (self-provided)   &lt;br&gt;
Language/Dialect (self-provided)   &lt;br&gt;
Geo-location (self-provided)   &lt;br&gt;
Disability (self-provided)   &lt;br&gt;
Physical adornment (self-provided)   &lt;br&gt;
Physical attributes (self-provided)   &lt;br&gt;
Voice timbre (human labeled)   &lt;br&gt;
Apparent skin tone (human labeled)   &lt;br&gt;
 Activity (human labeled) &lt;br&gt;
 Recording setup (human labeled)
000000018 921__ $$amedia unit: video
000000018 980__ $$aDatasets
000000018 981__ $$aPublished