000000018 001__ 18
000000018 005__ 20240801202641.0
000000018 02470 $$ahttps://ai.facebook.com/datasets/casual-conversations-v2-dataset/$$2URL
000000018 037__ $$aADMIN
000000018 245__ $$aCasual Conversations v2 Dataset
000000018 246__ $$aCCv2
000000018 251__ $$av1
000000018 269__ $$a2023-04-07
000000018 336__ $$aDataset
000000018 500__ $$aTo download the dataset, please visit the following webpage: <a href="https://ai.facebook.com/datasets/casual-conversations-v2-downloads/">https://ai.facebook.com/datasets/casual-conversations-v2-downloads/</a>
000000018 510__ $$aPorgali, Bilal, Albiero, Vitor, Ryda, Jordan, Ferrer, Cristian Canton, and Hazirbas, Caner. Casual Conversations v2 Dataset. Inter-university Consortium for Political and Social Research [distributor], 2023-04-07. https://socialmediaarchive.org/record/18
000000018 520__ $$aCasual Conversations v2 comprises 26,467 videos of 5,567 participants and is intended mainly for assessing the performance of already trained models in computer vision and audio applications, for the purposes permitted in our data license agreement. The videos feature paid individuals who agreed to participate in the project and provided their own Age, Gender, Language/Dialect, Geo-location, Disability, Physical adornments, and Physical attributes labels. The videos were recorded in Brazil, India, Indonesia, Mexico, Philippines, United States, and Vietnam with a diverse set of adults in various categories. A group of trained annotators labeled the participants’ apparent skin tone using the Fitzpatrick scale and Monk Scale, and also annotated Voice timbre, Activity, and Recording setups. Spoken words in all videos are either scripted (a sample paragraph from The Idiot by Fyodor Dostoevsky, provided with the dataset) or nonscripted (answering one of five predetermined questions).
000000018 540__ $$aData is available through Meta AI at <a href="https://ai.facebook.com/datasets/casual-conversations-v2-downloads/">https://ai.facebook.com/datasets/casual-conversations-v2-downloads/</a>. <br> Download the file "Casual Conversations V2 Dataset License Agreement" for full dataset terms and conditions.
000000018 650__ $$aartificial intelligence
000000018 650__ $$amachine learning
000000018 651__ $$zBrazil
000000018 651__ $$zIndia
000000018 651__ $$zIndonesia
000000018 651__ $$zMexico
000000018 651__ $$zPhilippines
000000018 651__ $$zUnited States
000000018 651__ $$zVietnam
000000018 655__ $$avideo: film, animation, etc.
000000018 720__ $$aPorgali, Bilal$$eData Collector$$uMeta AI$$7Personal
000000018 720__ $$aAlbiero, Vítor$$eData Collector$$uMeta AI$$7Personal
000000018 720__ $$aRyda, Jordan$$eData Collector$$uMeta AI$$7Personal
000000018 720__ $$aFerrer, Cristian Canton$$eData Collector$$uMeta AI$$7Personal
000000018 720__ $$aHazirbas, Caner$$eData Collector$$uMeta AI$$7Personal
000000018 791__ $$tThe Casual Conversations v2 Dataset$$aDocument$$eIs Source Of$$2DOI$$whttps://doi.org/10.48550/arXiv.2303.04838
000000018 791__ $$tCasual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness$$aConference Presentation$$eIs Source Of$$2DOI$$whttps://doi.org/10.48550/arXiv.2211.05809
000000018 8564_ $$yDescription of and introduction to the dataset, including the link to download the data.$$9cdf2d65d-7797-4da5-8723-dfb34c90b673$$s4007810$$uhttps://socialmediaarchive.org/record/18/files/The%20Casual%20Conversations%20v2%20Dataset.pdf
000000018 8564_ $$yThis document motivates and describes the design of the dataset.$$96bff5111-7932-4bb2-b5ef-616af6cbce09$$s722048$$uhttps://socialmediaarchive.org/record/18/files/Casual%20Conversations%20v2-%20Designing%20a%20large%20consent-driven%20dataset%20to%20measure%20algorithmic%20bias%20and%20robustness.pdf
000000018 8564_ $$yDataset license terms and conditions$$93b0d0f65-3a35-4a19-ab14-d569c916ef5a$$s15798$$uhttps://socialmediaarchive.org/record/18/files/Casual%20Conversations%20V2%20Dataset%20License%20Agreement.rtf
000000018 908__ $$aFacebook
000000018 910__ $$acoded video observation
000000018 911__ $$aAssist in measuring algorithmic fairness and robustness in terms of age, gender, apparent skin tone, language/dialect, geo-location, disability, physical adornment, physical attributes, voice timbre, and activity/recording setup conditions.
000000018 912__ $$aVideo recordings of individuals giving nonscripted answers to predetermined questions from a pre-approved list, as well as video recordings of them reading from a scripted text
000000018 913__ $$aTotal number of subjects/actors: 5,567 <br> Total number of video recordings: 26,467 <br> Average video length: ~1 minute
000000018 914__ $$aAge (self-provided) <br> Gender (self-provided) <br> Language/Dialect (self-provided) <br> Geo-location (self-provided) <br> Disability (self-provided) <br> Physical adornment (self-provided) <br> Physical attributes (self-provided) <br> Voice timbre (human labeled) <br> Apparent skin tone (human labeled) <br> Activity (human labeled) <br> Recording setup (human labeled)
000000018 921__ $$amedia unit: video
000000018 980__ $$aDatasets
000000018 981__ $$aPublished