Politweets: Tweets of politicians, celebrities, news media, and influencers from India and the United States
2023
Description
This dataset contains tweets of politicians, celebrities, news media, and influencers from India and the United States. The data cover accounts of over 9600 public figures in the United States and over 33000 public figures in India.
See additional project details: https://github.com/casmlab/politicians-tweets
Details
Title
Politweets: Tweets of politicians, celebrities, news media, and influencers from India and the United States
Creator
Panda, Anmol Data Manager (University of Michigan)
Hemphill, Libby Project Manager (University of Michigan)
Pal, Joyojeet Project Member (University of Michigan)
Subject
Issued Date
2023-05-26
Version
v1.1
Alternate Identifiers
Status
Published
Access Rights
This dataset has one level of access: Restricted.
- Restricted data files require a Restricted Data Application and will typically be accessed through a secure virtual data enclave.
Citation
Panda, Anmol, Hemphill, Libby, and Pal, Joyojeet. Politweets: Tweets of politicians, celebrities, news media, and influencers from India and the United States. Inter-university Consortium for Political and Social Research [distributor], 2023-05-26. https://doi.org/10.3886/xm68-rw44
Record Appears in
Time Period
January 1, 2019 to April 20, 2023
Collection Date
January 1, 2019 to April 20, 2023
Geographic Coverage
India
United States
Platform
Twitter
Collection Modes
application programming interface (API)
Data Formats
text
Purpose
Tweets of politicians and influencers from India and the US
Design
The account holders are defined as public figures, and the consent requirement was waived.
Universe
Politicians, including members of political parties, elected representatives, and government agencies.
Celebrities, including sports persons, actors, musicians, journalists, media personalities, news media organizations, and social media influencers.
Variables
All attributes of a tweet, such as tweet text, retweet count, favorite count and time of tweet.
Sampling
other
Additional Notes
The date range above runs from the earliest tweet collected for any account to the latest tweet collected for any account.
Data may be absent for certain accounts due to the following reasons:
- Accounts were added iteratively over time, so tweet collection for some accounts began later (for example, August 2020).
- Tweet collection for US politicians' accounts was initiated in 2020, whereas for influencers it started in June 2022.
- For accounts that had posted fewer than 3200 tweets at the time data collection began for that account, we have all of their tweets (including those prior to Jan 2019)
- For accounts that tweet very frequently, we may not have all of their tweets due to limits imposed by the public API
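The coverage notes above can be turned into a rough per-account check. The following is a minimal sketch, not part of the dataset or its tooling: the `AccountInfo` record and its field names are hypothetical, and the only value taken from the notes is the 3200-tweet threshold. It flags whether an account's full history is expected to be present or whether tweets from before collection began may be missing.

```python
from dataclasses import dataclass
from datetime import date

# Threshold stated in the notes above: accounts with fewer tweets than this
# at the start of collection are expected to have their full history.
TIMELINE_LIMIT = 3200


@dataclass
class AccountInfo:
    """Hypothetical per-account record; these fields are assumptions."""
    handle: str
    tweets_at_collection_start: int  # total tweets posted when collection began
    collection_start: date           # date collection began for this account


def expect_full_history(account: AccountInfo) -> bool:
    """Return True if the account was under the limit when collection began."""
    return account.tweets_at_collection_start < TIMELINE_LIMIT


def coverage_note(account: AccountInfo) -> str:
    """Describe the expected coverage for one account in plain words."""
    if expect_full_history(account):
        return "full history expected, including tweets before collection began"
    start = account.collection_start.isoformat()
    return f"tweets posted before {start} may be incomplete"


if __name__ == "__main__":
    # Made-up example accounts, for illustration only.
    for acct in (
        AccountInfo("example_a", 1200, date(2020, 8, 1)),
        AccountInfo("example_b", 45000, date(2022, 6, 1)),
    ):
        print(acct.handle, "->", coverage_note(acct))
```

This check covers only the third note. Accounts that tweet very frequently may still have gaps after collection began, which a check like this cannot detect.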
Related Resources
Metadata and Project details, Document, is Part Of, URL, https://github.com/casmlab/politicians-tweets
NivaDuck - A Scalable Pipeline to Build a Database of Political Twitter Handles for India and the United States, Journal Article, Is Source Of, DOI, https://doi.org/10.1145/3400806.3400830