Skip to main content
This dataset is collected by Scout and organized into two main tables: profiles and tweets.

Table Schemas

profiles

Column NameTypeNullableDescription
banner_image_urltexttrueURL of the user’s banner image
biotexttrueBiography of the Twitter user
created_timestamptimestamp with timezonetrueTimestamp when the Twitter account was created
display_nametexttrueDisplay name of the Twitter user
followers_countbiginttrueNumber of followers the user has
following_countbiginttrueNumber of accounts the user is following
idtextfalseUnique identifier for the Twitter user profile
is_blue_verifiedbooleantrueIndicates if the user is blue verified
linksARRAYtrueLinks associated with the Twitter user
locationtexttrueLocation of the Twitter user
nametexttrueName of the Twitter user
pinned_tweet_idsARRAYtrueIDs of tweets pinned by the user
profession_categoriesARRAYtrueCategories related to the user’s profession
profession_typetexttrueType of profession of the Twitter user
profile_image_urltexttrueURL of the user’s profile image
tweets_countbiginttrueTotal number of tweets made by the user

tweets

Column NameTypeNullableDescription
article_cover_image_urltexttrueURL of the article’s cover image
article_idtexttrueID of the article if the tweet links to one
article_preview_texttexttruePreview text of the linked article
article_titletexttrueTitle of the linked article
bookmarksbiginttrueNumber of bookmarks of the tweet
hashtagsARRAYtrueHashtags used in the tweet
idtextfalseUnique identifier for the tweet
imagesARRAYtrueImages associated with the tweet
is_notebooleantrueIndicates if the tweet is a note
likesbiginttrueNumber of likes the tweet has received
poster_idtextfalseID of the user who posted the tweet
quotesbiginttrueNumber of quotes of the tweet
quote_parent_idtexttrueID of the parent tweet if this tweet is a quote
quote_poster_idtexttrueID of the user who posted the parent tweet
repliesbiginttrueNumber of replies to the tweet
reply_conversation_idtexttrueID of the conversation this tweet is part of
reply_parent_idtexttrueID of the parent tweet in the reply conversation
reply_poster_idtexttrueID of the user who posted the parent tweet
repostsbiginttrueNumber of times the tweet has been reposted
retweet_parent_idtexttrueID of the parent tweet if this tweet is a retweet
retweet_poster_idtexttrueID of the user who posted the parent tweet
tagged_usersARRAYtrueUsers tagged in the tweet
texttexttrueContent of the tweet
timestamptimestamp with timezonefalseTimestamp when the tweet was posted
videosARRAYtrueVideos associated with the tweet
viewsbiginttrueNumber of views the tweet has received

Enterprise Only

profile_timeseries

Column NameTypeNullableDescription
idtextfalseUnique identifier for the Twitter user profile
timestamptimestamp with timezonefalseTimestamp when the profile stats snapshot was taken
following_countbiginttrueNumber of accounts the user is following at this time
followers_countbiginttrueNumber of followers the user has at this time
tweets_countbiginttrueTotal number of tweets made by the user at this time
Each row represents a snapshot of a profile’s statistics at a specific timestamp. The data is updated periodically at the same time as the profiles table and is ordered by timestamp for optimal query performance when using timestamp in query conditionals.

articles

Column NameTypeNullableDescription
content_jsonjsonbtrueFull article content in JSON format
cover_image_urltexttrueURL of the article’s cover image
idtextfalseUnique identifier for the article
media_entities_jsonjsonbtrueMedia entities associated with the article in JSON
parent_tweet_idtexttrueID of the tweet that links to this article
poster_idtextfalseID of the user who posted the article
preview_texttexttruePreview text of the article
timestampbiginttrueTimestamp when the article was created
titletexttrueTitle of the article

Utilities for the Twitter dataset

  1. Tweet Id to URL: https://twitter.com/i/web/status/{tweetId}
  2. Profile Id to URL: https://twitter.com/intent/user?user_id={userId}
I