Twitter

Posts, likes, retweets, and metadata from all collected public tweets and profiles.

This dataset is structured into two main tables: profiles and tweets.

The profiles table contains information about Twitter user profiles, with the following columns:

Column NameDescription
collect_timestampTimestamp when the profile data was collected
idUnique identifier for the Twitter user profile
nameName of the Twitter user
display_nameDisplay name of the Twitter user
bioBiography of the Twitter user
is_blue_verifiedIndicates if the user is blue verified
created_timestampTimestamp when the Twitter account was created
locationLocation of the Twitter user
profession_typeType of profession of the Twitter user
profession_categoriesCategories related to the user’s profession
followingNumber of accounts the user is following
followersNumber of followers the user has
tweets_countTotal number of tweets made by the user
linksLinks associated with the Twitter user
banner_image_urlURL of the user’s banner image
profile_image_urlURL of the user’s profile image
pinned_tweet_idsIDs of tweets pinned by the user

The tweets table contains information about tweets made by users, with the following columns:

Column NameDescription
collect_timestampTimestamp when the tweet data was collected
idUnique identifier for the tweet
poster_idID of the user who posted the tweet
textContent of the tweet
timestampTimestamp when the tweet was posted
is_noteIndicates if the tweet is a note
imagesImages associated with the tweet
videosVideos associated with the tweet
hashtagsHashtags used in the tweet
tagged_usersUsers tagged in the tweet
repliesNumber of replies to the tweet
repostsNumber of times the tweet has been reposted
likesNumber of likes the tweet has received
viewsNumber of views the tweet has received
quotesNumber of quotes of the tweet
bookmarksNumber of bookmarks of the tweet
quote_parent_idID of the parent tweet if this tweet is a quote
quote_poster_idID of the user who posted the parent tweet
reply_conversation_idID of the conversation this tweet is part of
reply_parent_idID of the parent tweet in the reply conversation
reply_poster_idID of the user who posted the parent tweet
retweet_parent_idID of the parent tweet if this tweet is a retweet
retweet_poster_idID of the user who posted the parent tweet

Utilities for the Twitter dataset

  1. Tweet Id to URL: https://twitter.com/i/web/status/{tweetId}
  2. Profile Id to URL: https://twitter.com/intent/user?user_id={userId}

CoinGecko

Historical and real-time data for all assets in the CoinGecko universe. This dataset is structured into two main tables: history and projects.

The history table contains historical data for cryptocurrencies, with the following columns:

Column NameDescription
COINGECKO_IDUnique identifier for the cryptocurrency
DATEDate of the recorded data
FDMCFully Diluted Market Cap
H24_VOLUME24-hour trading volume
MARKET_CAPMarket capitalization
PRICEPrice of the cryptocurrency
SYMBOLSymbol of the cryptocurrency

The projects table provides detailed information about cryptocurrency projects, with the following columns:

Column NameDescription
CIRCULATING_SUPPLYCurrent circulating supply of the cryptocurrency
COINGECKO_IDUnique identifier for the cryptocurrency
FDMCFully Diluted Market Cap
HIGH_24HHighest price in the last 24 hours
LOW_24HLowest price in the last 24 hours
MARKET_CAPMarket capitalization
MARKET_CAP_CHANGE_24HChange in market cap over the last 24 hours
MARKET_CAP_CHANGE_PERCENTAGE_24HPercentage change in market cap over the last 24 hours
MAX_SUPPLYMaximum supply of the cryptocurrency
NAMEName of the cryptocurrency project
PRICECurrent price of the cryptocurrency
PRICE_ATHAll-time high price
PRICE_ATH_CHANGE_PERCENTAGEPercentage change from all-time high price
PRICE_ATH_DATEDate of the all-time high price
PRICE_ATLAll-time low price
PRICE_ATL_CHANGE_PERCENTAGEPercentage change from all-time low price
PRICE_ATL_DATEDate of the all-time low price
PRICE_CHANGE_24HPrice change over the last 24 hours
PRICE_CHANGE_PERCENTAGE_24HPercentage change in price over the last 24 hours
SYMBOLSymbol of the cryptocurrency
TOTAL_SUPPLYTotal supply of the cryptocurrency
VOLUME_24H24-hour trading volume

DePIN Ninja

Contains project characteristics, financial metrics, and historical revenue data for all DePIN.Ninja datasets.

This dataset is structured into two main tables: projects and historical_revenue.

The projects table provides information about different projects, including their category, chain, and financial metrics.

Column NameDescription
projectIdUnique identifier for the project.
nameName of the project.
categoryCategory of the project (e.g., AI, COMPUTE, SENSORS, etc.).
chainBlockchain network on which the project operates (e.g., ETHEREUM, SOLANA).
arrAnnual Recurring Revenue (ARR) of the project.
mrrMonthly Recurring Revenue (MRR) of the project.
createdAtDate when the project was created.
Project IDNameCategoryChainARRMRRCreated At
wmad1sPAAL AIAIETHEREUM184634510051922025-01-23
2a1uk3VirtualAIBASE231378903160655102025-01-23

The historical_revenue table contains historical revenue data for various projects, allowing for analysis of revenue trends over time.

Column NameDescription
createdAtTimestamp of when the revenue data was recorded.
dateSpecific date for the revenue data.
nameName of the project associated with the revenue data.
projectIdUnique identifier for the project.
revenueRevenue generated by the project on the specified date.
Created AtDateNameProject IDRevenue
2025-01-232025-01-01PAAL AIwmad1s50000
2025-01-232025-01-01Virtual2a1uk375000

This dataset can be used for various analyses, including:

  • Financial performance evaluation of DePIN projects.
  • Trend analysis of revenue over time.
  • Comparison of projects across different categories and chains.

Slice Analytics

Contains user counts and scores of quality users for top DEXs, NFT exchange, and Perp DEX platforms.

It utilizes Slice’s proprietary methodology for segmenting users into quality groups.

Slice builds a score for each wallet based on life-time history of value, historic onchain volume, and fees paid

This dataset is structured into three main tables: dex_trader_aggregates, nft_trader_aggregates, and perps_trader_aggregates.

The dex_trader_aggregates table contains aggregated data for decentralized exchange traders, with the following columns:

Column NameDescription
avg_scoreAverage score of the traders
blockchainBlockchain platform used
idUnique identifier for the record
last_modifiedTimestamp of the last modification
projectName of the project
user_countTotal number of users
users_over_50Number of users over 50 score user quality
users_over_60Number of users over 60 score user quality
users_over_70Number of users over 70 score user quality
users_over_80Number of users over 80 score user quality
users_over_90Number of users over 90 score user quality
users_over_95Number of users over 95 score user quality

The nft_trader_aggregates table provides aggregated data for NFT traders, with the following columns:

Column NameDescription
avg_scoreAverage score of the traders
blockchainBlockchain platform used
idUnique identifier for the record
last_modifiedTimestamp of the last modification
projectName of the project
user_countTotal number of users
users_over_50Number of users over 50 score user quality
users_over_60Number of users over 60 score user quality
users_over_70Number of users over 70 score user quality
users_over_80Number of users over 80 score user quality
users_over_90Number of users over 90 score user quality
users_over_95Number of users over 95 score user quality

The perps_trader_aggregates table contains aggregated data for perpetual contract traders, with the following columns:

Column NameDescription
avg_scoreAverage score of the traders
blockchainBlockchain platform used
idUnique identifier for the record
last_modifiedTimestamp of the last modification
projectName of the project
user_countTotal number of users
users_over_50Number of users over 50 score user quality
users_over_60Number of users over 60 score user quality
users_over_70Number of users over 70 score user quality
users_over_80Number of users over 80 score user quality
users_over_90Number of users over 90 score user quality
users_over_95Number of users over 95 score user quality

This dataset is ideal for analyzing quality users. DAU (daily active users) is an incomplete metric that is easily botted - this dataset provides more nuance and can allow researchers to understand the quality of the composing users.