Datasets for Programmatic SEO

Are you looking to build, or improve, a programmatic SEO site in the social media niche?

If so, today is your lucky day! We've put together the 10 best social media datasets for programmatic SEO (most of them are free) that you can download and use for your own projects.

Let's jump in.

10 Useful Social Media Datasets for pSEO

Along with a brief description of each dataset, we've included the format(s) it is available in.

1. Social Media Influencers

Available format(s): CSV

A dataset containing the top 1,000 social media influencers from Instagram, YouTube, and TikTok, each with their number of followers and other relevant information.

2. TwineSocial

Available format(s): JSON

TwineSocial allows you to find and access content from multiple social media networks, like Twitter, Instagram, Facebook, Vine, Tumblr, Flickr, and Google+ by using hashtags, account handles, and geo-location. It offers a high-performance, scalable interface with server-side rules and a moderation feature.

3. Emoji Dictionary with R Encodings and Image Files

Available format(s): XLS

A dataset of emojis from Unicode 10.0 with R encodings, Unicode categories, subcategories, and Emojipedia names along with corresponding image files. 2,624 rows in total.

4. Instagram

Available format(s): JSON

The Instagram dataset contains basic metadata from Instagram user, hashtag, and location feed pages, along with comments, the people who liked specific posts, and the followers and followings of a given username.

5. Influencer Search

Available format(s): JSON

A dataset that provides information on influencers through the Social Animal Influencer Search API, including data on Twitter profiles, top authors, and best sharers of content for a specific query, with options to sort by followers, number of tweets, location, and type of influencer.

6. Usage of social media by students between age 17-22

Available format(s): XLS

A dataset of students between the ages of 17 and 22, including their age, preferred social media platforms, daily usage time, physical activity time, and perception of exposure to inappropriate content on those platforms. A timestamp is also included.

7. Social Networks Global Coverage - Account, Business & Non-business

Available format(s): CSV, JSON, XLS

A dataset of 514 million records of social media accounts from 249 countries with various data points such as followers, profile type, engagement score, location, external links and more. Can be filtered by geography, account type, brand affiliation, hashtags and more.

8. LinkedIn data for 24Million companies

Available format(s): JSON

A dataset of 24 million companies, including company name, country, size, headquarters, website, followers, industry, employees, employees on LinkedIn, about, and founded information.

9. Tagdef

Available format(s): JSON

The Tagdef dataset is a large hashtag dictionary containing over 60,000 user-generated definitions for hashtags commonly used on Twitter, Pinterest, and Google+.

10. Twitter Celebrity Tweets And Embeddings

Available format(s): CSV

The Twitter Celebrity Tweets And Embeddings dataset contains tweets and embeddings from the top 1,000 celebrity Twitter accounts.
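
If you're wondering how a CSV dataset like the ones above might turn into pages, here is a minimal Python sketch. It is only an illustration: the sample rows and the column names (`name`, `platform`, `followers`) are made up, and the real datasets will almost certainly use different headers, so check the file you download first. The `slugify` and `build_pages` helpers are hypothetical names as well.

```python
import csv
import io
import re

# Hypothetical sample standing in for a downloaded CSV file. The column
# names ("name", "platform", "followers") are assumptions, not the real schema.
SAMPLE_CSV = """name,platform,followers
Jane Doe,Instagram,1200000
DJ Example,TikTok,850000
"""


def slugify(text):
    """Lowercase the text and collapse non-alphanumeric runs into hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", text.lower()).strip("-")


def build_pages(csv_text):
    """Turn each CSV row into a dict holding a URL slug and a page title."""
    pages = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        followers = int(row["followers"])
        pages.append({
            "slug": slugify(f"{row['name']} {row['platform']} stats"),
            "title": f"{row['name']} on {row['platform']}: {followers:,} followers",
        })
    return pages


pages = build_pages(SAMPLE_CSV)
for page in pages:
    print(page["slug"], "|", page["title"])
```

Each row becomes one page record, which you could then feed into whatever template or no-code tool you use to publish. For a real file, you would swap `io.StringIO(csv_text)` for an opened file handle.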

And that's it! I hope you found this list of social media datasets useful. If you do use any of them, let me know on Twitter.

Still on the hunt for datasets? Here are some more that might give you shiny object syndrome!

More pSEO datasets

Sports, Movies, Crime, Cell Phones, Books, Salary, Tourism, Cars, Music, Stocks, Quotes, Social Media, NBA, Electric Vehicles