Components is a research project that assembles, investigates, and editorializes large datasets. We make most of the data collected for our research freely available on our Datasets page.

You can sign up for our newsletter here, and you can also join our very nascent Discord server. For any correspondence, contact mail@components.one.


  • Andrew Thompson

  • Kyle Paoletta

  • Jules Becker

  • Guido Flichman

Press and studies

A Dataset for The Study of Online Radicalization Through Incel Forum Archives - Journal of Quantitative Description: Digital Media, 4/12/2024

The Many Lives of ‘Sounds of North American Frogs’ - Atlas Obscura, 1/23/2024

Why did the metaverse die? Because Silicon Valley doesn’t understand the concept of fun - Fast Company, 10/15/2023

How Bandcamp makes more money than Spotify - Fast Company, 10/5/2023

Text-to-Image Generation Tools: A Survey and NSFW Content Analysis - Companion Proceedings of the Brazilian Symposium on Multmedia and the Web, 10/23/2023

Billboard 200: The Lessons of Musical Success in the US - Music & Science, 7/12/2023

Moral Framing of Mental Health Discourse and Its Relationship to Stigma: A Comparison of Social Media and News - Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 4/23/2023

Goal Driven Discovery of Distributional Differences via Language Descriptions - Advances in Neural Processing Information Systems 36, 2/23/2023

Using word embedding models to capture changing media discourses - Journal of Computational Social Science, 9/16/2022

COVID-19 and Changing Values - Values for a Post-Pandemic Future, 9/14/2022

Bandcamp und Epic Games: Ein halbes Jahr später – kritische Betrachtung - DJ Lab, 9/2/2022

Lumen: A Machine Learning Framework to Expose Influence Cues in Text - Frontiers in Computer Science, 8/3/2021

Gender bias recognition in political news articles - Machine Learning with Applications, 6/15/2022

The Future of Streaming Services May Be In The Past - The New Inquiry, 6/2/2022

Mining for Fake News - Advanced Information Networking and Applications, 3/31/2022

What Is the Future of Digital Music After Bandcamp & Epic Games Acquisition? - Remezcla, 3/30/2022

What does Epic Games buying Bandcamp mean for DIY music? - Resident Advisor, 3/16/2022

The Best Online Articles of 2021 - Ted Goia, 12/16/2021

L'insatiable appétit des "Tech review" pour le design pornographique - Mais où va le Web?, 11/24/2021

Cultural cartography with word embeddings - Poetics, October 2021

L’insatiable appétit des « Tech review » pour le design pornographique - Mais Óu Va Le Web?, 7/2/2021

What Spotify Follower Ratio Tells Us About Artist Growth and Fan Engagement - Chartmetric, 6/1/2021

Tous les genres musicaux et leur répartition sur la planète, dans une carte interactive - tsugi, 1/22/2021

Disinformation: analysis and identification - Computational and Mathematical Organization Theory, 06/18/2021

Bandcamp a créé une carte interactive regroupant tous les genres musicaux de chaque ville - Trax, 01/25/2021

The disturbing belly of the 'step' porn trend - Mashable, 8/10/2020

Questa estensione vuole fottere l’algoritmo dei suggeriti di Pornhub - Vice Italy, 12/20/2019

Building a Topic Modeling Pipeline with spaCy and Gensim - Towards Data Science, 9/17/2019

Trump et Twitter, c'est 24 heures sur 24 - Le Soir, 1/17/2019

GraphBTM: Graph Enhanced Autoencoded Variational Inference for Biterm Topic Model - Conference on Empirical Methods in Natural Language Processing, 1/1/2018