Components is a publication and research project that assembles, investigates, and editorializes large datasets. We make most of the data we use in our research freely available. You can see what’s available on our Datasets page.

For any correspondence, contact mail@components.one.


  • Andrew Thompson

  • Kyle Paoletta

  • Jules Becker

  • Guido Flichman

Press and studies

Gender bias recognition in political news articles - Machine Learning with Applications, 06/15/2022

The Future of Streaming Services May Be In The Past - The New Inquiry, 6/2/2022

Mining for Fake News - Advanced Information Networking and Applications, 03/31/2022

What Is the Future of Digital Music After Bandcamp & Epic Games Acquisition? - Remezcla, 03/30/2022

What does Epic Games buying Bandcamp mean for DIY music? - Resident Advisor, 3/16/2022

The Best Online Articles of 2021 - Ted Gioia, 12/16/2021

L'insatiable appétit des "Tech review" pour le design pornographique - Mais où va le Web?, 11/24/2021

Cultural cartography with word embeddings - Poetics, October 2021

What Spotify Follower Ratio Tells Us About Artist Growth and Fan Engagement - Chartmetric, 6/1/2021

Tous les genres musicaux et leur répartition sur la planète, dans une carte interactive - tsugi, 1/22/2021

Disinformation: analysis and identification - Computational and Mathematical Organization Theory, 06/18/2021

Bandcamp a créé une carte interactive regroupant tous les genres musicaux de chaque ville - Trax, 01/25/2021

The disturbing belly of the 'step' porn trend - Mashable, 8/10/2020

Questa estensione vuole fottere l’algoritmo dei suggeriti di Pornhub - Vice Italy, 12/20/2019

Building a Topic Modeling Pipeline with spaCy and Gensim - Towards Data Science, 9/17/2019

Trump et Twitter, c'est 24 heures sur 24 - Le Soir, 1/17/2019

GraphBTM: Graph Enhanced Autoencoded Variational Inference for Biterm Topic Model - Conference on Empirical Methods in Natural Language Processing, 01/01/2018