Datasets

The datasets listed here are public collections within the Open Terms Archive federated ecosystem. Their quality depends on their maintainers, and can vary greatly from collections curated by full-time academic research teams to on-and-off volunteers. Please make sure to read the license and maintenance commitment of each collection before publishing work based on these datasets.

Generative AI

Most popular generative AI services

Services
18
Documents
52
Language
English
Jurisdictions
European Union, China

Platform Governance Archive

Major global social media services

Services
22
Documents
86
Language
English
Jurisdiction
European Union

P2B Compliance

Online intermediation services for businesses subject to the European platforms-to-businesses (“P2B” / 2019/1150) regulation

Services
208
Documents
253
Language
European Union
Jurisdiction
European Union

France Élections

Most used social media in France that could have a systemic impact on the 2022 elections

Services
5
Documents
65
Language
French
Jurisdiction
France

Dating

Online dating

Services
25
Documents
65
Language
English
Jurisdictions
European Union, Switzerland

France

Largest digital services used in France

Services
108
Documents
216
Language
French
Jurisdiction
France

Contrib

Documents added by volunteer contributors and historically imported from TOSBack.org

Services
322
Documents
683
Languages
English, French
Jurisdictions
European Union, United States
Volunteer contributors

Demo

Services needed to operate the Open Terms Archive engine

Services
4
Documents
9
Language
English
Jurisdiction
European Union

Third-party related datasets

The datasets below are independent from Open Terms Archive and are listed for researchers’ convenience. If you know of other relevant datasets, please add them.