Datasets

The datasets listed here are public collections within the Open Terms Archive federated ecosystem. Their quality depends on their maintainers, and can vary greatly from collections curated by full-time academic research teams to on-and-off volunteers. Please make sure to read the license and maintenance commitment of each collection before publishing work based on these datasets.

Generative AI

Most popular generative AI services
Services
19
Documents
55
Language
English
Jurisdictions
European Union, China

Platform Governance Archive

Major global social media services
Services
22
Documents
86
Language
English
Jurisdiction
European Union

P2B Compliance

Online intermediation services for businesses subject to the European platforms-to-businesses (“P2B” / 2019/1150) regulation
Services
208
Documents
253
Language
European Union
Jurisdiction
European Union

France Élections

Most used social media in France that could have a systemic impact on the 2022 elections
Services
5
Documents
65
Language
French
Jurisdiction
France

Dating

Online dating
Services
25
Documents
65
Language
English
Jurisdictions
European Union, Switzerland

France

Largest digital services used in France
Services
108
Documents
216
Language
French
Jurisdiction
France

Contrib

Documents added by volunteer contributors and historically imported from TOSBack.org
Services
322
Documents
683
Languages
English, French
Jurisdictions
European Union, United States
Volunteer contributors

Demo

Services needed to operate the Open Terms Archive engine
Services
4
Documents
9
Language
English
Jurisdiction
European Union

Third-party related datasets

The datasets below are independent from Open Terms Archive and are listed for researchers’ convenience. If you know of other relevant datasets, please add them.