Fig. 3

Distribution of the number of papers per NLP related tag category: (a) dataset language (b) dataset type (c) NLP task (d) model type.Values that fell below the 5% threshold were aggregated into “Other” category for the purposes of analysis, except for dataset language, where we display the top three dataset languages