Wals Roberta Sets 37-70.zip -

: Ordinal (53A) and distributive (54A) numerals, and numeral classifiers (55A). Nominal Syntax (Chapters 58–64) :

: Perfective/imperfective aspect (65A), past tense (66A), future tense (67A), and the perfect (68A). WALS roberta sets 37-70.zip

: Position of tense-aspect affixes (69A) and the morphological imperative (70A). Use Cases for the Dataset : Ordinal (53A) and distributive (54A) numerals, and

: Inclusive/exclusive distinctions (39A–40A), distance contrasts in demonstratives (41A), and third-person pronouns (43A). Use Cases for the Dataset : Inclusive/exclusive distinctions

: Using the WALS database features as labels to see if a model's internal representations (embeddings) cluster according to known linguistic traits, such as whether a language uses definite articles.

: Definite (37A) and Indefinite (38A) article systems.

The "RoBERTa" designation suggests this data has been pre-processed or formatted for use with the (Robustly Optimized BERT Pretraining Approach) large language model, likely for tasks like cross-lingual transfer or testing a model's metalinguistic knowledge. Included Linguistic Features (Chapters 37–70)

Наверх