Submission RuLeanALBERT

Sept. 13, 2022, 8:59 a.m.

Team: Yandex Research

Model url: https://huggingface.co/yandex/RuLeanALBERT


Total score: 0.698

Dataset Score Metric
LiDiRus 0.403 Matthew`s Corr
RCB 0.361 / 0.413 F1/Acc
PARus 0.796 Accuracy
MuSeRC 0.874 / 0.654 F1a/Em
TERRa 0.812 Accuracy
RUSSE 0.789 Accuracy
RWSD 0.669 Accuracy
DaNetQA 0.76 Accuracy
RuCoS 0.9 / 0.902 F1/EM
Model description:

RuLeanALBERT is a 2.9B Transformer encoder for Russian language trained on a mixture of the YaLM corpus, Taiga, RDT, and Wikipedia. It has several architectural differences from regular Transformers, namely weight sharing, rotary positional embeddings, GeGLU activations, and pre-normalization. Repo: https://github.com/yandex-research/RuLeanALBERT Habr: https://habr.com/ru/company/yandex/blog/688234/


Parameter description:

Diagnostic (Matthew`s Correlation): 0.403

Category Score
LOGIC 0.25766842280097046
KNOWLEDGE 0.33438659081581373
PREDICATE-ARGUMENT STRUCTURE 0.3900261443729555
LEXICAL SEMANTICS 0.5105617927842396
Lexical Semantics - Lexical Entailment 0.45392064950160177
Lexical Semantics - Morphological Negation 0.45175395145262565
Lexical Semantics - Factivity 0.43381600524612796
Lexical Semantics - Symmetry/Collectivity 0.43852900965351466
Lexical Semantics - Redundancy 0.5897678246195885
Lexical Semantics - Named Entities 0.5590169943749475
Lexical Semantics - Quantifiers 0.5348810406899015
Predicate-Argument Structure Core Args 0.5070925528371099
Predicate-Argument Structure Prepositional Phrases 0.6715144652340956
Predicate-Argument Structure Ellipsis/Implicits 0.3263956049169334
Predicate-Argument Structure Anaphora/Coreference 0.15646620130993677
Predicate-Argument Structure Active/Passive 0.4083133966424866
Predicate-Argument Structure Nominalization 0.4445906192369001
Predicate-Argument Structure Genitives/Partitives 0.4900980294098034
Predicate-Argument Structure Datives 0.49099025303098287
Predicate-Argument Structure Relative Clauses 0.2537340189666186
Predicate-Argument Structure Coordination Scopes 0.42365927286816174
Predicate-Argument Structure Intersectivity 0.34502923976608185
Predicate-Argument Structure Restrictivity -0.014760248092334923
Logic Negation 0.11177050727031347
Logic Double Negation 0.20759971844307282
Logic Interval/Numbers 0.08877935232369505
Logic Conjuction 0.5270462766947299
Logic Disjunction 0.1396215401769611
Logic Conditionals 0.1972421118046462
Logic Universal 0.6700593942604899
Logic Existential 0.25677629550654774
Logic Temporal 0.10776235844876533
Logic Upward Monotone 0.6701754385964912
Logic Downward Monotone -0.3212669683257919
Logic Non-Monotonic 0.1286978904175574
Knowledge Common Sense 0.44831667596163405
Knowledge World Knowledge 0.20313571895099006

Performance:

Dataset Speed RAM
LiDiRus - -
RCB - -
PARus - -
MuSeRC - -
TERRa - -
RUSSE - -
RWSD - -
DaNetQA - -
RuCoS - -