Submission ruElectra-medium finetune

Feb. 3, 2023, 2:21 p.m.

Team: SberDevices

Model url: https://huggingface.co/sberbank-ai/ruElectra-medium


Total score: 0.524

Dataset Score Metric
LiDiRus 0.182 Matthews Corr
RCB 0.413 / 0.525 F1/Acc
PARus 0.576 Accuracy
MuSeRC 0.615 / 0.189 F1a/EM
TERRa 0.544 Accuracy
RUSSE 0.649 Accuracy
RWSD 0.669 Accuracy
DaNetQA 0.6 Accuracy
RuCoS 0.63 / 0.624 F1/EM
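
The page does not say how the total score is derived from the per-dataset scores. The sketch below assumes it is the unweighted mean over the nine datasets, with each two-metric dataset first reduced to the mean of its two metrics. Under that assumption the scores listed above reproduce the reported 0.524. The function name `total_score` is illustrative and not taken from the scoring pipeline.

```python
# Scores copied from the table above; two-metric datasets are stored as tuples.
scores = {
    "LiDiRus": (0.182,),
    "RCB": (0.413, 0.525),
    "PARus": (0.576,),
    "MuSeRC": (0.615, 0.189),
    "TERRa": (0.544,),
    "RUSSE": (0.649,),
    "RWSD": (0.669,),
    "DaNetQA": (0.6,),
    "RuCoS": (0.63, 0.624),
}


def total_score(task_scores):
    """Assumed aggregation: mean over datasets of each dataset's mean metric."""
    per_task = [sum(metrics) / len(metrics) for metrics in task_scores.values()]
    return sum(per_task) / len(per_task)


print(round(total_score(scores), 3))  # 0.524
```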

Model description:

ruElectra-medium is a model of the critic encoder class, pretrained with Google's ELECTRA code. It has 12 layers and a hidden size of 576. It was trained on a 100 GB Russian-language corpus, the same dataset used for the sbert_large_mt_nlu_ru models, and uses a WordPiece tokenizer. We use this model as a reward-critic model for RLHF and for black-box attacks. Source data for training: Taiga, Lenta, OpenSubtitles, Wiki, etc., all in Russian. Scoring pipeline: https://github.com/ai-forever/rsg-baselines


Parameter description:

Diagnostic (Matthews Correlation): 0.182

Category Score
LOGIC 0.13877200806086512
KNOWLEDGE 0.2542761568913483
PREDICATE-ARGUMENT STRUCTURE 0.16065050857951518
LEXICAL SEMANTICS 0.21126894813889507
Lexical Semantics - Lexical Entailment 0.05541499624957016
Lexical Semantics - Morphological Negation 0.05143444998736397
Lexical Semantics - Factivity 0.32387513781564786
Lexical Semantics - Symmetry/Collectivity -0.271746488194703
Lexical Semantics - Redundancy 0.33267391956523024
Lexical Semantics - Named Entities 0.35355339059327373
Lexical Semantics - Quantifiers 0.32360209754104324
Predicate-Argument Structure Core Args 0.19599157740244455
Predicate-Argument Structure Prepositional Phrases 0.29310451774759705
Predicate-Argument Structure Ellipsis/Implicits 0.07042952122737638
Predicate-Argument Structure Anaphora/Coreference 0.2811258418541589
Predicate-Argument Structure Active/Passive 0.018620327436868072
Predicate-Argument Structure Nominalization 0.09607689228305229
Predicate-Argument Structure Genitives/Partitives 0.050251890762960605
Predicate-Argument Structure Datives 0.2182178902359924
Predicate-Argument Structure Relative Clauses -0.01642880193633814
Predicate-Argument Structure Coordination Scopes 0.12087912087912088
Predicate-Argument Structure Intersectivity 0.2724196464492864
Predicate-Argument Structure Restrictivity -0.2958081738859997
Logic Negation -0.004593297481561526
Logic Double Negation 0.2119213177352503
Logic Interval/Numbers -0.040522044923655395
Logic Conjunction 0.08944271909999159
Logic Disjunction 0.09855138041620624
Logic Conditionals 0.1111111111111111
Logic Universal 0.4029114820126901
Logic Existential 0.2058790548922549
Logic Temporal 0.15244937348544793
Logic Upward Monotone 0.05923488777590923
Logic Downward Monotone 0.014678923792502562
Logic Non-Monotonic 0.18917776913478015
Knowledge Common Sense 0.26383624620352564
Knowledge World Knowledge 0.22110440420299576
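
The category scores above are Matthews correlation coefficients, which range from -1 to 1, with 0 meaning no better than chance. That is why some categories (for example Symmetry/Collectivity and Restrictivity) can be negative. As a minimal sketch, assuming binary labels encoded as 0 and 1, the coefficient can be computed from the confusion-matrix counts as shown below. The function name `matthews_corr` is illustrative and this is not the code used by the scoring pipeline.

```python
import math


def matthews_corr(y_true, y_pred):
    """Matthews correlation coefficient for binary labels encoded as 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when a row or column of the confusion matrix is empty.
    return (tp * tn - fp * fn) / denom if denom else 0.0


gold = [1, 0, 1, 0]
print(matthews_corr(gold, [1, 0, 1, 0]))  # perfect agreement
print(matthews_corr(gold, [0, 1, 0, 1]))  # fully inverted predictions
print(matthews_corr(gold, [1, 1, 1, 1]))  # constant prediction
```

A perfect prediction gives 1, a fully inverted one gives -1, and a constant prediction gives 0, so a negative category score indicates predictions that agree with the gold labels less often than chance would.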

Performance:

Dataset Speed RAM
LiDiRus - -
RCB - -
PARus - -
MuSeRC - -
TERRa - -
RUSSE - -
RWSD - -
DaNetQA - -
RuCoS - -