Submission RuGPT3XL few-shot

Jan. 27, 2021, 12:11 p.m.

Team: SberDevices

Model url: https://huggingface.co/sberbank-ai/rugpt3xl


Total score: 0.535

Dataset Score Metric
LiDiRus 0.096 Matthew`s Corr
RCB 0.302 / 0.418 F1/Acc
PARus 0.676 Accuracy
MuSeRC 0.74 / 0.546 F1a/Em
TERRa 0.573 Accuracy
RUSSE 0.565 Accuracy
RWSD 0.649 Accuracy
DaNetQA 0.59 Accuracy
RuCoS 0.67 / 0.665 F1/EM
Model description:

This submission was made by Few-Shot methods only ruGPT-3 Xl - 1.3 billion parameters All tasks solved by ranging the perplexity of the answer variants Checkpoints and documentation can be found here: https://github.com/sberbank-ai/ru-gpts https://github.com/sberbank-ai/ru-gpts/tree/master/gw


Parameter description:

Diagnostic (Matthew`s Correlation): 0.096

Category Score
LOGIC -0.007250629987786855
KNOWLEDGE 0.059773386339694506
PREDICATE-ARGUMENT STRUCTURE 0.13997903121271693
LEXICAL SEMANTICS 0.18737835348183993
Lexical Semantics - Lexical Entailment 0.029950995334911953
Lexical Semantics - Morphological Negation 0.0
Lexical Semantics - Factivity 0.06900655593423542
Lexical Semantics - Symmetry/Collectivity 0.2921186973360886
Lexical Semantics - Redundancy 0.45652173913043476
Lexical Semantics - Named Entities 0.24253562503633297
Lexical Semantics - Quantifiers 0.20823613034492347
Predicate-Argument Structure Core Args 0.18433668044583404
Predicate-Argument Structure Prepositional Phrases 0.23123776351036351
Predicate-Argument Structure Ellipsis/Implicits 0.08554415042912275
Predicate-Argument Structure Anaphora/Coreference -0.01692566771665739
Predicate-Argument Structure Active/Passive 0.05353033790313108
Predicate-Argument Structure Nominalization 0.0
Predicate-Argument Structure Genitives/Partitives -0.25
Predicate-Argument Structure Datives 0.3563483225498992
Predicate-Argument Structure Relative Clauses 0.09759000729485333
Predicate-Argument Structure Coordination Scopes 0.35186577527449836
Predicate-Argument Structure Intersectivity 0.12100224017315647
Predicate-Argument Structure Restrictivity -0.1455213750217998
Logic Negation 0.0399615133948266
Logic Double Negation 0.21102672654165874
Logic Interval/Numbers -0.25225301366160086
Logic Conjuction -0.1084652289093281
Logic Disjunction 0.15291057030852148
Logic Conditionals -0.14285714285714285
Logic Universal 0.2548235957188128
Logic Existential -0.15724272550828775
Logic Temporal -0.13261933174731558
Logic Upward Monotone 0.10054155780296499
Logic Downward Monotone 0.03008284187980934
Logic Non-Monotonic -0.34151450937027694
Knowledge Common Sense -0.03115501519756839
Knowledge World Knowledge 0.14184850482497574

Performance:

Dataset Speed RAM
LiDiRus - -
RCB - -
PARus - -
MuSeRC - -
TERRa - -
RUSSE - -
RWSD - -
DaNetQA - -
RuCoS - -