9 октября 2023 г. 17:38
Команда Saiga team
Ссылка на модель https://huggingface.co/IlyaGusev/saiga_mistral_7b_lora
| Датасет | Результат | Метрика |
|---|---|---|
| LiDiRus | 0,322 | Кор, коэффициент Мэтью |
| RCB | 0,436 / 0,5 | F1/Точность |
| PARus | 0,698 | Точность |
| MuSeRC | 0,84 / 0,553 | F1a/Em |
| TERRa | 0,807 | Точность |
| RUSSE | 0,587 | Точность |
| RWSD | 0,727 | Точность |
| DaNetQA | 0,839 | Точность |
| RuCoS | 0,58 / 0,571 | F1/EM |
Saiga Mistral 7B zero-shot Mistral 7B (https://huggingface.co/mistralai/Mistral-7B-v0.1) was tuned on 5 Russian datasets: ru_turbo_saiga, ru_sharegpt_cleaned, oasst1_ru_main_branch, gpt_roleplay_realm, ru_instruct_gpt4 See more information on HuggingFace: https://huggingface.co/IlyaGusev/saiga_mistral_7b_lora Zero-shot evaluation script for the inference for Saiga Mistral model: https://github.com/IlyaGusev/rulm/blob/master/self_instruct/src/benchmarks/eval_zs_rsg.py
| Категория | Результат |
|---|---|
| LOGIC | 0,24068909266898375 |
| KNOWLEDGE | 0,3467006882891795 |
| PREDICATE-ARGUMENT STRUCTURE | 0,32835497921408513 |
| LEXICAL SEMANTICS | 0,38897213279554893 |
| Lexical Semantics - Lexical Entailment | 0,39133162093562956 |
|---|---|
| Lexical Semantics - Morphological Negation | 0,4816727030991569 |
| Lexical Semantics - Factivity | 0,16140004156875074 |
| Lexical Semantics - Symmetry/Collectivity | 0,43852900965351466 |
| Lexical Semantics - Redundancy | 0,6922186552431729 |
| Lexical Semantics - Named Entities | 0,4472135954999579 |
| Lexical Semantics - Quantifiers | 0,404651319111256 |
| Predicate-Argument Structure Core Args | 0,40987803063838396 |
| Predicate-Argument Structure Prepositional Phrases | 0,3020666145593326 |
| Predicate-Argument Structure Ellipsis/Implicits | 0,39148014634313566 |
| Predicate-Argument Structure Anaphora/Coreference | 0,33606722016672236 |
| Predicate-Argument Structure Active/Passive | 0,3244428422615251 |
| Predicate-Argument Structure Nominalization | 0,6255432421712243 |
| Predicate-Argument Structure Genitives/Partitives | 0,6666666666666666 |
| Predicate-Argument Structure Datives | 0,35043832202523123 |
| Predicate-Argument Structure Relative Clauses | 0,13912166872805048 |
| Predicate-Argument Structure Coordination Scopes | 0,0 |
| Predicate-Argument Structure Intersectivity | 0,32489314482696546 |
| Predicate-Argument Structure Restrictivity | 0,2748737083745107 |
| Logic Negation | 0,23037198846919968 |
| Logic Double Negation | 0,19312181983410703 |
| Logic Interval/Numbers | 0,033731512431528755 |
| Logic Conjuction | 0,25819888974716115 |
| Logic Disjunction | 0,2364331218717302 |
| Logic Conditionals | 0,3646984043128985 |
| Logic Universal | 0,15228622596829317 |
| Logic Existential | 0,24459979523511427 |
| Logic Temporal | 0,007053982594841415 |
| Logic Upward Monotone | 0,27640672769878033 |
| Logic Downward Monotone | 0,16238586255274184 |
| Logic Non-Monotonic | 0,18389242812245682 |
| Knowledge Common Sense | 0,3170077616914835 |
| Knowledge World Knowledge | 0,37712430952994663 |
| Датасет | Speed | RAM |
|---|---|---|
| LiDiRus | - | - |
| RCB | - | - |
| PARus | - | - |
| MuSeRC | - | - |
| TERRa | - | - |
| RUSSE | - | - |
| RWSD | - | - |
| DaNetQA | - | - |
| RuCoS | - | - |