diff --git a/tests/evaluation/non_english_results.md b/tests/evaluation/non_english_results.md new file mode 100644 index 00000000..b92998d6 --- /dev/null +++ b/tests/evaluation/non_english_results.md @@ -0,0 +1,27 @@ +# Non-English Evaluation Results for Gemma 3n (E2B / E4B) + +## Spanish + +**Prompt:** +Hola, ¿cómo estás? + +**Observed Behavior:** +The model struggles to maintain conversational consistency. Some responses are partially correct, but grammar and context degrade quickly. + +--- + +## Russian + +**Prompt:** +Привет, как дела? + +**Observed Behavior:** +The model shows noticeable grammatical errors and sometimes produces repetitive or irrelevant output. + +--- + +## Notes + +- No model weights or parameters were modified. +- These results are observational and meant to highlight current multilingual limitations. +- Intended to support Issue #460 and guide future improvements.