KJR's internal LLM-testing framework.
For comprehensive documentation, see the API reference.
This framework facilitates the testing and evaluation of large language models (LLMs). It provides a structured approach to benchmarking and validating LLM performance across a variety of tasks and datasets.
- Comprehensive Benchmarking: Evaluate LLMs on a wide range of tasks.
- Customizable Test Suites: Easily create and run custom test suites.
- Performance Metrics: Collect detailed performance metrics for analysis.
- Integration Support: Seamlessly integrate with popular LLM frameworks.
Install the framework by running:

```bash
pip install git+https://github.com/KJR-AU/llm_test_framework
```
Then import the framework and set up your tests:

```python
import kjr_llm
from kjr_llm.app import App
from kjr_llm.targets import Target, LangChainTarget
from kjr_llm.tests.lib import Criminality
from your_app import your_chain
# Set up the test application
app = App(context=your_chain,
          app_name="llm-powered-autonomous-agents",
          reset_database=True)
# Define the target of our tests
target: Target = LangChainTarget(your_chain)
# Define and execute the tests
tests = [Criminality]
for test in tests:
    test.evaluate(target, app_id=f"{app.app_name}-{test.name}", default_provider="openai")
# Run the test dashboard to evaluate results
app.run_dashboard()
```
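The example above passes `default_provider="openai"`, so evaluation will typically call the OpenAI API. Below is a minimal sketch of the credential setup, assuming the provider reads the standard `OPENAI_API_KEY` environment variable; see the API reference for the exact configuration options.

```python
import os

# Assumption: the "openai" feedback provider reads the standard OpenAI
# credential from the environment; set it before running the tests.
os.environ.setdefault("OPENAI_API_KEY", "<your-openai-api-key>")
```

Once the tests have run, `app.run_dashboard()` serves a local dashboard where the recorded evaluation results can be inspected.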