Simple LLM Evaluation

Welcome to the simple LLM evaluation framework—simpleval, for short.

simpleval is a Python package designed to make evaluating Large Language Models (LLMs) easier, using the "LLM as a Judge" technique.

It supports a variety of LLM providers, including OpenAI, Google (Gemini API, Vertex), AWS Bedrock, Anthropic, Azure, and more (via LiteLLM).

simpleval also includes several reports to help you analyze, compare, and summarize your evaluation results. See the available reports for more details.

Getting Started

See the 📚 Quickstart Guide 📚

Documentation

See 📚 Project Documentation 📚

Contributing

We appreciate your help in making this project better! ✨

If you would like to contribute to this project, please follow the guidelines outlined in the CONTRIBUTING.md file.

License

simpleval is released under the Apache License. See the LICENSE file for more details.

Contact

If you have any questions or suggestions, feel free to join our GitHub discussions forum 💬

If you want to report a bug or request a feature, please open an issue in the GitHub issues tracker 🐛

Name	Name	Last commit message	Last commit date
Latest commit History 166 Commits
.config	.config
.github	.github
ci	ci
docs	docs
reports-frontend	reports-frontend
simpleval	simpleval
tests	tests
tools	tools
.gitattributes	.gitattributes
.gitignore	.gitignore
CHANGELOG.md	CHANGELOG.md
CODEOWNERS	CODEOWNERS
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md
CONTRIBUTING.md	CONTRIBUTING.md
LICENSE	LICENSE
NOTICES.txt	NOTICES.txt
README.md	README.md
SECURITY.md	SECURITY.md
__init__.py	__init__.py
mkdocs.yml	mkdocs.yml
pyproject.toml	pyproject.toml
requirements.txt	requirements.txt
uv.lock	uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Simple LLM Evaluation

Getting Started

Documentation

Contributing

License

Contact

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors 5

Uh oh!

Languages

Search code, repositories, users, issues, pull requests...

License

cyberark/simple-llm-eval

Folders and files

Latest commit

History

Repository files navigation

Simple LLM Evaluation

Getting Started

Documentation

Contributing

License

Contact

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors 5

Uh oh!

Languages

Packages