openml-python-dev/examples/Advanced/suites_tutorial.py at develop · staru09/openml-python-dev

History

92 lines (68 loc) · 2.09 KB

Raw

# %% [markdown]

# How to list, download and upload benchmark suites.

# %%

import uuid

import numpy as np

import openml

# %% [markdown]

# ## Listing suites

# * Use the output_format parameter to select output type

# * Default gives ``dict``, but we'll use ``dataframe`` to obtain an

# easier-to-work-with data structure

# %%

suites = openml.study.list_suites(status="all")

print(suites.head(n=10))

# %% [markdown]

# ## Downloading suites

# This is done based on the dataset ID.

# %%

suite = openml.study.get_suite(99)

print(suite)

# %% [markdown]

# Suites also feature a description:

# %%

print(suite.description)

# %% [markdown]

# Suites are a container for tasks:

# %%

print(suite.tasks)

# %% [markdown]

# And we can use the task listing functionality to learn more about them:

# %%

tasks = openml.tasks.list_tasks()

# %% [markdown]

# Using ``@`` in

# [pd.DataFrame.query](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.query.html)

# accesses variables outside of the current dataframe.

# %%

tasks = tasks.query("tid in @suite.tasks")

print(tasks.describe().transpose())

# %% [markdown]

# We'll use the test server for the rest of this tutorial.

# %%

openml.config.start_using_configuration_for_example()

# %% [markdown]

# ## Uploading suites

# Uploading suites is as simple as uploading any kind of other OpenML

# entity - the only reason why we need so much code in this example is

# because we upload some random data.

# We'll take a random subset of at least ten tasks of all available tasks on

# the test server:

# %%

all_tasks = list(openml.tasks.list_tasks()["tid"])

task_ids_for_suite = sorted(np.random.choice(all_tasks, replace=False, size=20))

# The study needs a machine-readable and unique alias. To obtain this,

# we simply generate a random uuid.

alias = uuid.uuid4().hex

new_suite = openml.study.create_benchmark_suite(

name="Test-Suite",

description="Test suite for the Python tutorial on benchmark suites",

task_ids=task_ids_for_suite,

alias=alias,

)

new_suite.publish()

print(new_suite)

# %%

openml.config.stop_using_configuration_for_example()

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expand file tree

Search code, repositories, users, issues, pull requests...

FilesExpand file tree

suites_tutorial.py

Latest commit

History

suites_tutorial.py

File metadata and controls

Expand file tree